Baidu ERNIE: how the new multimodal AI surpassed GPT and Gemini

Author: Redakcija
event 13.11.2025.
Foto: Shutterstock

Chinese tech giant Baidu has unveiled ERNIE 4.5 VL 28B A3B Thinking — a multimodal artificial intelligence model that reportedly outperforms leading systems like GPT-5 and Gemini 2.5 Pro in complex tasks involving text, images and data visualization. Baidu claims its mixture-of-experts architecture enables exceptional results while using only a fraction of its total parameters. The company has also open-sourced the model under the Apache 2.0 license, encouraging broad adoption and industry collaboration.

Baidu’s new ERNIE model represents an important milestone in the global race for AI leadership. Unlike many Western systems focused on narrow applications, ERNIE is designed as a truly multimodal platform, capable of understanding, analyzing and combining text, images, graphs, and even complex data tables into a unified context. Benchmark tests show that the model surpasses GPT and Gemini in tasks that measure the ability to interpret and connect visual information with linguistic data.

According to Baidu, ERNIE leverages an innovative mixture-of-experts approach: instead of activating all parameters at once, the system dynamically engages only the modules most relevant to a given task. This significantly reduces energy consumption and accelerates processing time, addressing one of the key challenges in modern AI development. Furthermore, by open-sourcing the model under Apache 2.0, Baidu allows researchers and companies worldwide to freely use, modify and build upon the system without restrictions.

This level of openness could reshape the competitive landscape. While Western tech giants often guard their model architectures and training data, China’s approach promotes transparency and collaboration. In doing so, Baidu not only demonstrates technical excellence but also positions China as a major force in advancing open and accessible artificial intelligence.

Ultimately, ERNIE’s success in benchmark tests represents more than just a technical win over GPT or Gemini — it signals a broader shift in the global AI dynamic. Open, efficient and versatile systems may define the next phase of AI evolution, where the speed of innovation and collaboration outweighs the mere size of a model. If Baidu continues at this pace, the balance of power in artificial intelligence could soon look very different.

Comments

Zainteresirani ste za jedan od treninga?

Ispunite prijavu i javit ćemo Vam se u najkraćem mogućem roku!

Markoja d.o.o.
Selska cesta 93
OIB: 10585552225

    Ispunite prijavu i javit ćemo Vam se u najkraćem mogućem roku!



    All news

    Podržava