Skip to main content
  1. Home
  2. Computing
  3. News

Nvidia just released an open-source LLM to rival GPT-4

Add as a preferred source on Google
Nvidia CEO Jensen in front of a background.
Nvidia

Nvidia, which builds some of the most highly sought-after GPUs in the AI industry, has announced that it has released an open-source large language model that reportedly performs on par with leading proprietary models from OpenAI, Anthropic, Meta, and Google.

The company introduced its new NVLM 1.0 family in a recently released white paper, and it’s spearheaded by the 72 billion-parameter NVLM-D-72B model. “We introduce NVLM 1.0, a family of frontier-class multimodal large language models that achieve state-of-the-art results on vision-language tasks, rivaling the leading proprietary models (e.g., GPT-4o) and open-access models,” the researchers wrote.

Recommended Videos

Introducing NVLM 1.0, a family of frontier-class multimodal LLMs that achieve state-of-the-art results on vision-language tasks, rivaling the leading proprietary models (e.g., GPT-4o) and open-access models (e.g., InternVL 2).
Remarkably, NVLM 1.0 shows improved text-only… pic.twitter.com/yKGyOqHnsp

— Wei Ping (@_weiping) September 18, 2024

The new model family is reportedly already capable of “production-grade multimodality,” with exceptional performance across a variety of vision and language tasks, in addition to improved text-based responses compared to the base LLM that the NVLM family is based on. “To achieve this, we craft and integrate a high-quality text-only dataset into multimodal training, alongside a substantial amount of multimodal math and reasoning data, leading to enhanced math and coding capabilities across modalities,” the researchers explained.

The result is an LLM that can just as easily explain why a meme is funny as it can solve complex mathematics equations, step by step. Nvidia also managed to increase the model’s text-only accuracy by an average of 4.3 points across common industry benchmarks, thanks to its multimodal training style.

screenshot of the NVLM white paper explaining the process of explaining why a meme is funny
Nvidia

Nvidia appears serious about ensuring that this model meets the Open Source Initiative’s newest definition of “open source” by not only making its training weights available for public review, but also promising to release the model’s source code in the near future. This is a marked departure from the actions of rivals like OpenAI and Google, who jealously guard the details of their LLMs’ weights and source code. In doing so, Nvidia has positioned the NVLM family to not necessarily compete directly against ChatGPT-4o and Gemini 1.5 Pro, but rather serve as a foundation for third-party developers to build their own chatbots and AI applications.

Andrew Tarantola
Former Computing Writer
Andrew Tarantola is a journalist with more than a decade reporting on emerging technologies ranging from robotics and machine…
A YouTuber 3D printed an entire outfit, but the comfort and cost are more complicated than you’d think
The 3D-printed outfit is real. Whether it's practical is a different conversation entirely.
Adult, Male, Man

YouTuber Matthew Trahan has made a career out of 3D printing increasingly unusual things. He has printed musical instruments, bedroom furniture, and, in one particularly memorable video, himself.

His latest project is a full outfit, from shirt to shoes, belt to glasses, because apparently nobody told him 3D printers are for creating engineering prototypes or structures that aren’t otherwise feasible, not for fashion week.

Read more
The memory crisis isn’t going to ease, and you will pay the price for it, says a research firm
Forty to 50% higher this quarter, 30 to 40% more next quarter, and no real relief until 2028. Plan accordingly.
RAM memory chips

If you were hoping the memory crisis was about to ease up, I have some bad news for you. It comes directly from Wall Street.

Your next smartphone, laptop, or tablet could cost even more, regardless of whether it has recently been subject to a price hike.

Read more
Apple’s next Mac Studio could get a new M5 Ultra chip and a cooler upgrade
The desktop workstation is tipped to receive an M5 Ultra this year, an M7 Ultra later, and a redesigned heat sink.
Apple Mac Studio Featured

Apple's Mac Studio may not be getting a fresh new look anytime soon, but it could be getting a meaningful upgrade where it matters most. According to Mark Gurman in the latest edition of his Power On newsletter, Apple is preparing an M5 Ultra-powered Mac Studio as early as this year, while an even more powerful M7 Ultra version is already on the company's roadmap for 2028. Interestingly, the report also claims Apple is redesigning one component most users will never see: the heat sink.

More power is coming, and Apple wants to keep it cool

Read more