Skip to main content

Grok 2.0 takes the guardrails off AI image generation

Elon Musk’s xAI company has released two updated iterations of its Grok chatbot model, Grok-2 and Grok-2 mini. They promise improved performance over their predecessor, as well as new image-generation capabilities that will enable X (formerly Twitter) users to create AI imagery directly on the social media platform.

“We are excited to release an early preview of Grok-2, a significant step forward from our previous model, Grok-1.5, featuring frontier capabilities in chat, coding, and reasoning. At the same time, we are introducing Grok-2 mini, a small but capable sibling of Grok-2. An early version of Grok-2 has been tested on the LMSYS leaderboard under the name ‘sus-column-r,’” xAI wrote in a recent blog post. The new models are currently in beta and reserved for Premium and Premium+ subscribers, though the company plans to make them available through its Enterprise API later in the month.

The image-generation feature appears to be powered by the Flux.1 model developed by Black Forest Labs. While virtually every other image-generation system on the market — whether that’s OpenAI’s Dall-E, StableDiffusion, or Adobe’s Firefly — has guardrails to prevent users from misusing them to generate racist, bigoted, or violent content (especially when featuring celebrities, politicians, and other public figures), Grok-2 apparently does not.

One early user declared, “grok 2.0 image generation is better than llama’s and has no dumb guardrails” while posting images of Meta CEO Mark Zuckerberg and xAI CEO Elon Musk boxing, as well as Donald Trump wearing a turban.

“Grok 2.0 will do political illustrations and real people, while ChatGPT refuses. This instantly makes Grok 10x more fun……” another user argued.

Grok 2.0 will do political illustrations and real people, while ChatGPT refuses.

This instantly makes Grok 10x more fun…… pic.twitter.com/yDBJO0jWba

— Benjamin De Kraker 🏴‍☠️ (@BenjaminDEKR) August 14, 2024

This new feature will surely prove a boon to internet trolls and, given the highly contentious presidential election slated for November (one of 50 national elections being held across the globe this year), it will likely aid in misinformation efforts across social media as well.

Andrew Tarantola
Andrew has spent more than a decade reporting on emerging technologies ranging from robotics and machine learning to space…
The best free AI video generators you can try out today
dogs running and melting

The days of needing extensive editing experience and a hefty budget just to make a professional-quality video are over. A new wave of AI-powered video generators has arrived, empowering anyone with a laptop and internet connection to craft stunning and engaging videos in just a few clicks.
Leading the way is OpenAI's Sora, an AI capable of generating minutes' worth of photorealistic video in moments -- or it will when it's actually released to the public. Until then, you can try out any of these innovative AI tools for free and easily turn your cinematic ideas into reality. At the very least, they're pretty fun to play around with.

Haiper 1.5

Read more
Meta’s new AI model can turn text into 3D images in under a minute
an array of 3D generated images made by Meta 3D Gen

Meta's latest foray into AI image generation is a quick one. The company introduced its new "3D Gen" model on Tuesday, a "state-of-the-art, fast pipeline" for transforming input text into high-fidelity 3D images that can output them in under a minute.

What's more, the system is reportedly able to apply new textures and skins to both generated and artist-produced images using text prompts.

Read more
Google’s new AI generates audio soundtracks from pixels
An AI generated wolf howling

Deep Mind showed off the latest results from its generative AI video-to-audio research on Tuesday. It's a novel system that combines what it sees on-screen with the user's written prompt to create synced audio soundscapes for a given video clip.

The V2A AI can be paired with vide -generation models like Veo, Deep Mind's generative audio team wrote in a blog post, and can create soundtracks, sound effects, and even dialogue for the on-screen action. What's more, Deep Mind claims that its new system can generate "an unlimited number of soundtracks for any video input" by tuning the model with positive and negative prompts that encourage or discourage the use of a particular sound, respectively.

Read more