Skip to main content

OpenAI’s Advanced Voice Mode can now see your screen and analyze videos

Advanced Santa voice mode
OpenAI

OpenAI’s “12 Days of OpenAI” continued apace on Wednesday with the development team announcing a new seasonal voice for ChatGPT’s Advanced Voice Mode (AVM), as well as new video and screen-sharing capabilities for the conversational AI feature.

Santa Mode, as OpenAI is calling it, is a seasonal feature for AVM, and offers St. Nick’s dulcet tones as a preset voice option. It is being released to Plus and Pro subscribers through the website and mobile and desktop apps starting today and will remain so until early January. To access the limited-time feature, first sign in to your Plus or Pro account, then click on the snowflake icon next to the text prompt window.

Recommended Videos

Select Santa’s voice from the popup menu, confirm your choice, and start chatting. I, for one, am not entirely clear on why you’d want to talk to a large language model masquerading as a fictional religious figure, much less shell out $20 for the privilege, but OpenAI seems to believe it holds value. Note that the system will not log your chats with Santa, they won’t be saved to your chat history, nor will they impact the memory of ChatGPT.

Just in time for the holidays, video and screensharing are now starting to roll out in Advanced Voice in the ChatGPT mobile app. pic.twitter.com/HFHX2E33S8

— OpenAI (@OpenAI) December 12, 2024

The company is also rolling out a long-awaited feature for Advanced Voice Mode: the ability to analyze video and screen shares through the mobile AVM interface. With it, you’ll be able to share your screen or video feed with ChatGPT, hold real-time conversations, and get it to answer questions about what you see, without needing to describe your surroundings or upload photos.

The new feature is rolling out to Plus and Pro subscribers “in most countries” according to the company, as well as to all Teams users. Stringent privacy laws are delaying the feature’s release in the EU, Switzerland, Iceland, Norway, and Liechtenstein, though the company hopes to get it to Plus and Pro subscribers in those regions “soon.” Enterprise and Edu users will have to wait until January to try it for themselves. If you have access, you can launch the new feature by opening voice mode, then tapping the video camera icon in the lower left. To launch screen share, just tap the three-dot menu and select “Share Screen.”

Wednesday’s announcement marks the fourth day of OpenAI’s live stream event. The company has already unveiled its fully-functional 01 reasoning model, its Sora video generation model, a $200/month Pro subscription tier, and updates to ChatGPT’s Canvas.

Andrew Tarantola
Former Computing Writer
Andrew Tarantola is a journalist with more than a decade reporting on emerging technologies ranging from robotics and machine…
Google might have to sell Chrome — and OpenAI wants to buy it
OpenAI press image

It feels like all of the big tech companies practically live in courtrooms lately, but it also feels like not much really comes of it. Decisions get made and unmade again, and it takes a long time for anything to affect consumers. At the moment, Google is in danger of getting dismantled and sold for parts -- and if it really happens, OpenAI has told the judge that it would be interested in buying.

OpenAI, the company behind ChatGPT, currently doesn't work with Google at all. Apparently, it wanted to make a deal last year to use Google's search technology with ChatGPT but it didn't work out. Instead, OpenAI is now working on its own search index but it's turning out to be a much more time-consuming project than anticipated.

Read more
The original AI model behind ChatGPT will live on in your favorite apps
OpenAI press image

OpenAI has released its GPT‑3.5 Turbo API to developers as of Monday, bringing back to life the base model that powered the ChatGPT chatbot that took the world by storm in 2022. It will now be available for use in several well-known apps and services. The AI brand has indicated that the model comes with several optimizations and will be cheaper for developers to build upon, making the model a more efficient option for features on popular applications, including Snapchat and Instacart. 

Apps supporting GPT‑3.5 Turbo API

Read more
Your politeness toward ChatGPT is increasing OpenAI’s energy costs 
ChatGPT's Advanced Voice Mode on a smartphone.

Everyone’s heard the expression, “Politeness costs nothing,” but with the advent of AI chatbots, it may have to be revised.

Just recently, someone on X wondered how much OpenAI spends on electricity at its data centers to process polite terms like “please” and “thank you” when people engage with its ChatGPT chatbot.

Read more