Skip to main content

OpenAI’s Advanced Voice Mode can now see your screen and analyze videos

Advanced Santa voice mode
OpenAI

OpenAI’s “12 Days of OpenAI” continued apace on Wednesday with the development team announcing a new seasonal voice for ChatGPT’s Advanced Voice Mode (AVM), as well as new video and screen-sharing capabilities for the conversational AI feature.

Santa Mode, as OpenAI is calling it, is a seasonal feature for AVM, and offers St. Nick’s dulcet tones as a preset voice option. It is being released to Plus and Pro subscribers through the website and mobile and desktop apps starting today and will remain so until early January. To access the limited-time feature, first sign in to your Plus or Pro account, then click on the snowflake icon next to the text prompt window.

Recommended Videos

Select Santa’s voice from the popup menu, confirm your choice, and start chatting. I, for one, am not entirely clear on why you’d want to talk to a large language model masquerading as a fictional religious figure, much less shell out $20 for the privilege, but OpenAI seems to believe it holds value. Note that the system will not log your chats with Santa, they won’t be saved to your chat history, nor will they impact the memory of ChatGPT.

Just in time for the holidays, video and screensharing are now starting to roll out in Advanced Voice in the ChatGPT mobile app. pic.twitter.com/HFHX2E33S8

— OpenAI (@OpenAI) December 12, 2024

The company is also rolling out a long-awaited feature for Advanced Voice Mode: the ability to analyze video and screen shares through the mobile AVM interface. With it, you’ll be able to share your screen or video feed with ChatGPT, hold real-time conversations, and get it to answer questions about what you see, without needing to describe your surroundings or upload photos.

The new feature is rolling out to Plus and Pro subscribers “in most countries” according to the company, as well as to all Teams users. Stringent privacy laws are delaying the feature’s release in the EU, Switzerland, Iceland, Norway, and Liechtenstein, though the company hopes to get it to Plus and Pro subscribers in those regions “soon.” Enterprise and Edu users will have to wait until January to try it for themselves. If you have access, you can launch the new feature by opening voice mode, then tapping the video camera icon in the lower left. To launch screen share, just tap the three-dot menu and select “Share Screen.”

Wednesday’s announcement marks the fourth day of OpenAI’s live stream event. The company has already unveiled its fully-functional 01 reasoning model, its Sora video generation model, a $200/month Pro subscription tier, and updates to ChatGPT’s Canvas.

Andrew Tarantola
Andrew Tarantola is a journalist with more than a decade reporting on emerging technologies ranging from robotics and machine…
ChatGPT prototypes its next strike against Google Search: browsers
ChatGPT on a laptop

ChatGPT developer OpenAI may be one step closer to creating a third-party search tool that integrates the chatbot into other websites as primary feature. If the project comes to fruition, OpenAI could target Google as both a search engine and web browser.

A source told The Information the project is a search tool called NLWeb, Natural Language Web, and that it is currently in a prototype phase. OpenAI has showcased the prototype to several potential partners in travel, retail, real estate, and food industries, with Conde Nast, Redfin, Eventbrite, and Priceline being named by brand. The tool would enable ChatGPT search features onto the websites of these brands' products and services.

Read more
ChatGPT’s latest model may be a regression in performance
chatGPT on a phone on an encyclopedia

According to a new report from Artificial Analysis, OpenAI's flagship large language model for ChatGPT, GPT-4o, has significantly regressed in recent weeks, putting the state-of-the-art model's performance on par with the far smaller, and notably less capable, GPT-4o-mini model.

This analysis comes less than 24 hours after the company announced an upgrade for the GPT-4o model. "The model’s creative writing ability has leveled up–more natural, engaging, and tailored writing to improve relevance & readability," OpenAI wrote on X. "It’s also better at working with uploaded files, providing deeper insights & more thorough responses." Whether those claims continue to hold up is now being cast in doubt.

Read more
ChatGPT just improved its creative writing chops
a phone displaying the ChatGPT homepage on a beige bbackground.

One of the great strengths of ChatGPT is its ability to aid in creative writing. ChatGPT's latest large language model, GPT-4o, has received a bit of a performance boost, OpenAI announced Wednesday. Users can reportedly expect "more natural, engaging, and tailored writing to improve relevance & readability" moving forward.

https://twitter.com/OpenAI/status/1859296125947347164

Read more