OpenAI launches ChatGPT advanced voice mode and releases a dataset containing 14 languages

2024-09-25 14:55:00

Recently, OpenAI has taken an important step in the globalization of AI.

Not only did the company launch ChatGPT's Advanced Voice Mode (AVM), but it also released a multilingual dataset of 14 languages to evaluate the performance of language models.

Both initiatives aim to increase the global accessibility and usefulness of AI technology.

OpenAI has announced that it is expanding AVM to more paying users. This audio feature lets users talk with ChatGPT more naturally, and it will roll out first to ChatGPT Plus and Team customers. Enterprise and education customers will begin gaining access next week.

As part of the rollout, AVM has been given a redesign. The feature is now represented by an animated blue sphere, replacing the animated black dots that OpenAI used when it demonstrated the technology in May.

Once AVM has been made available to them, users will see a pop-up next to the voice icon in the ChatGPT app.

In addition, ChatGPT has added five new voices that users can try: Arbor, Maple, Sol, Spruce, and Vale.

That brings ChatGPT's total to nine voices, almost as many as Google's Gemini Live.

Interestingly, the voice names are all inspired by nature, perhaps because the aim is to make ChatGPT feel more natural to use.

It's worth noting that the "Sky" voice that OpenAI showed during its Spring Update does not appear in this update. The reason is that the actor Scarlett Johansson raised objections.

Johansson, who played an AI system in the movie Her, claimed that Sky's voice was too similar to her own.

In response, OpenAI quickly took the Sky voice down, saying it never intended to emulate Johansson's voice, even though several employees tweeted about the film at the time.

(Source: OpenAI)

OpenAI told the media that it has made a series of improvements since announcing AVM's alpha test.

ChatGPT's voice feature is now better at understanding accents, and conversations are smoother and faster than before.

In addition, OpenAI has extended some of ChatGPT's customization features to AVM, such as allowing users to customize how ChatGPT responds.

However, ChatGPT's video and screen-sharing features are not part of this rollout. Those features were supposed to allow GPT-4o to process visual and auditory information at the same time. At the moment, OpenAI has not provided a timeline for when these multimodal features will be rolled out.

In addition to Advanced Voice Mode, OpenAI also released the Multilingual Massive Multitask Language Understanding (MMMLU) dataset on the open data platform Hugging Face.

This new evaluation tool is based on the MMLU benchmark.

Originally English-only, MMLU tests an AI system's knowledge across 57 subject areas such as math, law, and computer science. The new MMMLU dataset covers 14 languages, including Chinese, Arabic, German, and Bengali.

By bringing these diverse languages, some of which have limited training data, into a single multilingual evaluation, OpenAI has set a new benchmark for multilingual AI capabilities.

This benchmark could lead to more equitable global access to the technology. The AI industry has long been criticized for failing to develop language models that can understand languages spoken by millions of people around the world.

Until recently, AI research focused on English and a handful of widely spoken languages, resulting in many low-resource languages being overlooked.

OpenAI chose to include languages such as Swahili and Yoruba, which, despite their large numbers of speakers, are often overlooked in AI research. It is also a sign that AI technology is moving in a more inclusive direction.

To ensure the accuracy of the MMMLU dataset, OpenAI hired professional human translators. The resulting translations are more accurate than those in comparable datasets that rely on machine translation, especially for languages with fewer training resources.

By relying on human expertise, OpenAI ensures that the dataset provides a more reliable basis for evaluating multilingual AI models.

For enterprises, the MMMLU dataset provides an opportunity to benchmark their own AI systems in a global context.
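As a rough illustration of what such benchmarking involves, the sketch below scores a model on four-option multiple-choice questions and reports accuracy per language. It is only a minimal sketch: the `Item` schema, the sample questions, and the `always_b` stand-in model are all made up for this example and are not taken from the MMMLU dataset or from any OpenAI tooling.

```python
from collections import defaultdict
from dataclasses import dataclass
from typing import Callable, Iterable


@dataclass(frozen=True)
class Item:
    """One multiple-choice question (hypothetical schema, not MMMLU's own)."""
    language: str
    question: str
    choices: tuple[str, str, str, str]  # options A, B, C, D
    answer: str                         # correct letter, "A" to "D"


def accuracy_by_language(
    items: Iterable[Item], predict: Callable[[Item], str]
) -> dict[str, float]:
    """Return the fraction of correctly answered items for each language."""
    correct: dict[str, int] = defaultdict(int)
    total: dict[str, int] = defaultdict(int)
    for item in items:
        total[item.language] += 1
        if predict(item) == item.answer:
            correct[item.language] += 1
    return {lang: correct[lang] / total[lang] for lang in total}


# Placeholder questions invented for this sketch; real items would come
# from the benchmark itself.
SAMPLE_ITEMS = [
    Item("de", "Was ist 2 + 2?", ("3", "4", "5", "6"), "B"),
    Item("de", "Was ist 3 * 3?", ("6", "8", "9", "12"), "C"),
    Item("sw", "2 + 3 ni ngapi?", ("5", "4", "6", "7"), "A"),
    Item("sw", "10 - 4 ni ngapi?", ("5", "7", "8", "6"), "D"),
]


def always_b(item: Item) -> str:
    """Stand-in for a real model call: always answers "B"."""
    return "B"


if __name__ == "__main__":
    scores = accuracy_by_language(SAMPLE_ITEMS, always_b)
    for lang, acc in sorted(scores.items()):
        print(f"{lang}: {acc:.2f}")
```

In practice, `always_b` would be replaced by a call to the system being evaluated. Reporting one score per language, rather than a single average, makes gaps between widely spoken and low-resource languages visible.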

As companies expand into international markets, the ability to deploy AI solutions that can understand multiple languages becomes critical.

Whether it's customer service, content moderation, or data analytics, AI systems that perform well in multiple languages can provide a competitive advantage by reducing communication friction and improving the user experience.

In addition to the release of the MMMLU dataset, OpenAI has also launched the OpenAI Academy program to further its commitment to global AI accessibility.

(Source: OpenAI)

According to OpenAI's announcement, the Academy aims to invest in developers and mission-oriented organizations that are using AI to solve critical problems in their communities, especially in low- and middle-income countries.

The Academy will provide training, technical mentorship, and $1 million in Application Programming Interface (API) credits to ensure local AI talent has access to cutting-edge resources.

By supporting developers who understand the unique social and economic challenges of their region, OpenAI hopes to empower communities to build AI applications tailored to local needs.

Resources:

https://techcrunch.com/2024/09/24/openai-rolls-out-advanced-voice-mode-with-more-voices-and-a-new-look/

https://venturebeat.com/ai/openai-tackles-global-language-divide-with-massive-multilingual-ai-dataset-release/

Operation/Typesetting: He Chenlong
