Meet Moshi, a new GPT-4o challenger with a surprisingly competitive voice mode

It looks like Moshi has been named after the Japanese phrase for answering a phone call

Written by Poulami Saha

July 8, 2024 13:57 IST

Say hello to ‘Moshi’, a French AI chatbot

Say hello to ‘Moshi’, a French AI chatbot

It looks like after GPT-4o, Kyutai, a French AI company has developed a new AI-powered chatbot called “Moshi”. Reportedly, French artificial intelligence developer Kyutai has introduced a real-time voice AI assistant named Moshi.

It looks like after OpenAI’s chatGPT faced many backlashes, Kyutai Labs brought a new approach to AI chatbot. It is expected that Moshi is touted as a rival to OpenAI’s GPT-4o. Early reports suggest that many users were not happy with the ‘voice mode’ of GPT-4o. Amidst this, Kyutai seems to bring a better ‘voice mode’ with its Moshi AI chatbot.

Also Read

ChatGPT Mac app raises security concern; Here’s what you can do

Meet ‘Moshi’

It looks like Moshi has been named after the Japanese phrase for answering a phone call. The company seems to boast of capabilities. The French company claims that their voice mode is better than that of OpenAI’s highly anticipated GPT-4o Advanced Voice Mode.

As reported by Kyutai, Moshi can speak in various accents. In addition to this Moshi is expected to have about 70 different emotional and speaking styles. The AI can even handle two audio streams simultaneously. This means you can expect a ‘human like’ conversation from the AI chatbot.

Also Read

New OpenAI tool ‘CriticGPT’ to help find errors in AI-generated code

Key features of ‘Moshi’

So what are the key features of Moshi? Given below are the key features that you can expect from Moshi AI chatbot.

The Moshi AI chatbot can interpret the user’s tone of voice. In addition to this it can also add a layer of emotional intelligence to interactions.
Similar to other AI assistants, Moshi can be interrupted mid-response, mimicking natural conversation flow.
The company claimed that with a mere 200-millisecond response time, Moshi can outpace GPT-4o’s reported 232-320 millisecond range.
The AI chatbot can also operate without an internet connection. This is expected to help in enhancing privacy and accessibility.
Moshi is expected to speak in various accents. The AI chatbot can also emulate 70 different emotional and speaking styles.

Other updates

Kyutai seems to have said that the development of Moshi involved fine-tuning over 100,000 synthetic dialogues made using Text-to-Speech (TTS) technology. Kyutai also plans to help teach Moshi the nuances and tones of human communication. Reportedly, the brand also collaborated with a professional voice artist to enhance Moshi’s voice quality.

Moshi is designed to provide lifelike conversations with users through voice, like Alexa or Google Assistant. However, Moshi is powered by the Helium 7B model. During a demonstration video Kyutai showed the capabilities of Moshi. During the presentation, the Kyutai team interacted with Moshi to illustrate it as a coach or companion. The demonstration also showcased its creativity through the incarnation of characters in roleplays.

Also Read

Believe it or not but a robot just died by suicide

Furthermore, the company also plans to develop an AI-powered audio identification, watermarking, and signature tracking system for future integration with Moshi.

Follow FE Tech Bytes on Twitter, Instagram, LinkedIn, Facebook

Read Next

Brett Adcock to Nikhil Kamath: Humanoids are next big leap

Zoho hiring alert: Sridhar Vembu invites candidates with THIS key skill to contact him directly

‘Trump if you’re watching, I have 4 words’: Mamdani’s message to US prez in NYC victory speech

TOPICS

Artificial Intelligence

Technology Features

Technology news

+ 0 More

This article was first uploaded on July eight, twenty twenty-four, at fifty-seven minutes past one in the afternoon.

/

MUST READ SEBI cautions investors- ‘Digital gold products different from SEBI regulated gold products’

X