Based on a 7B-parameter large language model (LLM) called Helium, the chatbot is currently available to all and can speak in various accents and in 70 different emotional and speaking styles. Moshi can also handle two audio streams simultaneously, meaning it can listen and talk at the same time.
Kyutai says that it aimed to teach Moshi various nuances and tones of human conversations. To enhance the voice quality, the company even collaborated with a professional voice artist.
Kyutai says its goal is to make the chatbot an open-source project, that is, to make the model’s code and framework available to all, so that users can use the chatbot without having to worry about privacy. While Moshi is faster than GPT-4o, the company describes it as a research prototype meant to showcase the bot’s response time and its ability to replicate not only sentences but also tones and voices.
However, unlike GPT-4o, Moshi is fairly small and was developed from scratch in six months by a team of just eight researchers. It was reportedly trained on 100,000 synthetic dialogues generated with text-to-speech technology.