latentbrief
Back to news
Launch1d ago

Google Unveils Real-Time Voice Translation Across 70 Languages

DeepMind Safety, The Decoder1 min brief

In brief

  • Google has launched Gemini 3.5 Live Translate, a groundbreaking audio model that offers real-time speech-to-speech translation in over 70 languages.
    • This innovation eliminates the need for awkward pauses between sentences, instead providing continuous translation that stays just a few seconds behind the speaker.
    • It captures the speaker's tone, pacing, and pitch, making the experience feel natural and seamless.
  • The new system is now available across Google products, including public preview via the Gemini Live API for developers, private preview in Google Meet for enterprises, and soon on mobile devices through Google Translate.
  • For businesses like Grab, it’s being tested to enable near-real-time communication between drivers and passengers, handling over 10 million voice calls monthly.
  • Gemini 3.5 Live Translate marks a significant step forward in bridging language barriers globally, offering flexibility for developers to integrate into various applications while ensuring robust performance even in noisy environments.
  • Look out for further integrations with platforms like Agora and LiveKit as they continue to enhance real-time communication tools worldwide.

Terms in this brief

Gemini 3.5 Live Translate
A new audio model by Google that translates speech in real-time across over 70 languages, keeping up with the speaker's flow and maintaining their tone. It’s available in various Google products and is being tested to help services like Grab handle millions of voice calls monthly.

Read full story at DeepMind Safety, The Decoder

More briefs