Zoom-Simultaneous Live Interpreter

Zoomtopia. In between Snoop Dogg and Lynyrd Skynyrd, they are having a technology conference.

You have to see it to believe it so I found a short 30 second clip of the demo.

Someone once asked how Zoom was leveraging AI. Live transcription is one way. Now this. Simultaneous Live Interpreter. Exactly what it sounds like. Speak in native tongue and the sound that comes out is in the native language of the receiver.

Well… just watch




Ok here’s more of a description of how it works. Still need an actual interpreter apparently. For now.

Simultaneous interpretation: Users bring their own interpreters and Zoom provides multiple audio channels that mix the original and interpreter audio at an optimal ratio that enables the listener to understand the interpreter but also hear the tone of the original speaker.

Pretty incredible.


I was thinking live interpretation by a computer which then output via audio would require a crap load of hardware. Still, a step in the right direction. I like seeing innovation like this. Just opened up a position in ZM yesterday.

Can’t help but wonder how that works with a tonal language like Mandarin . . .

Can’t help but wonder how that works with a tonal language like Mandarin . . .

The Chinese have speech recognition software too. I’m guessing tonality is just another feature of the sound heard by the computer that is taken into account. Mandarin is not the only language that uses different intonation.

Considering Zoom has much of its research headquartered in China, I bet it will not be a problem.

1 Like

The ZOOm CEO is a native Chinese so he knows Chinese very well to tackle this problem.