I already wrote this below, but Yandex Browser already does this! It only translates to Russian, and with live streams (on Youtube for example) you get a ~15 seconds delay.
It’s basically as a real-time transcription -> translation -> voice generation pipeline so the accuracy is as good as the transcript it manages to extract from.
I already wrote this below, but Yandex Browser already does this! It only translates to Russian, and with live streams (on Youtube for example) you get a ~15 seconds delay.
It’s basically as a real-time transcription -> translation -> voice generation pipeline so the accuracy is as good as the transcript it manages to extract from.
nice