Yes - ASR on browser/device has reached the quality threshold pretty much with distil whisper/ whisper 3.
Quality TTS with fast latency is still not there but getting better (you can use Tortoise but its slow and compute expensive as far as I know). Bark is another option but has mixed results.
I played around with a recent transformers.js one (using SpeechT5) you can run in your browser and am optimistic where we can go with some improvements:
Quality TTS with fast latency is still not there but getting better (you can use Tortoise but its slow and compute expensive as far as I know). Bark is another option but has mixed results.
I played around with a recent transformers.js one (using SpeechT5) you can run in your browser and am optimistic where we can go with some improvements:
https://tinyllms.vercel.app/dashboard/tts