Hacker Newsnew | past | comments | ask | show | jobs | submit | davitb's commentslogin

This is for listener-side, not speaker-side. So no misuse case here.


A small, audio-only turn-taking model from Krisp


They recently added noise cancellation to realtime transcription.

https://platform.openai.com/docs/guides/realtime-transcripti...


The hard part is to separate background voices (e.g. TV, chatter, etc) from the primary speaker's voice. Basically do voice isolation. Voice fingerprinting would help only in this context.


On average, an hour of speech contains about 9,000 to 15,000 words. This range accounts for different speaking speeds, which typically vary from 150 to 250 words per minute.

So this translates to tens of millions of words.


In the last 3 months our team worked hard to build Krisp Chrome Extension.

We had to squeeze our noise cancellation DNN 30x to fit it inside Chrome.

This was quite challenging and thought it’s a story worth sharing.


Very cool stuff, Davit! Any idea if the same can be done in Firefox?


For the next 6 months, Krisp is free for all students, teachers, hospital and government workers.

Also, it just went Freemium, comes with 120min/week free noise cancellation.

More here: https://krisp.ai/blog/covid19-response/

(I'm the CEO & Co-Founder)


Just signed up to try it. The Mac menubar window footer says my free subscription expires on April 1, 2020?


You have trial until April 1st. Then it converts to 120mins/week. If you are a student, teacher, hospital worker or a non-profit - reach out to support@krisp.ai and you will receive 6 months free.


We have built a DNN which does 2 things on real-time audio:

a) remove background noise b) remove room echo

We’ve then embedded it in a virtual microphone which can be integrated into ZoomRooms or similar products.

Demo with video here: https://www.youtube.com/watch?v=AnoWG1JBe8A


It definitely removes the background noise and increases legibility.

However the voices sound "pinched" to me. It is a lot like one of those head related transfer functions that is supposed to make you think the sound comes from above, but it sounds like multiple band reject filters were applied and makes me feel some kind of pressure in my head.


CallerID is not supported yet but it will come soon.


Yes, and hopefully very soon.

Apparently getting access to the microphone stream during calls, even from your own app, is really tough. There is only one provider that we know that implements the concept of "advanced audio filters" and that's Twilio's Voice SDK for iOS.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: