Determines speech segments in the input audio stream.
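To make the idea concrete, here is a minimal sketch of one common way to find speech segments: thresholding short-time energy. It is an illustration only, not the detector used by the platform. The function name `detect_speech_segments`, the 20 ms frame size, and the threshold value are assumptions chosen for the example.

```python
import math


def detect_speech_segments(samples, sample_rate, frame_ms=20, threshold=0.02):
    """Return (start_s, end_s) tuples for runs of frames whose RMS exceeds threshold.

    Illustrative energy-based detector; samples are floats in [-1.0, 1.0].
    """
    frame_len = int(sample_rate * frame_ms / 1000)
    n_frames = len(samples) // frame_len
    segments = []
    start = None  # index of the first frame of the current speech run

    for i in range(n_frames):
        frame = samples[i * frame_len:(i + 1) * frame_len]
        rms = math.sqrt(sum(x * x for x in frame) / frame_len)
        if rms > threshold and start is None:
            start = i  # a speech run begins
        elif rms <= threshold and start is not None:
            # the run ended at the beginning of this quiet frame
            segments.append((start * frame_len / sample_rate,
                             i * frame_len / sample_rate))
            start = None

    if start is not None:  # audio ended while still inside a speech run
        segments.append((start * frame_len / sample_rate,
                         n_frames * frame_len / sample_rate))
    return segments
```

A production detector would typically add smoothing and a trained model; the sketch only shows what "segments" means in terms of start and end times.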
Voice to text (V2T)
Converts detected speech segments into text with time stamps for indexing.
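As a rough sketch of why time stamps matter for indexing, the example below builds a keyword index from time-stamped words. The `(word, start_s, end_s)` tuple layout and the function name `build_keyword_index` are hypothetical; the actual V2T output format is described in the API documentation.

```python
from collections import defaultdict


def build_keyword_index(timed_words):
    """Map each lowercased word to the list of start times where it was spoken.

    timed_words: iterable of (word, start_s, end_s) tuples (assumed layout).
    """
    index = defaultdict(list)
    for word, start_s, _end_s in timed_words:
        index[word.lower()].append(start_s)
    return dict(index)


# Hypothetical recognizer output for a short clip.
words = [("Prague", 1.2, 1.6), ("weather", 1.7, 2.1), ("prague", 8.0, 8.4)]
index = build_keyword_index(words)
```

With such an index, looking up a keyword gives the positions in the recording to jump to.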
Formats the stream of recognized words into a proper form using a set of grammars.
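The following toy example illustrates the kind of rewriting such formatting performs: spoken number words become digits, "percent" becomes a sign, and the sentence is capitalized and punctuated. The rules and the function name `format_words` are assumptions made for the example and cover only numbers below one hundred; they are not the platform's grammars.

```python
UNITS = {"zero": 0, "one": 1, "two": 2, "three": 3, "four": 4,
         "five": 5, "six": 6, "seven": 7, "eight": 8, "nine": 9}
TENS = {"twenty": 20, "thirty": 30, "forty": 40, "fifty": 50,
        "sixty": 60, "seventy": 70, "eighty": 80, "ninety": 90}


def format_words(words):
    """Turn a list of recognized lowercase words into a formatted sentence."""
    out = []
    i = 0
    while i < len(words):
        w = words[i]
        if w in TENS:
            value = TENS[w]
            # Rule: a tens word followed by a non-zero unit forms one number.
            if i + 1 < len(words) and UNITS.get(words[i + 1], 0) > 0:
                value += UNITS[words[i + 1]]
                i += 1
            out.append(str(value))
        elif w in UNITS:
            out.append(str(UNITS[w]))
        elif w == "percent" and out and out[-1].isdigit():
            out[-1] += "%"  # Rule: attach the sign to the preceding number.
        else:
            out.append(w)
        i += 1
    text = " ".join(out)
    return text[:1].upper() + text[1:] + "."


formatted = format_words("sales rose twenty five percent".split())
```

Real grammars also handle dates, currencies, abbreviations, and language-specific conventions, which this sketch leaves out.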
Add your own custom words with every request.
Processing more than hours of audio every day
Accuracy varies depending on task and language. Contact us to try V2T on your own data.
All our APIs process audio and return results in real time, so you can react immediately.
V2T is available for 18 languages, with optional live updates, custom language models, and the ability to add words to the dictionary.
Supports on-premises installations without internet access, at any scale.
NEWTON SpeechGrid is a complete workflow for automatic transcription and processing of audio recordings and voice dictation. It was developed on top of our platform by NEWTON Technologies.
Deployed at NEWTON Media, a.s., a leading media monitoring and analysis group in 10 European markets, for transcription of TV and radio broadcasts in Slavic languages.
NEWTON Media, a.s., uses our platform to notify its customers immediately when a monitored keyword occurs in a television or radio broadcast.
Batch processing of offline recordings.
Real-time communication inside web browser.
Non-web applications with fast response time requirements.
See our API documentation here.
Highly available cluster