Extract speech from video and audio files

yuribudilov · September 19, 2022, 5:10am

Hello everyone

This is not a Rust-specific topic, except that I want to use the result from a Rust program.

Is there any Rust-accessible library (ideally a Rust crate) that can take an input video file (e.g. video.mp4) or an input audio file and extract the spoken content from it as text output?
Or even a tool or utility that can do this from the command line?

I want to be able to analyze, index, and search for words and phrases spoken in video and audio files.

I just Googled for answers, and the only cloud option I found was the Microsoft Azure Cognitive Services API. I have not tried it yet (it could be great!), but perhaps there are more options to investigate, such as AWS, Google Cloud (GCP), C/C++ libraries, or Rust crates?

Also this: Picovoice (github.com)

Any more ideas? Or perhaps someone has used something else?

Many thanks.

system · December 18, 2022, 5:11am

This topic was automatically closed 90 days after the last reply. We invite you to open a new topic if you have further questions or comments.
