FFmpeg 8.0 Merges OpenAI Whisper Filter For Automatic Speech Recognition

([Multimedia] 5 Hours Ago FFmpeg + Whisper)

Reference: 0001569145
News link: https://www.phoronix.com/news/FFmpeg-Lands-Whisper
Source link:

The upcoming [1]FFmpeg 8.0 multimedia library release continues to get more exciting almost by the day. The newest feature being squeezed into this next release is a Whisper audio filter for making use of OpenAI's Whisper model for providing automatic speech recognition / transcription capabilities.

For those unaware, Whisper is an automatic speech recognition model trained on a very large dataset and has proven to be extremely capable. FFmpeg 8.0 can be built with the "--enable-whisper" library when the Whisper.cpp library is present on the system for having OpenAI Whisper model support. There is optional GPU acceleration and various tunables that can be used for then running automatic transcription with FFmpeg to dump the text to a SRT file, sending the output in JSON format to an HTTP web service, and other capabilities.

Those interested in this OpenAI Whisper audio filter support that was merged to FFmpeg over the weekend can be found via [2]this Git commit .

[3]FFmpeg 8.0 should release within a few weeks and also feature a number of Vulkan acceleration enhancements, new CPU performance optimizations, and a wide variety of other improvements for this widely-used open-source multimedia library.

[1] https://www.phoronix.com/search/FFmpeg+8.0

[2] https://git.ffmpeg.org/gitweb/ffmpeg.git/commit/13ce36fef98a3f4e6d8360c24d6b8434cbb8869b

[3] https://www.phoronix.com/news/FFmpeg-8.0-Coming-Soon

News: 0001569145

FFmpeg 8.0 Merges OpenAI Whisper Filter For Automatic Speech Recognition

mSparks

skeevy420

schmidtbag

skeevy420

mSparks

shmerl

Danny3

schmidtbag