

FFmpeg 8.0 Merges OpenAI Whisper Filter For Automatic Speech Recognition

([Multimedia] 5 Hours Ago FFmpeg + Whisper)


The upcoming [1]FFmpeg 8.0 multimedia library release continues to get more exciting almost by the day. The newest feature being squeezed into this next release is a Whisper audio filter, which uses OpenAI's Whisper model to provide automatic speech recognition / transcription capabilities.

For those unaware, Whisper is an automatic speech recognition model that was trained on a very large dataset and has proven to be extremely capable. FFmpeg 8.0 can be built with the "--enable-whisper" configure option when the Whisper.cpp library is present on the system, which provides the OpenAI Whisper model support. There is optional GPU acceleration, along with various tunables, for running automatic transcription with FFmpeg: writing the text to an SRT file, sending the output in JSON format to an HTTP web service, and other capabilities.
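
As a rough illustration of the SRT use case, here is a minimal Python sketch that assembles an ffmpeg command line using the new filter. It only builds the argument list and does not run anything. The filter option names (model, language, queue, destination, format) are assumptions drawn from the filter's documentation rather than from this article, and the file names and the build_whisper_command helper are hypothetical. Check "ffmpeg -h filter=whisper" on your own build before relying on any of it.

```python
import shlex


def build_whisper_command(input_path, model_path, srt_path, language="en", queue=3):
    """Return an argv list that asks ffmpeg to transcribe audio to an SRT file.

    Assumption: the whisper filter accepts the options named below. They are
    not confirmed by the article, so verify them against your FFmpeg build.
    """
    filter_options = {
        "model": model_path,      # path to a Whisper.cpp model file (hypothetical name)
        "language": language,     # spoken language of the input
        "queue": str(queue),      # seconds of audio buffered before each transcription pass
        "destination": srt_path,  # where the transcript is written
        "format": "srt",          # output format for the transcript
    }
    # Filter options are joined as key=value pairs separated by colons.
    # Paths that themselves contain ':' would need escaping, which this sketch skips.
    filter_spec = "whisper=" + ":".join(f"{k}={v}" for k, v in filter_options.items())
    # -vn drops video; "-f null -" discards the media output, since only the
    # transcript written by the filter is wanted here.
    return ["ffmpeg", "-i", input_path, "-vn", "-af", filter_spec, "-f", "null", "-"]


if __name__ == "__main__":
    cmd = build_whisper_command("talk.mp4", "ggml-base.en.bin", "talk.srt")
    print(shlex.join(cmd))
```

Passing the list to subprocess.run would execute it, provided FFmpeg was configured with "--enable-whisper" and the model file exists.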

Those interested in the OpenAI Whisper audio filter support that was merged into FFmpeg over the weekend can find it via [2]this Git commit.

[3]FFmpeg 8.0 should be released within a few weeks and will also feature a number of Vulkan acceleration enhancements, new CPU performance optimizations, and a wide variety of other improvements for this widely-used open-source multimedia library.



[1] https://www.phoronix.com/search/FFmpeg+8.0

[2] https://git.ffmpeg.org/gitweb/ffmpeg.git/commit/13ce36fef98a3f4e6d8360c24d6b8434cbb8869b

[3] https://www.phoronix.com/news/FFmpeg-8.0-Coming-Soon


