Ggml-medium.bin
Journalists transcribing a 1-hour interview. Using the ggml-medium.bin model on a MacBook Air (M1) takes approximately 4 minutes to transcribe the hour. The "Large" model would take 15 minutes. The "Tiny" model would take 1 minute, but produce gibberish on thick accents.
./stream -m ggml-medium.bin -t 8 --step 3000 --length 10000 ggml-medium.bin
At its core, ggml-medium.bin is a serialized weight file for the automatic speech recognition (ASR) model, specifically formatted for use with the GGML library. To break that down: Journalists transcribing a 1-hour interview