Whisper for Large Audio

Whisper for Large Audio

Your audio files too big for Whisper? Not anymore.

5.0
β€’1 reviewβ€’

33 followers

πŸš€ Simple solution to your large audio transcription woes πŸŽ™οΈ Split any large audio file *directly* in your browser πŸ’‘ Transcribe the partitions with OpenAI's Whisper πŸ”— Get a complete transcript with just one click βœ… No more size limits. No more manual effort
Whisper for Large Audio gallery image
Whisper for Large Audio gallery image
Whisper for Large Audio gallery image
Free
Launch tags:
Productivityβ€’Moviesβ€’Audio
Launch Team

What do you think? …

Trevor Healy
Hi, Product Hunt community, My name's Trevor Healy, I'm a frontend engineer. I'm excited to share my first Product Hunt creation "Whisper for Large Audio" (product naming is my strong suit) with you today! https://whisper.trevorhealy.me As some of you may know, Whisper is an audio transcription technology created by OpenAI. In my experience, it's a solid balance of cost-effectiveness, accuracy, and ease of use. However, I stumbled upon a constraint when I began transcribing over 100 hours of audio - the 25MB upper file size limit. For someone looking at transcribing days worth of audio content, sometimes several GB in size, this posed a challenge. Sure, I could partition all the files locally, but what about the future audio files? Would this always be a manual process? What if someone non-technical wanted to achieve the same thing? I wondered: what if all you needed was the audio file and a browser? I decided to build a tool that anyone could use. Introducing "Whisper for Large Audio": πŸŽ™οΈ Utilizes FFmpeg in the browser using Web Assembly – in my opinion, this is the coolest part of the product. This lets us split even the heftiest audio files right where you are: in your browser. This processing is memory intensive, and I notice it has spotty success on mobile devices. πŸ“œ Transcribes each section using OpenAI's Whisper. Rather than send 100 partitions at a time (which will certainly be rate-limited), the app will concurrently process the partitions in batches. Since Whisper prices by source audio time, the 4 hour file cut into 100 pieces will be the same price as the large one source file. 🀝 No time-saving utility app would be complete without a final copy to save button that ties it all together. Simply bundles the partitions into one large cohesive transcript. The idea is simple: skip the need for servers to process the audio, eliminate manual slicing, one-click processing, and one-click copying. All within your browser. I genuinely hope this tool is helpful to even one person. I love talking about these technologies and am keen to hear your thoughts, feedback, and ways you might find "Whisper for Large Audio" useful. Cheers to making things a bit simpler! πŸ™‡β€β™‚οΈ
Abhishek Choudhury
Neatly done, Trevor, My organization Blue Hex Software is also working on a similar use case.
Trevor Healy
Appreciate the kind words @yesabhishek πŸ™
Vrushank Vyas
Love it. This is what actually makes Whisper immediately helpful for me @trevorwhealy. Super impressed that the partitioning happens directly in the browser.
Trevor Healy
@vrv18 Thank you so much, I'm really glad to hear - your feedback makes my day πŸ™