OpenAI’s Whisper is a new AI-powered solution that can turn your voice into text.
Best of all, it comes at zero cost.
Especially if you want to use your Nvidia GPU’s Tensor Cores to give it a nice boost.
Don’t fret, though.
That’s why we’re here!
What Is OpenAI’s Whisper?
ChatGPT is all the rage nowadays, and we already sawhow you could use ChatGPT by OpenAI.
And yet, it’s not the only interesting project by OpenAI.
For GPUs to be useful for more than graphics, they’d have to act as fully programmable processors.
That’s why Nvidia created CUDA, officially deemed “a parallel computing platform and programming model”.
CUDA is proprietary Nvidia technology, only compatible with Nvidia GPUs.
The closest alternatives for AMD’s hardware are OpenCL and Radeon Compute Platform.
Compared to the alternatives, CUDA is considered more mature, performant, and easier to use.
And that includes Whisper.
It relies on other software, which must also be installed.
Check our guide onthe quickest way to install Windows softwarefor more info on Chocolatey.
Use this command to do that:
Replace “OLDER_VERSION” with a version, like 3.10.
Windows includes such an appfor more info on that, seehow to use the Windows 10 Voice Recorder app.
For a more full-featured option, tryAudacity.
Learn how to do it with our guide onhow to use Audacity to record audio on Windows and Mac.
Once processed, the text file (named “LatestNote.mp3.txt”) will appear in the same folder.
Open it in a text editor likeNotepadto view the translated text.
Let’s expand on them to help you choose the best for your needs.
Which Model to Choose?
Whisper offers various language models.
The larger the model, the more improved its accuracy, but also the higher its hardware requirements.
They are:
Most native English speakers should be fine with thetinyorbasemodels.
Non-native English speakers may see better results with larger models, likesmallandmedium.
Let’s make a globally accessible batch file to streamline the process.
However, until recently, talking instead of typing wasn’t optimal for creating documents.
Most voice-to-text solutions produced mediocre results.
You could find a few solutions worth trying, but they were complicated to use, or costly.
Thankfully, Whisper changed all that.