OpenAI’s Whisper is a new AI-powered solution that can turn your voice into text.

Best of all, it comes at zero cost.

Especially if you want to use your Nvidia GPU’s Tensor Cores to give it a nice boost.

Don’t fret, though.

That’s why we’re here!

What Is OpenAI’s Whisper?

ChatGPT is all the rage nowadays, and we've already seen how you can use ChatGPT by OpenAI.

And yet, it’s not the only interesting project by OpenAI.

For GPUs to be useful for more than graphics, they’d have to act as fully programmable processors.

That’s why Nvidia created CUDA, officially deemed “a parallel computing platform and programming model”.

CUDA is proprietary Nvidia technology, only compatible with Nvidia GPUs.

The closest alternatives for AMD’s hardware are OpenCL and ROCm (the Radeon Open Compute platform).

Compared to the alternatives, CUDA is considered more mature, performant, and easier to use.

And that includes Whisper.
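Since CUDA only works on Nvidia GPUs, it can help to check whether a CUDA device is actually visible before expecting a speed-up. The sketch below assumes Whisper's PyTorch backend is installed; `pick_device` is a hypothetical helper name, not part of Whisper, and it simply falls back to the CPU when PyTorch or a CUDA-capable GPU is missing.

```python
def pick_device() -> str:
    """Return "cuda" if PyTorch reports a usable Nvidia GPU, otherwise "cpu".

    pick_device is a hypothetical helper for illustration, not a Whisper function.
    """
    try:
        import torch  # may be missing on a system without PyTorch installed
    except ImportError:
        return "cpu"
    # torch.cuda.is_available() is True only when a CUDA device and driver are usable
    return "cuda" if torch.cuda.is_available() else "cpu"
```

Calling `print(pick_device())` shows which device would be used on the current machine.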

It relies on other software, which must also be installed.
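As a rough way to confirm those prerequisites are in place, the following sketch looks for the required command-line tools on the system PATH. It assumes the tools in question are `python` and `ffmpeg`; `missing_tools` is a hypothetical helper name used only for this example.

```python
import shutil


def missing_tools(tools=("python", "ffmpeg")):
    """Return the names from `tools` that cannot be found on the PATH."""
    # shutil.which returns None when no matching executable is found
    return [name for name in tools if shutil.which(name) is None]
```

An empty list from `missing_tools()` suggests both tools are reachable from a terminal; any name it returns still needs to be installed or added to the PATH.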

Check our guide on the quickest way to install Windows software for more info on Chocolatey.

Use this command to do that:

Replace “OLDER_VERSION” with a version, like 3.10.

Windows includes such an app. For more info on that, see how to use the Windows 10 Voice Recorder app.

For a more full-featured option, try Audacity.

Learn how to do it with our guide on how to use Audacity to record audio on Windows and Mac.

Once processed, the text file (named “LatestNote.mp3.txt”) will appear in the same folder.
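For anyone who prefers to script this step, here is a minimal sketch that assembles the `whisper` command line from Python. It assumes the `whisper` command is on the PATH and uses the CLI's `--model`, `--task`, and `--output_dir` options; `build_whisper_command` and `transcribe` are hypothetical helper names, and "LatestNote.mp3" is just the example file from this guide. The exact output file name can vary between Whisper versions.

```python
import subprocess
from pathlib import Path


def build_whisper_command(audio_file, model="small", task="transcribe"):
    """Build the argument list for the whisper CLI (hypothetical helper)."""
    audio_path = Path(audio_file)
    return [
        "whisper", str(audio_path),
        "--model", model,                       # e.g. tiny, base, small, medium
        "--task", task,                         # "transcribe" or "translate"
        "--output_dir", str(audio_path.parent)  # write results next to the audio
    ]


def transcribe(audio_file, **options):
    """Run whisper on one file; raises CalledProcessError if the command fails."""
    subprocess.run(build_whisper_command(audio_file, **options), check=True)
```

For example, `transcribe("LatestNote.mp3", model="base")` would process the recording with the base model and leave the resulting text file in the same folder.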

Open it in a text editor like Notepad to view the translated text.

Let’s expand on them to help you choose the best for your needs.

Which Model to Choose?

Whisper offers various language models.

The larger the model, the better its accuracy, but also the higher its hardware requirements.

They are tiny, base, small, medium, and large.

Most native English speakers should be fine with the tiny or base models.

Non-native English speakers may see better results with larger models, like small and medium.
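To make the trade-off between accuracy and hardware requirements more concrete, the sketch below picks the largest model that should fit in a given amount of GPU memory. The VRAM figures are approximate values taken from the Whisper project's README and may change between releases, so treat them as assumptions; `largest_model_for` is a hypothetical helper name.

```python
# Approximate VRAM needs in GB, ordered from smallest to largest model.
# These numbers are assumptions based on the Whisper README and may be outdated.
APPROX_VRAM_GB = {"tiny": 1, "base": 1, "small": 2, "medium": 5, "large": 10}


def largest_model_for(vram_gb):
    """Return the largest model whose approximate VRAM need fits, or None."""
    fitting = [name for name, need in APPROX_VRAM_GB.items() if need <= vram_gb]
    # Dicts keep insertion order, so the last fitting entry is the largest model
    return fitting[-1] if fitting else None
```

For instance, `largest_model_for(6)` suggests the medium model for a GPU with 6GB of memory. Accuracy needs still matter, so a smaller model may be the better choice when speed is the priority.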

Let’s make a globally accessible batch file to streamline the process.

However, until recently, talking instead of typing wasn’t optimal for creating documents.

Most voice-to-text solutions produced mediocre results.

You could find a few solutions worth trying, but they were complicated to use, or costly.

Thankfully, Whisper changed all that.