My version of pytorch didn't have CUDA. I had to install conda to get it, and now it's currently installing.
Whatever the default version that `pip install git+https://github.com/openai/whisper.git` grabbed didn't include it by default.
Shocked at how good the results are, and how easy of an installation it is.
Here are the exact steps to follow to get it running on Ubuntu 22.04 via WSL and yt-dlp:
1. pip install git+https://github.com/openai/whisper.git
2. yt-dlp -f 'ba' -x --audio-format mp3 https://www.youtube.com/watch/?v\=bZkNIzeRBk4
3. renamed the file to test.mp3
4. whisper test.mp3 --language Japanese --task translate --model large
Note: the large model will download a ~3Gb fileFor those on NixOS, here's a quick and dirty flake.nix that will let you make a venv in which to "pip install"'
Just put it in a flake.nix, and "nix develop" followed by "virtualenv ./venv; . ./venv/bin/activate; pip install git+https://github.com/openai/whisper.git"
{
description = "Python 3.9 development environment";
outputs = { self, nixpkgs }:
let
system = "x86_64-linux";
pkgs = import nixpkgs { inherit system; };
in {
devShells.${system}.default = pkgs.mkShell {
buildInputs = [
pkgs.ffmpeg
pkgs.python39
pkgs.python39Packages.pip
pkgs.python39Packages.numpy
pkgs.python39Packages.pytorch
pkgs.python39Packages.virtualenv
];
};
};
}