This looks like a fork of alpaca.cpp, which is a fork of llama.cpp. I noticed it doesn't seem to support the ggml weights released with those two.

I’ve also seen mention of ggml in the context of llama and alpaca and other released models. What is ggml? ChatGPT couldn’t help with this question since it’s probably a term that came after its knowledge cutoff :)

It's a tensor library for machine learning, written in C by Georgi Gerganov. It prioritizes inference on Apple hardware and low-resource machines. https://github.com/ggerganov/ggml

llama.cpp and alpaca.cpp (and other derivatives) all require model weights to be converted to the ggml format to run.
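
If you're not sure whether a weights file has already been converted, one rough check is to look at its first four bytes. Here's a minimal Python sketch, assuming the file starts with a little-endian 32-bit magic number as the early llama.cpp-era files did. The magic values below are from memory and the helper names are made up for illustration, so verify them against the version of the code you're actually running.

```python
import struct

# Assumed magic numbers for early ggml-family model files.
# Check these against the loader in your llama.cpp / alpaca.cpp checkout.
KNOWN_MAGICS = {
    0x67676D6C: "ggml (unversioned)",
    0x67676D66: "ggmf (versioned)",
    0x67676A74: "ggjt (versioned)",
}


def identify_ggml_magic(header: bytes):
    """Return a label if the first 4 bytes match a known magic, else None."""
    if len(header) < 4:
        return None
    # The magic is read as a little-endian unsigned 32-bit integer.
    (magic,) = struct.unpack("<I", header[:4])
    return KNOWN_MAGICS.get(magic)


def identify_file(path: str):
    """Read only the first 4 bytes of the file at `path` and classify them."""
    with open(path, "rb") as f:
        return identify_ggml_magic(f.read(4))
```

A `None` result only means the header didn't match one of these values. It says nothing about whether the file can be converted, and a match doesn't guarantee that a given fork will load it.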