Hey guys, we are the ExecuTorch team, and we're excited about launching this! Please ask us anything :)
Is it possible to use ExecuTorch to run a lightweight language model, perhaps this one: https://github.com/facebookresearch/llama, on a smartphone in real time for a chatbot app? Please share some guidance.