What does HackerNews think of PaLM-rlhf-pytorch?
Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM
Language:
Python
#33
in
Deep learning
You might also want to check out https://github.com/lucidrains/PaLM-rlhf-pytorch
Closest you can get is probably with Google T5-Flan [1].
It is not the size of the model or the text it was trained on that makes ChatGPT so performant. It is the additional human assisted training to make it respond well to instructions. Open source versions of that are just starting to see the light of day [2].
Possibly this "PaLM + RLHF - Pytorch" project: