What does HackerNews think of self-instruct?

Aligning pretrained language models with instruction data generated by themselves.

Language: Python

#11 in R

The next generation of AI for developers and Google Workspace | Mar 2023

When they say "Augment your dataset with synthetic data" on https://developers.googleblog.com/2023/03/announcing-palm-ap... do they mean something like this: https://github.com/yizhongw/self-instruct ?

Alpaca: An Instruct Tuned LLaMA 7B – Responses on par with txt-DaVinci-3 | Mar 2023

It says

> We train the Alpaca model on 52K instruction-following demonstrations generated in the style of self-instruct using text-davinci-003

Which leads to self-instruct https://github.com/yizhongw/self-instruct

At a glance, they used an LM to classify instructions and train the model, which IMHO is very similar to RLHF.

Alpaca: A Strong Open-Source Instruction-Following Model | Mar 2023

Self Instruct:

https://arxiv.org/pdf/2212.10560.pdf

https://github.com/yizhongw/self-instruct
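
For readers wondering what "generated in the style of self-instruct" means in practice, here is a minimal sketch of the bootstrapping idea: start from a small seed pool of instructions, ask a model for new ones using a few pool entries as examples, and keep only candidates that are not too similar to what is already in the pool. This is an illustration under stated assumptions, not the repository's code. The names `bootstrap_instructions`, `token_overlap`, and `make_stub_generator` are hypothetical, the generator is a stub standing in for a real LM call, and the similarity check is a simple word-set overlap swapped in for the ROUGE-L filter described in the paper. The 0.7 cutoff follows the paper as I understand it; the other defaults are arbitrary.

```python
import random
from typing import Callable, List, Optional


def token_overlap(a: str, b: str) -> float:
    """Jaccard overlap of lowercase word sets (a simple stand-in for ROUGE-L)."""
    sa, sb = set(a.lower().split()), set(b.lower().split())
    if not sa or not sb:
        return 0.0
    return len(sa & sb) / len(sa | sb)


def bootstrap_instructions(
    seed: List[str],
    generate: Callable[[List[str]], List[str]],
    target_size: int,
    max_similarity: float = 0.7,
    num_examples: int = 3,
    max_rounds: int = 100,
    rng: Optional[random.Random] = None,
) -> List[str]:
    """Grow an instruction pool from seed tasks, rejecting near-duplicates."""
    rng = rng or random.Random(0)
    pool = list(seed)
    for _ in range(max_rounds):
        if len(pool) >= target_size:
            break
        # Show the generator a few existing instructions as in-context examples.
        examples = rng.sample(pool, min(num_examples, len(pool)))
        for candidate in generate(examples):
            candidate = candidate.strip()
            if not candidate:
                continue
            # Keep a candidate only if it is dissimilar to every pooled instruction.
            if all(token_overlap(candidate, kept) < max_similarity for kept in pool):
                pool.append(candidate)
    return pool[:target_size]


def make_stub_generator() -> Callable[[List[str]], List[str]]:
    """Hypothetical stand-in for an LM call: one canned instruction per call,
    plus an exact copy of an example so the filter has something to reject."""
    canned = [
        "Explain how binary search works.",
        "Write a haiku about autumn rain.",
        "Convert 72 degrees Fahrenheit to Celsius.",
        "Name two countries that border Peru.",
    ]
    state = {"i": 0}

    def generate(examples: List[str]) -> List[str]:
        new = canned[state["i"] % len(canned)]
        state["i"] += 1
        return [new, examples[0]]

    return generate


if __name__ == "__main__":
    seeds = [
        "Summarize the following paragraph.",
        "Translate the sentence into French.",
        "List three uses for a brick.",
    ]
    for line in bootstrap_instructions(seeds, make_stub_generator(), target_size=6):
        print(line)
```

In a real pipeline the stub would be replaced by a prompt to an actual model, and the surviving instructions would then be paired with generated inputs and outputs before fine-tuning. Those later steps are not shown here.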