OpenAI thinks their code transformer can already do 37% of all coding “tasks”?

Buzz off.

On a dataset of 164 programming problems: https://arxiv.org/pdf/2107.03374.pdf (Section 2.2). The dataset is here: https://github.com/openai/human-eval