Even OpenAI is blown absent by how ChatGPT has actually been been given. In the organization’s 1st demo, which it gave me the day right before ChatGPT was released on the internet, it had been pitched as an incremental update to InstructGPT. Like that model, ChatGPT was properly trained using reinforcement learning on opinions from human testers