News & Trends

June 23, 2022

Learning to play Minecraft with Video PreTraining

We trained a neural network to play Minecraft by Video PreTraining (VPT) on a massive unlabeled video dataset of human Minecraft play, while using only a small amount of labeled contractor data. With fine-tuning, our model can learn to craft diamond tools, a task that usually takes proficient humans over 20 minutes (24,000 actions). Our…

June 17, 2022

Evolution through large models

June 13, 2022

AI-Written Critiques Help Humans Notice Flaws

We trained “critique-writing” models to describe flaws in summaries. Human evaluators find flaws in summaries much more often when shown our model’s critiques. Larger models are better at self-critiquing, with scale improving critique-writing more than summary-writing. This shows promise for using AI systems to assist human supervision of AI systems on difficult tasks.

June 9, 2022

Techniques for Training Large Neural Networks

Large neural networks are at the core of many recent advances in AI, but training them is a difficult engineering and research challenge which requires orchestrating a cluster of GPUs to perform a single synchronized calculation. As cluster and model sizes have grown, machine learning practitioners have developed an increasing variety of techniques to parallelize…

May 28, 2022

Teaching models to express their uncertainty in words

April 13, 2022

Measuring Goodhart’s law

Goodhart’s law famously says: “When a measure becomes a target, it ceases to be a good measure.” Although originally from economics, it’s something we have to grapple with at OpenAI when figuring out how to optimize objectives that are difficult or costly to measure.

April 13, 2022

Hierarchical text-conditional image generation with CLIP latents

Lessons learned on language model safety and misuse

A research agenda for assessing the economic impacts of code generation models

Solving (some) formal math olympiad problems