Video generation models as world simulators
We explore large-scale training of generative models on video data. Specifically, we train text-conditional diffusion models jointly on videos and images of variable durations, resolutions and aspect ratios. We leverage a transformer architecture that operates on spacetime patches of video and image latent codes. Our largest model, Sora, is capable of generating a minute of…
Read More
Providing the right products at the right time with machine learning
Whether your favorite condiment is Heinz ketchup or your preferred spread for your bagel is Philadelphia cream cheese, ensuring that all customers have access to their preferred products at the right place, at the right price, and at the right time requires careful supply chain organization and distribution. Amid the proliferation of e-commerce and shifting…
Read More
Disrupting malicious uses of AI by state-affiliated threat actors
We terminated accounts associated with state-affiliated threat actors. Our findings show our models offer only limited, incremental capabilities for malicious cybersecurity tasks.
Read More
Why Big Tech’s watermarking plans are some welcome good news
This story originally appeared in The Algorithm, our weekly newsletter on AI. To get stories like this in your inbox first, sign up here. This week I am happy to bring you some encouraging news from the world of AI. Following the depressing Taylor Swift deepfake porn scandal and the proliferation of political deepfakes, such as AI-generated robocalls…
Read More
Memory and new controls for ChatGPT
We’re testing the ability for ChatGPT to remember things you discuss to make future chats more helpful. You’re in control of ChatGPT’s memory.
Read More
Google’s Gemini is now in everything. Here’s how you can try it out.
In the biggest mass-market AI launch yet, Google is rolling out Gemini, its family of large language models, across almost all its products, from Android to the iOS Google app to Gmail to Docs and more. A new subscription plan will also give users access to Gemini Ultra, the most powerful version of the model,…
Read More
A chatbot helped more people access mental-health services
An AI chatbot helped increase the number of patients referred for mental-health services through England’s National Health Service (NHS), particularly among underrepresented groups who are less likely to seek help, new research has found. Demand for mental-health services in England is on the rise, particularly since the covid-19 pandemic. Mental-health services received 4.6 million patient…
Read More
This robot can tidy a room without any help
Robots are good at certain tasks. They’re great at picking up and moving objects, for example, and they’re even getting better at cooking. But while robots may easily complete tasks like these in a laboratory, getting them to work in an unfamiliar environment where there’s little data available is a real challenge. Now, a new…
Read More
Building an early warning system for LLM-aided biological threat creation
We’re developing a blueprint for evaluating the risk that a large language model (LLM) could aid someone in creating a biological threat. In an evaluation involving both biology experts and students, we found that GPT-4 provides at most a mild uplift in biological threat creation accuracy. While this uplift is not large enough to be conclusive,…
Read More
Dear Taylor Swift, we’re sorry about those explicit deepfakes
This story originally appeared in The Algorithm, our weekly newsletter on AI. To get stories like this in your inbox first, sign up here. Hi, Taylor. I can only imagine how you must be feeling after sexually explicit deepfake videos of you went viral on X. Disgusted. Distressed, perhaps. Humiliated, even. I’m really sorry this is…
Read More