
Transforming document understanding and insights with generative AI
At some point over the last two decades, productivity applications enabled humans (and machines!) to create information at the speed of digital—faster than any person could possibly consume or understand it. Modern inboxes and document folders are filled with information: digital haystacks with needles of insight that too often remain undiscovered. Generative AI is an…
Read More
I went for a walk with Gary Marcus, AI’s loudest critic
Gary Marcus meets me outside the post office of Vancouver’s Granville Island wearing neon-coral sneakers and a blue Arcteryx jacket. I’m in town for a family thing, and Marcus has lived in the city since 2018 after 20 years in New York City. “I just find it to be paradise,” he tells me, as I…
Read More
OpenAI teases an amazing new generative video model called Sora
OpenAI has built a striking new generative video model called Sora that can take a short text description and turn it into a detailed, high-definition film clip up to a minute long. Based on four sample videos that OpenAI shared with MIT Technology Review ahead of today’s announcement, the San Francisco-based firm has pushed the…
Read More
Responsible technology use in the AI age
The sudden appearance of application-ready generative AI tools over the last year has confronted us with challenging social and ethical questions. Visions of how this technology could deeply alter the ways we work, learn, and live have also accelerated conversations—and breathless media headlines—about how and whether these technologies can be responsibly used. Responsible technology use,…
Read More
Google’s new version of Gemini can handle far bigger amounts of data
Google DeepMind today launched the next generation of its powerful artificial intelligence model Gemini, which has an enhanced ability to work with large amounts of video, text, and images. It’s an advancement from the three versions of Gemini 1.0 that Google announced back in December, ranging in size and complexity from Nano to Pro to…
Read More
Video generation models as world simulators
We explore large-scale training of generative models on video data. Specifically, we train text-conditional diffusion models jointly on videos and images of variable durations, resolutions and aspect ratios. We leverage a transformer architecture that operates on spacetime patches of video and image latent codes. Our largest model, Sora, is capable of generating a minute of…
Read More
Providing the right products at the right time with machine learning
Whether your favorite condiment is Heinz ketchup or your preferred spread for your bagel is Philadelphia cream cheese, ensuring that all customers have access to their preferred products at the right place, at the right price, and at the right time requires careful supply chain organization and distribution. Amid the proliferation of e-commerce and shifting…
Read More
Disrupting malicious uses of AI by state-affiliated threat actors
We terminated accounts associated with state-affiliated threat actors. Our findings show our models offer only limited, incremental capabilities for malicious cybersecurity tasks.
Read More
Why Big Tech’s watermarking plans are some welcome good news
This story originally appeared in The Algorithm, our weekly newsletter on AI. This week I am happy to bring you some encouraging news from the world of AI. Following the depressing Taylor Swift deepfake porn scandal and the proliferation of political deepfakes, such as AI-generated robocalls…
Read More
Memory and new controls for ChatGPT
We’re testing the ability for ChatGPT to remember things you discuss to make future chats more helpful. You’re in control of ChatGPT’s memory.
Read More