December 9, 2024

How to use Sora, OpenAI’s new video generating tool

MIT Technology Review’s How To series helps you get things done

Today, OpenAI released its video generation model Sora to the public. The announcement comes on the fifth day of the company’s “shipmas” event, a 12-day marathon of tech releases and demos. Here’s what you should know, and how you can use the video model right now.

What is Sora?

Sora is a powerful AI video generation model that can create videos from text prompts, animate images, or remix videos in new styles. OpenAI first previewed the model back in February, but today is the first time the company is releasing it for broader use.

What’s new about this release?

The core function of Sora—creating impressive videos with simple prompts—remains similar to what was previewed in February, but OpenAI worked to make the model faster and cheaper ahead of this wider release. There are a few new features, and two stand out.

One is called Storyboard. With it, you can create multiple AI-generated videos, and then assemble them together on a timeline, much the way you would with conventional video editors like Adobe Premiere Pro. 

The second is a feed that functions as a sort of creative gallery. Users can post their Sora-generated videos to the feed, see the prompts behind certain videos, tweak them, and generally get inspiration, OpenAI says. 

How much can you do with it?

You can generate videos from text prompts, change the style of videos and change elements with a tool called Remix, and assemble multiple clips together with Storyboard. Sora also provides style presets you can apply to your videos, like a moody film noir preset or cardboard and papercraft, which gives a stop-motion feel. You can also trim and loop the videos that you make. 

Who can use it?

You’ll need to subscribe to one of OpenAI’s premium plans to generate videos with Sora, either ChatGPT Plus ($20 per month) or ChatGPT Pro ($200 per month). Both subscriptions include access to other OpenAI products as well. Users with ChatGPT Plus can generate videos as long as 5 seconds with a resolution up to 720p, and can create 50 videos per month. 

Users with a ChatGPT Pro subscription can generate longer, higher-resolution videos, capped at a resolution of 1080p and a duration of 20 seconds. They can also have Sora generate up to five variations of a video at once from a single prompt, which lets them review options faster. Pro users are limited to 500 videos per month, but can also create unlimited “relaxed” videos, which are not generated in the moment but rather queued to generate when site traffic is low.

Both subscription levels can create videos in three aspect ratios: vertical, horizontal, and square. 
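For readers who want to check a planned clip against these limits, the tiers described above can be written down as a small lookup table. This is only an illustrative sketch in Python, not an OpenAI API: the names `PlanLimits`, `PLANS`, and `fits_plan` are made up for this example, and the numbers are simply the ones reported in this article.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class PlanLimits:
    """Hypothetical container for the per-plan limits reported in this article."""
    price_usd_per_month: int
    max_seconds: int
    max_resolution_p: int   # vertical resolution, e.g. 720 for 720p
    videos_per_month: int   # standard (non-"relaxed") generations


# Figures as reported above; Pro's unlimited "relaxed" queue is not modeled here.
PLANS = {
    "plus": PlanLimits(20, 5, 720, 50),
    "pro": PlanLimits(200, 20, 1080, 500),
}

# Both plans support the same three aspect ratios.
ASPECT_RATIOS = ("vertical", "horizontal", "square")


def fits_plan(plan: str, seconds: int, resolution_p: int, aspect: str) -> bool:
    """Return True if a requested clip stays within the named plan's limits."""
    limits = PLANS[plan]
    return (
        seconds <= limits.max_seconds
        and resolution_p <= limits.max_resolution_p
        and aspect in ASPECT_RATIOS
    )


if __name__ == "__main__":
    # A 10-second clip exceeds the Plus cap but fits within Pro.
    print(fits_plan("plus", 10, 720, "square"))
    print(fits_plan("pro", 10, 720, "square"))
```

The sketch leaves out the monthly quota and Pro’s variations feature, since checking those would require usage data that only OpenAI has.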

If you don’t have a subscription, you’ll be limited to viewing the feed of Sora-generated videos. 

OpenAI is starting its global launch of Sora today, but will take longer to launch in “most of Europe,” the company said. 

Where can I access it?

OpenAI has broken Sora out from ChatGPT. To access it, go to Sora.com and log in with your ChatGPT Plus or Pro account. (MIT Technology Review was unable to access the site at press time—a note on the site indicated that signups were paused because they were “currently experiencing heavy traffic”.) 

How’d we get here?

A number of things have happened since OpenAI first unveiled Sora back in February. Other tech companies have also launched video generation tools, like Meta Movie Gen and Google Veo. There’s also been plenty of backlash, including when artists who had early access to experiment with Sora leaked the tool to protest the way OpenAI has trained Sora on artists’ work without compensation.

What’s next?

As with any new release of a model, it remains to be seen what steps OpenAI has taken to prevent Sora from being used for nefarious, illegal, or unethical purposes, such as the creation of deepfakes. On the question of moderation and safety, an OpenAI employee said during the announcement, “we might not get it perfect on day one.”

Another looming question is how much compute and energy Sora will use every time it creates a video. Generating a video uses much more computing time, and therefore energy, than generating a typical text response in a tool like ChatGPT. Since the AI boom has already been an energy hog and has presented a challenge to tech companies aiming to rein in their emissions, the wide availability of Sora and other video models like it has the potential to make that problem worse.
