Video: OpenAI Unveils Next-Level Generative AI Text-to-Video Tool
OpenAI, the company behind the ChatGPT AI chatbot, has unveiled a new platform that transforms text prompts into sophisticated videos nearly indistinguishable from reality.
On Thursday, OpenAI posted videos from its new AI text-to-video model, “Sora,” to social media, showing how the company’s latest software tool generates complex videos of virtually anything imaginable from a written prompt, much as current AI image generators do for still images.
“Sora can create videos of up to 60 seconds featuring highly detailed scenes, complex camera motion, and multiple characters with vibrant emotions,” wrote OpenAI on X.
See videos created by the new “Sora” tool, along with their corresponding text prompts, below (reminder: these videos are 100% AI-generated):
Prompt: “Beautiful, snowy Tokyo city is bustling. The camera moves through the bustling city street, following several people enjoying the beautiful snowy weather and shopping at nearby stalls. Gorgeous sakura petals are flying through the wind along with snowflakes.”
Introducing Sora, our text-to-video model.
Sora can create videos of up to 60 seconds featuring highly detailed scenes, complex camera motion, and multiple characters with vibrant emotions. https://t.co/7j2JN27M3W
Prompt: “Beautiful, snowy… pic.twitter.com/ruTEWn87vf
— OpenAI (@OpenAI) February 15, 2024
Prompt: “Several giant wooly mammoths approach treading through a snowy meadow, their long wooly fur lightly blows in the wind as they walk, snow covered trees and dramatic snow capped mountains in the distance, mid afternoon light with wispy clouds and a sun high in the distance creates a warm glow, the low camera view is stunning capturing the large furry mammal with beautiful photography, depth of field.”
Prompt: “A movie trailer featuring the adventures of the 30 year old space man wearing a red wool knitted motorcycle helmet, blue sky, salt desert, cinematic style, shot on 35mm film, vivid colors.”
Prompt: “A stylish woman walks down a Tokyo street filled with warm glowing neon and animated city signage. she wears a black leather jacket, a long red dress, and black boots, and carries a black purse. she wears sunglasses and red lipstick. she walks confidently and casually. the street is damp and reflective, creating a mirror effect of the colorful lights. many pedestrians walk about.”
Prompt: “Animated scene features a close-up of a short fluffy monster kneeling beside a melting red candle. the art style is 3d and realistic, with a focus on lighting and texture. the mood of the painting is one of wonder and curiosity, as the monster gazes at the flame with wide eyes and open mouth. its pose and expression convey a sense of innocence and playfulness, as if it is exploring the world around it for the first time. the use of warm colors and dramatic lighting further enhances the cozy atmosphere of the image.”
Prompt: “Historical footage of California during the gold rush.”
This is an example of OpenAI’s new text to video product. Prompt: “Historical footage of California during the gold rush” pic.twitter.com/teczQsWzyn
— Historic Vids (@historyinmemes) February 15, 2024
Prompt: “Photorealistic closeup video of two pirate ships battling each other as they sail inside a cup of coffee.”
There are several more examples in higher video quality at OpenAI’s website.
“The current model has weaknesses,” OpenAI wrote in a blog post. “It may struggle with accurately simulating the physics of a complex scene, and may not understand specific instances of cause and effect. For example, a person might take a bite out of a cookie, but afterward, the cookie may not have a bite mark.”
“The model may also confuse spatial details of a prompt, for example, mixing up left and right, and may struggle with precise descriptions of events that take place over time, like following a specific camera trajectory.”
OpenAI says it’s currently working on censoring “misinformation, hateful content, and bias” from the platform before making it available.
We’ll be taking several important safety steps ahead of making Sora available in OpenAI’s products.
We are working with red teamers — domain experts in areas like misinformation, hateful content, and bias — who are adversarially testing the model.
— OpenAI (@OpenAI) February 15, 2024
“We’re also building tools to help detect misleading content such as a detection classifier that can tell when a video was generated by Sora,” the company writes.
The company claims its ultimate goal is to “create safe and beneficial AI” coupled with a “deep understanding of the potential risks and benefits.”
Could new AI technologies like Sora, and their implications, be why OpenAI CEO Sam Altman attended the 69th Bilderberg meeting in 2023, where AI was one of the key topics of discussion?
And what will those implications be once Sora is rolled out on a larger scale?