OpenAI creates Sora, an AI model that generates videos and images from text prompts

14th Apr 2024 | category: AI - Artificial Intelligence

In February 2024, OpenAI, the creator of ChatGPT, unveiled its latest AI model, Sora. The model can generate lifelike and imaginative video scenes from text prompts.

Sora can generate up to a minute of high-fidelity video. According to OpenAI, scaling video generation models such as Sora is a promising path towards building general-purpose simulators of the physical world.

The text-to-video model handles diverse durations, aspect ratios and resolutions, up to high definition, while maintaining visual quality and adherence to the user's prompt.

Monster with melting candle.

Prompt: Animated scene features a close-up of a short fluffy monster kneeling beside a melting red candle. The art style is 3D and realistic, with a focus on lighting and texture. The mood of the painting is one of wonder and curiosity, as the monster gazes at the flame with wide eyes and open mouth. Its pose and expression convey a sense of innocence and playfulness, as if it is exploring the world around it for the first time. The use of warm colors and dramatic lighting further enhances the cozy atmosphere of the image.


OpenAI is currently testing the model, gathering feedback from red teamers as well as visual artists, designers, and filmmakers to refine it and make it more useful for creative professionals.

Here is what the Sora model can do:

  • 3D consistency - Sora can generate videos with dynamic camera motion.
  • Long-range coherence and object permanence - Sora can generate multiple shots of the same character in a single sample, maintaining their appearance throughout the video.
  • Interacting with the world - Sora can sometimes simulate actions that affect the state of the world in simple ways.
  • Simulating digital worlds - the model is able to simulate artificial processes; one example is video games.
  • Image generation capabilities - the model can generate images of variable sizes, up to 2048x2048 resolution.

Tokyo Walk

Prompt: A stylish woman walks down a Tokyo street filled with warm glowing neon and animated city signage. She wears a black leather jacket, a long red dress, and black boots, and carries a black purse. She wears sunglasses and red lipstick. She walks confidently and casually. The street is damp and reflective, creating a mirror effect of the colorful lights. Many pedestrians walk about.

With these capabilities, Sora opens up new possibilities for video creation and simulation, particularly for creative professionals and digital content creators.