Sora video generation Node

Overview

The Sora Video Generation Node enables your flow to create high-quality videos using OpenAI's Sora model. This cutting-edge video generation node transforms text prompts into realistic and imaginative video content, supporting various resolutions, aspect ratios, and durations. Currently, the node supports Azure OpenAI providers only.

Usage cost: 2 credits

Configuration Settings

1. Model Selection

Model* Select Sora model from available providers

2. Video Generation

Prompt* Text description of the video to generate
Resolution* Video quality and pixel height
- 480p: Lower resolution, faster generation
- 720p: Standard HD quality
- 1080p: Full HD quality, highest detail
Aspect Ratio* Video dimensions and orientation
- Square (1:1): Equal width and height, ideal for social media
- Landscape (16:9): Widescreen format for traditional video content
- Portrait (9:16): Vertical format for mobile and social stories
Duration* Video length in seconds
- 5 seconds: Short clips, quick demonstrations
- 10 seconds: Standard short-form content
- 15 seconds: Extended clips with more detail
- 20 seconds: Longer narrative sequences

Outputs

video (Video): Generated video object for use in subsequent nodes
base64_video (string): Base64-encoded video data with data URL format

Best Practices

1. Prompt Engineering

Provide clear, descriptive narratives
Include specific visual and motion elements:
- Subject and action: "A person walking through a forest"
- Camera movement: "Close-up shot panning left to right"
- Environment: "Sunny morning with dappled light"
- Style and mood: "Cinematic, warm lighting, peaceful atmosphere"
Structure prompts for video storytelling:
- "A [subject] [action] in [environment], [camera angle], [lighting], [mood]"
Specify temporal elements:
- Movement direction and speed
- Scene transitions
- Environmental changes

2. Resolution Selection

Use 480p for:
- Rapid prototyping and concept testing
- Draft versions and previews
- Cost-effective generation
Use 720p for:
- Standard content creation
- Social media videos
- Balanced quality and generation time
Use 1080p for:
- Professional content
- Final deliverables
- High-quality presentations

3. Aspect Ratio Selection

Choose 1:1 (Square) for:
- Instagram posts and feeds
- Profile videos
- Platform-agnostic content
Choose 16:9 (Landscape) for:
- YouTube videos
- Website headers
- Traditional video content
Choose 9:16 (Portrait) for:
- TikTok and Instagram Stories
- Mobile-first content
- Vertical video platforms

4. Duration Selection

Use 5 seconds for:
- Quick animations
- Logo reveals
- Simple demonstrations
Use 10 seconds for:
- Standard social media clips
- Product showcases
- Brief narratives
Use 15-20 seconds for:
- Complex scenes
- Detailed storytelling
- Multi-action sequences

5. Variable Integration

All fields support variable interpolation for dynamic content creation.

Prompt: Accepts any string value
Resolution: Must resolve to '480', '720', or '1080'
Aspect Ratio: Must resolve to '1:1', '16:9', or '9:16'
Duration: Must resolve to '5', '10', '15', or '20'

Technical Considerations

File Handling

Videos are returned as MP4 format
Base64 encoding for easy integration

Common Issues

Provider Limitations: Currently only Azure OpenAI is supported; OpenAI direct API is not yet available
Generation Timeouts: Complex scenes or longer durations may approach the 10-minute timeout limit
Network Issues: Large video files may experience download delays
API Rate Limiting: Azure OpenAI quotas may affect generation frequency

Troubleshooting

"Video generation timed out after 10 minutes"

Simplify the prompt to reduce complexity
Try shorter duration settings
Use lower resolution for faster processing

Example Prompts

Simple Scene:

A golden retriever playing in a sunny park, close-up shot, natural lighting, joyful atmosphere

Cinematic Style:

A vintage car driving down a coastal highway at sunset, drone shot following from behind, warm golden hour lighting, peaceful and nostalgic mood

Product Demonstration:

Hands opening a sleek smartphone box on a clean white desk, overhead shot, soft studio lighting, minimalist and professional

Abstract/Artistic:

Colorful paint drops falling into clear water in slow motion, macro lens, high contrast lighting, mesmerizing and fluid motion

PreviousDall-E 3 (image generation) Node NextText-to-Speech (TTS) Node

Last updated 1 month ago