Sora video generation Node

Overview

The Sora Video Generation Node lets your flow create high-quality videos with OpenAI's Sora model. It turns a text prompt into realistic or imaginative video content at a configurable resolution, aspect ratio, and duration. The node currently supports Azure OpenAI providers only.

Usage cost: 2 credits

Configuration Settings

1. Model Selection

  • Model* Select a Sora model from the available providers

2. Video Generation

  • Prompt* Text description of the video to generate

  • Resolution* Video quality and pixel height

    • 480p: Lower resolution, faster generation

    • 720p: Standard HD quality

    • 1080p: Full HD quality, highest detail

  • Aspect Ratio* Video dimensions and orientation

    • Square (1:1): Equal width and height, ideal for social media

    • Landscape (16:9): Widescreen format for traditional video content

    • Portrait (9:16): Vertical format for mobile and social stories

  • Duration* Video length in seconds

    • 5 seconds: Short clips, quick demonstrations

    • 10 seconds: Standard short-form content

    • 15 seconds: Extended clips with more detail

    • 20 seconds: Longer narrative sequences

Outputs

  • video (Video): Generated video object for use in subsequent nodes

  • base64_video (string): Base64-encoded video data in data URL format
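
If the base64_video output is consumed outside the flow (for example, from an API response), it usually has to be decoded back into binary MP4 data. The following is a minimal sketch, assuming the string follows the common data URL shape `data:<mime>;base64,<payload>`; the exact MIME type and the helper name `data_url_to_bytes` are illustrative assumptions, not part of the node.

```python
import base64


def data_url_to_bytes(data_url):
    """Decode the Base64 payload of a data URL (hypothetical helper)."""
    # A data URL is "<header>,<payload>"; the header is assumed to end in ";base64".
    header, sep, payload = data_url.partition(",")
    if not sep or not header.startswith("data:") or not header.endswith(";base64"):
        raise ValueError("expected data:<mime>;base64,<payload>")
    return base64.b64decode(payload)


# Stand-in value for illustration only; a real base64_video output would be much larger.
stand_in_bytes = b"\x00\x00\x00\x18ftypmp42"
sample = "data:video/mp4;base64," + base64.b64encode(stand_in_bytes).decode("ascii")

video_bytes = data_url_to_bytes(sample)
print(len(video_bytes), "bytes decoded")
```

The decoded bytes can then be written to a file opened in binary mode (for example `open("clip.mp4", "wb")`), where the file name is again only an example.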

Best Practices

1. Prompt Engineering

  • Provide clear, descriptive narratives

  • Include specific visual and motion elements:

    • Subject and action: "A person walking through a forest"

    • Camera movement: "Close-up shot panning left to right"

    • Environment: "Sunny morning with dappled light"

    • Style and mood: "Cinematic, warm lighting, peaceful atmosphere"

  • Structure prompts for video storytelling:

    • "A [subject] [action] in [environment], [camera angle], [lighting], [mood]"

  • Specify temporal elements:

    • Movement direction and speed

    • Scene transitions

    • Environmental changes
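
The prompt template above can be filled programmatically when the parts come from different sources. This is a small sketch; the `VideoPrompt` class and its field names are hypothetical and simply mirror the placeholders in the template.

```python
from dataclasses import dataclass


@dataclass
class VideoPrompt:
    """Hypothetical container for the parts named in the prompt template."""
    subject: str
    action: str
    environment: str
    camera: str
    lighting: str
    mood: str

    def render(self):
        # Mirrors: "A [subject] [action] in [environment], [camera angle], [lighting], [mood]"
        return (
            f"A {self.subject} {self.action} in {self.environment}, "
            f"{self.camera}, {self.lighting}, {self.mood}"
        )


prompt = VideoPrompt(
    subject="golden retriever",
    action="playing",
    environment="a sunny park",
    camera="close-up shot",
    lighting="natural lighting",
    mood="joyful atmosphere",
).render()
print(prompt)
```

Keeping the parts separate makes it easier to vary one element, such as the camera angle, while holding the rest of the scene constant.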

2. Resolution Selection

  • Use 480p for:

    • Rapid prototyping and concept testing

    • Draft versions and previews

    • Cost-effective generation

  • Use 720p for:

    • Standard content creation

    • Social media videos

    • Balanced quality and generation time

  • Use 1080p for:

    • Professional content

    • Final deliverables

    • High-quality presentations

3. Aspect Ratio Selection

  • Choose 1:1 (Square) for:

    • Instagram posts and feeds

    • Profile videos

    • Platform-agnostic content

  • Choose 16:9 (Landscape) for:

    • YouTube videos

    • Website headers

    • Traditional video content

  • Choose 9:16 (Portrait) for:

    • TikTok and Instagram Stories

    • Mobile-first content

    • Vertical video platforms

4. Duration Selection

  • Use 5 seconds for:

    • Quick animations

    • Logo reveals

    • Simple demonstrations

  • Use 10 seconds for:

    • Standard social media clips

    • Product showcases

    • Brief narratives

  • Use 15-20 seconds for:

    • Complex scenes

    • Detailed storytelling

    • Multi-action sequences

5. Variable Integration

All fields support variable interpolation for dynamic content creation.

  • Prompt: Accepts any string value

  • Resolution: Must resolve to '480', '720', or '1080' (the numeric value only, as listed here)

  • Aspect Ratio: Must resolve to '1:1', '16:9', or '9:16'

  • Duration: Must resolve to '5', '10', '15', or '20'
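
When these fields are driven by variables, it can help to check the resolved values before the node runs. The sketch below assumes the values arrive as a plain dictionary; the key names (`prompt`, `resolution`, `aspect_ratio`, `duration`) and the `validate_settings` helper are illustrative assumptions, and the empty-prompt check is an extra safeguard rather than a documented rule.

```python
# Allowed values as listed above, kept as strings because variables resolve to text.
ALLOWED = {
    "resolution": ("480", "720", "1080"),
    "aspect_ratio": ("1:1", "16:9", "9:16"),
    "duration": ("5", "10", "15", "20"),
}


def validate_settings(resolved):
    """Return a list of problems; an empty list means every value is acceptable."""
    problems = []
    # Assumption: a blank prompt is treated as an error because Prompt is required.
    if not str(resolved.get("prompt", "")).strip():
        problems.append("prompt is empty")
    for field, allowed in ALLOWED.items():
        value = str(resolved.get(field, "")).strip()
        if value not in allowed:
            problems.append(f"{field}={value!r} is not one of {list(allowed)}")
    return problems


ok = validate_settings(
    {"prompt": "A vintage car at sunset", "resolution": "720",
     "aspect_ratio": "16:9", "duration": "10"}
)
bad = validate_settings(
    {"prompt": "A vintage car at sunset", "resolution": "720p",
     "aspect_ratio": "16:9", "duration": "10"}
)
print(ok)
print(bad)
```

In this sketch a value such as '720p' is reported as a problem, since it is not one of the listed strings.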

Technical Considerations

File Handling

  • Videos are returned in MP4 format

  • A Base64-encoded copy is also provided for easy integration

Common Issues

  • Provider Limitations: Only Azure OpenAI is currently supported; the direct OpenAI API is not yet available

  • Generation Timeouts: Complex scenes or longer durations may approach the 10-minute timeout limit

  • Network Issues: Large video files may take longer to download

  • API Rate Limiting: Azure OpenAI quotas may affect generation frequency

Troubleshooting

"Video generation timed out after 10 minutes"

  • Simplify the prompt to reduce complexity

  • Try shorter duration settings

  • Use lower resolution for faster processing
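
The last two suggestions can be turned into a simple fallback sequence for retries after a timeout. This is only a sketch of one possible policy: the `lighter_settings` helper is hypothetical, and shortening the duration before lowering the resolution is an arbitrary ordering choice, not a documented recommendation.

```python
RESOLUTIONS = ["480", "720", "1080"]      # lightest to heaviest
DURATIONS = ["5", "10", "15", "20"]       # shortest to longest


def lighter_settings(resolution, duration):
    """Yield progressively lighter (resolution, duration) pairs to retry with."""
    r = RESOLUTIONS.index(resolution)
    d = DURATIONS.index(duration)
    while r > 0 or d > 0:
        # Step the duration down first; once it is minimal, step the resolution down.
        if d > 0:
            d -= 1
        else:
            r -= 1
        yield RESOLUTIONS[r], DURATIONS[d]


fallbacks = list(lighter_settings("720", "10"))
print(fallbacks)
```

If the sequence is exhausted and generation still times out, simplifying the prompt is the remaining option from the list above.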

Example Prompts

Simple Scene:

A golden retriever playing in a sunny park, close-up shot, natural lighting, joyful atmosphere

Cinematic Style:

A vintage car driving down a coastal highway at sunset, drone shot following from behind, warm golden hour lighting, peaceful and nostalgic mood

Product Demonstration:

Hands opening a sleek smartphone box on a clean white desk, overhead shot, soft studio lighting, minimalist and professional

Abstract/Artistic:

Colorful paint drops falling into clear water in slow motion, macro lens, high contrast lighting, mesmerizing and fluid motion

Last updated 1 day ago