Waterflai
  • Welcome to Waterflai
  • Getting Started
    • Concepts
    • Quickstart
  • Providers
    • Providers Overview
    • Providers setup
    • AI models
    • Choose the right models
  • Knowledge
    • Knowledge Overview
    • Knowledge connectors
    • Knowledge collections
  • Studio
    • Studio Overview
    • Studio Builders
      • Light Builder
      • Dream Builder
      • Workflow Builder
      • Flow components (nodes)
        • Input Node
        • Output Node
        • LLM model Node
        • Multimodal LLM Node
        • Dall-E 2 (image generation) Node
        • Dall-E 3 (image generation) Node
        • Sora video generation Node
        • Text-to-Speech (TTS) Node
        • Speech-to-Text (STT) Node
        • OCR Node
        • Agent Node
        • Reranker Node
        • Knowledge retrieval Node
        • Vector store insert Node
        • Vector store record delete Node
        • Gitbook loader
        • Notion Database Node
        • Figma Node
        • Webpage scraper Node
        • Sitemap Scraper Node
        • API Request Node
        • Document metadata extraction Node
        • Document metadata update Node
        • Character splitter Node
        • HTML splitter Node
        • Markdown Splitter
        • Calculator tool Node
        • Text as tool Node
        • Knowledge retrieval tool Node
        • Conditional Node
        • Iteration loop Node
      • Testing and Debugging
    • Publishing
    • Integration with API
    • Embedding in website
  • Analytics
    • Analytics Overview
    • Dashboards
    • Logs
  • Administration
    • Organization users
    • Workspace
    • Security and permissions
  • Troubleshooting
    • Support
Powered by GitBook
On this page
  • Overview
  • Outputs
  • Best Practices
  • Common Issues
  1. Studio
  2. Studio Builders
  3. Flow components (nodes)

Dall-E 3 (image generation) Node

Overview

The DALL-E 3 Node enables your flow to generate high-quality images using OpenAI's DALL-E 3 model. This advanced image generation node offers enhanced capabilities including better prompt interpretation, more detailed outputs, and additional customization options compared to its predecessor. The node supports both OpenAI and Azure OpenAI providers.

Usage cost: 2 credits

Configuration Settings

  1. Model Selection

    • Model*: Select DALL-E 3 model from available providers

    • Supported Providers:

      • OpenAI

      • Azure OpenAI

  2. Image Generation

    • Prompt*: Text description of the image to generate

    • Quality*: Image quality setting

      • Standard: Regular quality, faster generation

      • HD: Higher detail and clarity, longer generation time

    • Size*: Output resolution options

      • 1024x1024: Square format

      • 1792x1024: Landscape format

      • 1024x1792: Portrait format

    • Style*: Image rendering style

      • Vivid: Enhanced colors and contrast

      • Natural: More photorealistic appearance

Outputs

  • image (Image): Generated image object for use in subsequent nodes

  • base64_image (string): Base64-encoded image data

Best Practices

  1. Prompt Engineering

    • Provide detailed, clear descriptions

    • Include specific visual elements:

      • Subject matter and composition

      • Art style and medium

      • Lighting and atmosphere

      • Color palette and mood

    • Use natural language rather than keyword lists

    • Specify important details first

    • Example prompt structure:

      • "A professional photograph of [subject] in [setting], [lighting], [style]"

  2. Quality Selection

    • Use Standard quality for:

      • Rapid prototyping

      • Draft versions

      • Cost-effective generation

    • Use HD quality for:

      • Final deliverables

      • Detailed artwork

      • Professional content

  3. Size Selection

    • Choose 1024x1024 for:

      • Social media posts

      • Profile pictures

      • Universal compatibility

    • Choose 1792x1024 for:

      • Landscape photography

      • Banner images

      • Wide-format content

    • Choose 1024x1792 for:

      • Mobile content

      • Story formats

      • Vertical layouts

  4. Style Selection

    • Use Vivid for:

      • Digital art

      • Marketing materials

      • Eye-catching visuals

    • Use Natural for:

      • Product photography

      • Realistic scenarios

      • Professional documentation

Common Issues

  • API rate limiting and quotas

  • Generation timeouts with HD quality

  • Network issues during image retrieval

PreviousDall-E 2 (image generation) NodeNextSora video generation Node

Last updated 3 months ago