Node Inputs
Required Fields
- Image File: Upload an image or PDF (JPG, PNG, GIF, WEBP, or PDF)
- Prompt: Question or instruction for the analysis. A detailed prompt produces more accurate output
Optional Fields
- Use Link: Enable to use direct image URLs
  - Only supports publicly accessible media links (e.g., https://example.com/image.jpg)
  - Does not support Google Drive, Dropbox, or other file-sharing links
  - URL must point directly to the image file
- Temperature: Controls analysis creativity (0-1)
  - 0: More focused, consistent
  - 1: More creative, varied
- Cache Response: Save responses for reuse
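The Use Link rules above can be pre-checked before a URL is passed to the node. The following is a minimal sketch, not part of the product: `looks_like_direct_media_link`, the extension list, and the host blocklist are all hypothetical names chosen for illustration. It assumes a direct link ends in one of the image extensions listed under Image File (with `.jpeg` added as a common variant of JPG), and it cannot confirm that a link is actually publicly accessible.

```python
from urllib.parse import urlparse

# Assumption: image extensions from the "Image File" formats, plus ".jpeg".
IMAGE_EXTENSIONS = (".jpg", ".jpeg", ".png", ".gif", ".webp")

# Assumption: an illustrative, incomplete blocklist of file-sharing hosts.
FILE_SHARING_HOSTS = ("drive.google.com", "dropbox.com")


def looks_like_direct_media_link(url: str) -> bool:
    """Heuristic check that a URL points directly at an image file."""
    parsed = urlparse(url)
    if parsed.scheme not in ("http", "https") or not parsed.hostname:
        return False
    host = parsed.hostname.lower()
    # Reject the listed file-sharing hosts and their subdomains.
    if any(host == h or host.endswith("." + h) for h in FILE_SHARING_HOSTS):
        return False
    # The path (not the query string) must end in an image extension.
    return parsed.path.lower().endswith(IMAGE_EXTENSIONS)


print(looks_like_direct_media_link("https://example.com/image.jpg"))
print(looks_like_direct_media_link("https://www.dropbox.com/s/abc/image.jpg"))
```

A check like this only catches obviously unsuitable links early; a URL that passes can still fail if the server requires a login or returns an HTML page.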
Show As Input
The node lets you configure certain parameters as dynamic inputs. You can enable these in the “Configure Inputs” section:
- Use Link: Boolean
  - true/false to use an image URL instead of a file upload
  - When enabled, allows input of publicly accessible image URLs
  - Remember: Only direct media links are supported
- Prompt: String
  - The specific question or instruction for analyzing the image
  - Example: “Describe the main objects in this image”
- image_model_preference: String
  - Name of the AI model to use for image analysis
  - Accepted values: “GPT-4.1”, “Claude 3.5 Haiku”, etc.
- Cache Response: Boolean
  - true/false to enable/disable response caching
  - Helps reduce API calls for identical inputs
- Temperature: Number
  - Value between 0 and 1
  - Controls analysis consistency and creativity
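When an upstream step supplies these dynamic inputs, it can help to check their types and ranges first. The sketch below is illustrative only: the dictionary layout and the `validate_dynamic_inputs` helper are assumptions, not the node's actual API. Only the input names, types, and the 0 to 1 temperature range come from the list above.

```python
# Input names and types as listed under "Show As Input".
EXPECTED_TYPES = {
    "Use Link": bool,
    "Prompt": str,
    "image_model_preference": str,
    "Cache Response": bool,
    "Temperature": (int, float),
}


def validate_dynamic_inputs(inputs: dict) -> list:
    """Return a list of problems; an empty list means the values look valid."""
    problems = []
    for name, value in inputs.items():
        expected = EXPECTED_TYPES.get(name)
        if expected is None:
            problems.append(f"{name}: unknown input")
            continue
        # bool is a subclass of int in Python, so exclude it from Number explicitly.
        is_bool_as_number = expected == (int, float) and isinstance(value, bool)
        if is_bool_as_number or not isinstance(value, expected):
            problems.append(f"{name}: wrong type {type(value).__name__}")
            continue
        if name == "Temperature" and not 0 <= value <= 1:
            problems.append("Temperature: must be between 0 and 1")
    return problems


example = {
    "Use Link": True,
    "Prompt": "Describe the main objects in this image",
    "image_model_preference": "GPT-4.1",
    "Cache Response": False,
    "Temperature": 0.2,
}
print(validate_dynamic_inputs(example))
```

The helper does not check `image_model_preference` against a model list, because the accepted values are only partially listed here.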
Node Output
- Analysis: AI’s detailed response about the image
Node Functionality
The Analyze Image node can:
- Process images with AI vision
- Extract text from images
- Generate descriptions
- Answer queries about content
- Identify objects and scenes
- Read image-based PDFs
Available AI Models
- OpenAI o3
- GPT-4.1 vision
- GPT-4.1 mini
- GPT-4.1 nano
- OpenAI o4-mini
- Claude 3.7 Sonnet
- Claude 3.7 Sonnet Thinking
- Grok 2 Vision
- Claude 3.5 Haiku
- Gemini 2.5 Pro
- Gemini 2.5 Flash
AI Model Selection Guide
When choosing an AI model for your task, consider these key factors:
- Task complexity and required accuracy
- Response time requirements
- Cost considerations
- Consistency needs across runs
- Specialized knowledge requirements

Model Type | Ideal Use Cases | Considerations |
---|---|---|
Standard Models | General content creation, basic Q&A, simple analysis | Lower cost, faster response time, good for most everyday tasks |
Advanced Models | Complex analysis, nuanced content, specialized knowledge domains | Better quality but higher cost, good balance of performance and efficiency |
Expert & Thinking-Enabled Models | Complex reasoning, step-by-step problem-solving, coding, detailed analysis, math problems, technical content | Highest quality but most expensive, best for complex and long-form tasks, longer response time |

Related provider documentation:
- Anthropic Models Overview
- Anthropic Extended Thinking Documentation
- OpenAI Reasoning Guide
- OpenAI o3 Models
Common Use Cases
- Text Extraction
- Visual Description
- Object Detection
Important Considerations
- Expert AI models (e.g., OpenAI o3) cost 30 credits per run, advanced models (GPT-4.1 and Claude 3.7) cost 20 credits per run, and standard models cost 2 credits per run.
- You can reduce the cost to 1 credit per run by providing your own API key on the credentials page.
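The credit figures above make it straightforward to estimate the cost of a batch of runs. This is a rough sketch: `credits_for_runs` and the tier keys are hypothetical names, and it assumes the 1-credit rate with your own API key applies to every tier, which the text implies but does not state explicitly.

```python
# Credits per run by model tier, as stated above.
CREDITS_PER_RUN = {"expert": 30, "advanced": 20, "standard": 2}

# Assumption: with your own API key, every tier costs 1 credit per run.
OWN_KEY_CREDITS_PER_RUN = 1


def credits_for_runs(tier: str, runs: int, own_api_key: bool = False) -> int:
    """Estimate the total credits for a number of runs on one model tier."""
    if tier not in CREDITS_PER_RUN:
        raise ValueError(f"unknown tier: {tier}")
    if runs < 0:
        raise ValueError("runs must be non-negative")
    per_run = OWN_KEY_CREDITS_PER_RUN if own_api_key else CREDITS_PER_RUN[tier]
    return per_run * runs


print(credits_for_runs("expert", 10))                       # 300
print(credits_for_runs("standard", 10))                     # 20
print(credits_for_runs("advanced", 10, own_api_key=True))   # 10
```

The estimate covers platform credits only; charges billed by the model provider for your own API key are separate and not modeled here.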