Extract Data

This document explains the Extract Data node, which uses AI to pull specific information from text content.

Node Inputs

Required Fields

Text: Content to extract data from (documents, scraped website content, etc.)
Data Fields to Extract: Define what you want to extract:
- Name: Label for the data (e.g., “location”)
- Type: Format of the data (text/number/boolean)
- Description: Help the AI understand what to extract
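
As a rough illustration, one entry under Data Fields to Extract could be modeled as below. The `FieldSpec` class and `matches_type` helper are hypothetical names made up for this sketch and are not part of the product. Only the three properties (name, type, description) and the three types (text/number/boolean) come from the list above.

```python
from dataclasses import dataclass

# Hypothetical mapping from the documented types to Python types.
ALLOWED_TYPES = {"text": str, "number": (int, float), "boolean": bool}


@dataclass(frozen=True)
class FieldSpec:
    """One field definition: a label, a type, and a description for the AI."""
    name: str
    type: str
    description: str

    def __post_init__(self):
        if self.type not in ALLOWED_TYPES:
            raise ValueError(f"unsupported type: {self.type!r}")


def matches_type(spec: FieldSpec, value) -> bool:
    """Check that an extracted value fits the declared type."""
    if spec.type == "number":
        # bool is a subclass of int in Python, so exclude it explicitly.
        return isinstance(value, (int, float)) and not isinstance(value, bool)
    return isinstance(value, ALLOWED_TYPES[spec.type])


location = FieldSpec(
    "location", "text", "City and country where the company is headquartered"
)
founded = FieldSpec(
    "founded_year", "number", "Four-digit year the company was founded"
)
```

The descriptions in the example are deliberately specific ("four-digit year", "city and country"), since the description is the main hint the AI gets about what to return.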

Optional Fields

Extract List: Enable to get multiple items instead of single values
Additional Context: Extra information to guide the extraction
Temperature: Controls AI creativity (0-1)
- 0: More focused, consistent
- 1: More creative, varied
Cache Response: Save responses for reuse
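
How the node keys its cache is not documented here. One plausible sketch, assuming a cached response is reused only when the text, field definitions, and settings are all identical, is to hash those inputs together. The names `cache_key`, `extract_with_cache`, and `run_extraction` are hypothetical.

```python
import hashlib
import json

# In-memory store for this sketch; a real cache would live elsewhere.
_cache = {}


def cache_key(text, fields, settings):
    """Build a stable key from every input that could change the result."""
    payload = json.dumps(
        {"text": text, "fields": fields, "settings": settings}, sort_keys=True
    )
    return hashlib.sha256(payload.encode("utf-8")).hexdigest()


def extract_with_cache(text, fields, settings, run_extraction):
    """Call run_extraction only when these exact inputs have not been seen."""
    key = cache_key(text, fields, settings)
    if key not in _cache:
        _cache[key] = run_extraction(text, fields, settings)
    return _cache[key]
```

Under this assumption, changing any input (including Temperature) produces a new key and therefore a fresh AI call.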

Show As Input

The node allows you to configure certain parameters as dynamic inputs. You can enable these in the “Configure Inputs” section:

Extract List?: Boolean
- true/false to enable/disable list extraction
- When enabled, extracts data as a list of items
Additional Context: String
- Extra information to guide the extraction process
- Example: “The text contains company names and their founding years”
model_preference: String
- Name of the AI model to use
- Accepted values: “Claude 3.7 Sonnet”, “Claude 3.5 Haiku”, “GPT-4.1”, “GPT-4.1 Mini”, etc.
Cache Response: Boolean
- true/false to enable/disable response caching
- Helps reduce API calls for identical inputs
Temperature: Number
- Value between 0 and 1
- Controls extraction consistency

When enabled as inputs, these parameters can be dynamically set by previous nodes in your workflow. If not enabled, the values set in the node configuration will be used.
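
The fallback rule above can be sketched as a small merge step. This is an illustration only: the dictionary keys, the `resolve_settings` function, and the default values are assumptions for the example, and the sketch also assumes that an enabled input with no upstream value falls back to the configured one.

```python
# Hypothetical values representing what is set in the node configuration.
CONFIGURED = {
    "extract_list": False,
    "additional_context": "",
    "model_preference": "GPT-4.1 Mini",
    "cache_response": False,
    "temperature": 0.0,
}


def resolve_settings(configured, enabled_inputs, upstream):
    """Use an upstream value only for parameters enabled as inputs."""
    resolved = dict(configured)
    for key in enabled_inputs:
        if key in upstream:
            resolved[key] = upstream[key]
    # Temperature is documented as a value between 0 and 1.
    if not 0 <= resolved["temperature"] <= 1:
        raise ValueError("temperature must be between 0 and 1")
    return resolved


settings = resolve_settings(
    CONFIGURED,
    enabled_inputs={"extract_list", "temperature"},
    upstream={
        "extract_list": True,
        "temperature": 0.2,
        "model_preference": "Grok 3",  # ignored: not enabled as an input
    },
)
```

Here `extract_list` and `temperature` take the upstream values, while `model_preference` keeps its configured value because it was not enabled as an input.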

Node Output

Extracted Data Fields: Single value or list based on your settings
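
The exact output shape is not specified here. Assuming each defined field becomes one output that holds either a single value or a list of values, downstream code can normalize both cases as in this sketch. The field names and the `as_list` helper are hypothetical.

```python
# Assumed shape with Extract List disabled: one value per field.
single_output = {"company": "Acme Corp", "founded_year": 1998}

# Assumed shape with Extract List enabled: a list of values per field.
list_output = {"company": ["Acme Corp", "Globex"], "founded_year": [1998, 2004]}


def as_list(value):
    """Wrap a single value in a list so both shapes share one code path."""
    return value if isinstance(value, list) else [value]


companies = as_list(single_output["company"])
years = as_list(list_output["founded_year"])
```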

Node Functionality

The Extract Data node:

- Analyzes text using AI
- Finds specific information
- Returns structured data
- Handles single or multiple items
- Supports various data types

Available AI Models

Claude 3.7 Sonnet
Claude 3.5 Haiku
OpenAI o3
OpenAI o4-mini
GPT-4.1
GPT-4.1 Mini
DeepSeek V3
DeepSeek R1
Gemini 2.5 Pro
Gemini 2.5 Flash
Grok 3
Grok 3 Mini
Azure OpenAI
And more

Note: Auto-Select uses a third-party model routing service to choose a model automatically based on cost, performance, and availability. Avoid it when you need consistent behavior from one specific model.

Example Use Cases

Contact Information:

Extract: Email, Phone, Address
From: Company websites or documents

Product Details:

Extract: Price, Features, Specifications
From: Product descriptions

Data Extraction from Documents:

Extract: Date, invoice amount, vendor, address
From: Financial documents or invoices

Important Considerations

Credit cost per run depends on the model tier: expert models (OpenAI o3) cost 30 credits, advanced models (GPT-4.1, Claude 3.7, and Grok 4) cost 20 credits, and standard models cost 2 credits.
Providing your own API key on the credentials page drops the cost to 1 credit per run.
Enable “Extract List” when you need multiple items
Be specific in your data descriptions for accurate outputs
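
To estimate usage, the credit figures above can be put into a small calculator. The tier labels and the `credits_for` function are names used only in this sketch, and it assumes the 1-credit rate with your own API key applies to every tier.

```python
# Credits per run as listed above; the tier keys are labels for this sketch.
CREDITS_PER_RUN = {"expert": 30, "advanced": 20, "standard": 2}
OWN_API_KEY_CREDITS = 1


def credits_for(runs, tier, own_api_key=False):
    """Estimate total credits for a number of runs on one model tier."""
    per_run = OWN_API_KEY_CREDITS if own_api_key else CREDITS_PER_RUN[tier]
    return runs * per_run


advanced_total = credits_for(500, "advanced")
with_own_key = credits_for(500, "advanced", own_api_key=True)
```

For 500 runs on an advanced model, this gives 10,000 credits without your own key and 500 credits with it.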

In summary, the Extract Data node is your tool for pulling structured information from unstructured text, whether you need single values or lists of data.
