Extract Data
This document explains the Extract Data node, which uses AI to pull specific information from text content.
Node Inputs
Required Fields
- Text: Content to extract data from (documents, scraped website content, etc.)
- Data Fields to Extract: Define what you want to extract:
  - Name: Label for the data (e.g., “location”)
  - Type: Format of the data (text/number/boolean)
  - Description: Help the AI understand what to extract
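As a rough illustration of the three parts of a data field, the sketch below models a field definition in Python. The `DataField` class and `ALLOWED_TYPES` set are hypothetical names invented for this example; they are not the node's actual schema, which is configured in the node's interface.

```python
from dataclasses import dataclass

# Hypothetical constant mirroring the text/number/boolean options listed above.
ALLOWED_TYPES = {"text", "number", "boolean"}


@dataclass(frozen=True)
class DataField:
    """One entry in "Data Fields to Extract" (illustrative only)."""
    name: str
    type: str
    description: str

    def __post_init__(self):
        # Reject empty labels and types outside the three documented formats.
        if not self.name.strip():
            raise ValueError("name must not be empty")
        if self.type not in ALLOWED_TYPES:
            raise ValueError(f"type must be one of {sorted(ALLOWED_TYPES)}, got {self.type!r}")


# Example field definitions; the more specific the description, the better.
fields = [
    DataField("location", "text", "City and country where the company is headquartered"),
    DataField("founding_year", "number", "Four-digit year the company was founded"),
    DataField("is_public", "boolean", "True if the company is publicly traded"),
]
```

Each description states exactly what to look for, which is what guides the AI toward the right value.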
Optional Fields
- Extract List: Enable to get multiple items instead of single values
- Additional Context: Extra information to guide the extraction
- Temperature: Controls AI creativity (0-1)
  - 0: More focused, consistent
  - 1: More creative, varied
- Cache Response: Save responses for reuse
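To make the idea behind Cache Response concrete, here is a minimal sketch of caching keyed on the inputs. It assumes that identical inputs should return the saved response; the `cache_key` and `extract_cached` helpers and the in-memory `_cache` are hypothetical, and the node's real caching mechanism is not documented here.

```python
import hashlib
import json

_cache = {}  # in-memory store; a hypothetical stand-in for the node's cache


def cache_key(inputs):
    """Build a stable key: identical inputs always hash to the same string."""
    canonical = json.dumps(inputs, sort_keys=True)
    return hashlib.sha256(canonical.encode("utf-8")).hexdigest()


def extract_cached(inputs, run_extraction):
    """Call run_extraction only when these exact inputs have not been seen."""
    key = cache_key(inputs)
    if key not in _cache:
        _cache[key] = run_extraction(inputs)
    return _cache[key]


calls = []  # records every real (non-cached) extraction


def fake_extraction(inputs):
    # Stand-in for the AI call; returns made-up data for the demo.
    calls.append(inputs["text"])
    return {"location": "Lisbon"}


request = {
    "text": "Acme is based in Lisbon.",
    "fields": [{"name": "location", "type": "text"}],
    "temperature": 0,
}
first = extract_cached(request, fake_extraction)
second = extract_cached(dict(request), fake_extraction)  # same inputs, served from cache
print(len(calls))  # 1
```

Changing any input, such as the text or the temperature, produces a different key and therefore a fresh extraction.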
Show As Input
The node allows you to configure certain parameters as dynamic inputs. You can enable these in the “Configure Inputs” section:
- Extract List?: Boolean
  - true/false to enable/disable list extraction
  - When enabled, extracts data as a list of items
- Additional Context: String
  - Extra information to guide the extraction process
  - Example: “The text contains company names and their founding years”
- model_preference: String
  - Name of the AI model to use
  - Accepted values: “Claude 3.5 Sonnet”, “Claude 3 Haiku”, “GPT-4o”, “GPT-4o Mini”, etc.
- Cache Response: Boolean
  - true/false to enable/disable response caching
  - Helps reduce API calls for identical inputs
- Temperature: Number
  - Value between 0 and 1
  - Controls extraction consistency
When enabled as inputs, these parameters can be dynamically set by previous nodes in your workflow. If not enabled, the values set in the node configuration will be used.
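The fallback rule described above can be sketched as a small function. This is only an illustration: `resolve_parameters`, the snake_case key names, and the behaviour when an enabled input receives no upstream value (falling back to the configured value) are assumptions, not documented behaviour.

```python
# Values set in the node configuration (illustrative key names).
node_config = {
    "extract_list": False,
    "additional_context": "",
    "model_preference": "GPT-4o Mini",
    "cache_response": True,
    "temperature": 0.0,
}


def resolve_parameters(config, enabled_inputs, upstream_values):
    """Use the upstream value for parameters enabled as inputs, else the configured one."""
    resolved = dict(config)
    for name in enabled_inputs:
        # Assumption: an enabled input with no upstream value keeps the configured value.
        if name in upstream_values:
            resolved[name] = upstream_values[name]
    if not 0 <= resolved["temperature"] <= 1:
        raise ValueError("temperature must be between 0 and 1")
    return resolved


params = resolve_parameters(
    node_config,
    enabled_inputs={"extract_list", "temperature"},
    upstream_values={
        "extract_list": True,
        "temperature": 0.2,
        "model_preference": "Claude 3 Haiku",  # ignored: not enabled as an input
    },
)
```

Here `extract_list` and `temperature` take the upstream values, while `model_preference` stays at the configured "GPT-4o Mini" because it was not enabled as an input.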
Node Output
- Extracted Data Fields: Single value or list based on your settings
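Because the output can be either a single value or a list, a downstream step may want to treat both shapes uniformly. The sketch below assumes the extracted fields arrive as a dictionary (single item) or a list of dictionaries (Extract List enabled); that shape, the sample values, and the `as_items` helper are all hypothetical.

```python
def as_items(result):
    """Return a list of items whether the node produced one item or many."""
    return result if isinstance(result, list) else [result]


# Made-up sample outputs for illustration only.
single = {"location": "Lisbon", "founding_year": 2011}
many = [
    {"location": "Lisbon", "founding_year": 2011},
    {"location": "Osaka", "founding_year": 1998},
]

for item in as_items(single) + as_items(many):
    print(item["location"], item["founding_year"])
```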
Node Functionality
The Extract Data node:
- Analyzes text using AI
- Finds specific information
- Returns structured data
- Handles single or multiple items
- Supports various data types
Available AI Models
- Claude 3.5 Sonnet
- Claude 3 Haiku
- OpenAI o1
- OpenAI o1 mini
- GPT-4o
- GPT-4o Mini
- DeepSeek V3
- DeepSeek R1
- Gemini 1.5 Pro/Flash
- And more
Example Use Cases
- Contact Information:
- Product Details:
- Data Extraction from Documents:
Important Considerations
- Credit cost per run depends on the model tier: expert models (OpenAI o1) cost 30 credits, advanced models (GPT-4o and Claude 3.5) cost 20 credits, and standard models cost 2 credits
- You can reduce the credit cost to 1 by providing your own API key on the credentials page
- Enable “Extract List” when you need multiple items
- Be specific in your data descriptions for accurate outputs
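For budgeting, the credit figures above can be turned into a quick estimate. The tier names and the `credits_for` function are hypothetical, and the sketch assumes the reduced cost of 1 credit applies per run to every tier when you supply your own API key.

```python
# Credits per run, taken from the figures listed above.
CREDITS_PER_RUN = {"expert": 30, "advanced": 20, "standard": 2}
OWN_KEY_CREDITS = 1  # assumed to apply per run, regardless of tier


def credits_for(tier, runs, own_api_key=False):
    """Estimate total credits for a number of runs on a given model tier."""
    per_run = OWN_KEY_CREDITS if own_api_key else CREDITS_PER_RUN[tier]
    return per_run * runs


print(credits_for("advanced", 50))                    # 1000
print(credits_for("advanced", 50, own_api_key=True))  # 50
print(credits_for("standard", 50))                    # 100
```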
In summary, the Extract Data node is your tool for pulling structured information from unstructured text, whether you need single values or lists of data.