This document explains the Extract Data node, which uses AI to pull specific information from text content.

Node Inputs

Required Fields

  • Text: Content to extract data from (documents, scraped website content, etc.)
  • Data Fields to Extract: Define what you want to extract:
    • Name: Label for the data (e.g., “location”)
    • Type: Format of the data (text/number/boolean)
    • Description: Help the AI understand what to extract

Optional Fields

  • Extract List: Enable to get multiple items instead of single values
  • Additional Context: Extra information to guide the extraction
  • Temperature: Controls AI creativity (0-1)
  • Cache Response: Save responses for reuse

Show As Input Options

You can expose these fields as inputs:

  • Additional Context
  • Temperature

Node Output

  • Extracted Data: Single value or list based on your settings

Node Functionality

The Extract Data node:

  • Analyzes text using AI
  • Finds specific information
  • Returns structured data
  • Handles single or multiple items
  • Supports various data types

Available AI Models

  • Claude 3.5 Sonnet
  • Claude 3 Haiku
  • OpenAI o1
  • OpenAI o1 mini
  • GPT-4o
  • GPT-4o Mini
  • Gemini 1.5 Pro/Flash
  • And more

Example Use Cases

  1. Contact Information:
Extract: Email, Phone, Address
From: Company websites or documents
  1. Product Details:
Extract: Price, Features, Specifications
From: Product descriptions
  1. Data Extraction from Documents:
Extract: Date, invoice amount, vendor, address
From: Financial documents or invoices

Important Considerations

  1. Expert models (OpenAI o1) cost 30 credits, advanced models (GPT-4o & Claude 3.5) cost 20 credits, and standard models cost 2 credits per run
  2. You can drop the credit cost to 1 by providing your own API key under the credentials page
  3. Enable “Extract List” when you need multiple items
  4. Be specific in your data descriptions for accurate outputs

In summary, the Extract Data node is your tool for pulling structured information from unstructured text, whether you need single values or lists of data.