This document explains the Speech to Text node, which converts audio content into written text.

Node Inputs

Required Fields

  • Audio File: Recording to transcribe

Optional Fields

  • Use Link: Enable to use audio URL

  • Model: AI system for transcription

    Current: OpenAI Whisper

Node Output

  • Transcript: Converted text content

Node Functionality

The Speech to Text node:

  • Transcribes audio files
  • Handles multiple languages
  • Processes various formats
  • Maintains punctuation
  • Supports batch processing via loop mode

When to Use

Use this node when you need to:

  • Convert Recordings:

    • Meeting recordings
    • Interview audio
    • Voice notes
    • Lecture content
  • Create Documentation:

    • Meeting minutes
    • Interview transcripts
    • Podcast transcripts
    • Course materials

Common Use Cases

  1. Meeting Documentation:
Input: weekly-meeting.mp3
Output: Full meeting transcript
Use: Share with team members
  1. Content Creation:
Input: podcast-episode.mp3
Output: Written content
Use: Blog posts, show notes
  1. Research Analysis:
Input: interview.mp3
Output: Text for analysis
Use: Research documentation

In summary, the Speech to Text node helps convert audio recordings into text format, making content more accessible and searchable in your Gumloop workflows.