How to Capture Text from Audio Streams

Capturing text from audio streams can be a crucial part of data processing and automation workflows. This guide walks you through the steps to achieve this using built-in commands and configurations.

Overview

This process involves downloading audio streams, converting speech to text, and extracting the transcribed content for further use. Follow the steps below to capture and process text from audio streams efficiently.

Step 1: Download the Audio Stream

Add a URL command: Use the URL Command to download the audio file from a specified URL.
Enable Audio Capture:

Check the Capture Audio Stream option in the command properties.
- Specify the audio format (e.g., .mp3, .wav).
- Optionally, set the duration of silence (in seconds) to automatically stop the recording.

Step 2: Convert Audio to Text

Add a Content Converter Command:
- Use the Content Converter to process the downloaded audio.
- Select Voice to Text as the converter type.
Choose a CAPTCHA Service Provider:
- Pick a provider and configure its API for speech-to-text conversion.

Step 3: Input the Downloaded Audio File

Use the Arguments field to specify the audio file for transcription.
Reference the downloaded file using ${Captured File}(refer above snapshot)

Step 4: View the Converted Text

Add a Data Value Command:
- Configure the command to access the transcribed text.
- Set Data to Action Result and choose Converted Value to extract the output.

This streamlined process enables you to efficiently capture text from audio streams and integrate the transcription into your workflows.

In order to get/view the converted text, one needs to add a Data Value command and need to select the Action Result in the Data drop down and use the Converted Value in the following field.