ElevenLabs Cloud
Use ElevenLabs's models with your own API Key
Last updated
Was this helpful?
Use ElevenLabs's models with your own API Key
Last updated
Was this helpful?
The ElevenLabs node allows you to convert text into lifelike speech using the ElevenLabs Text-to-Speech API. With this node, you can select from various models and voices offered by ElevenLabs to generate high-quality, natural-sounding audio from input text.
This node is ideal for use cases such as:
Creating voiceovers for videos or presentations
Generating spoken feedback or summaries
Building voice-enabled assistants
Enhancing accessibility features with audio output
To use this node, you need a valid ElevenLabs API credential. Once configured, you can define the action (currently supports βText to Speechβ and "Speech to Text"), choose the model and voice, and input the text you want to convert.
The resulting audio file can be used in subsequent steps of your workflow or delivered to users.
The ElevenLabs component has the following input connections.
From data Loaders/ Data source/Vector DB
This input connection represents the context information for the ElevenLabs model
Must originate from a Data Loader/Data Source or VectorDB component.
From Input
This input connection represents the user query for the ElevenLabs model.
Must originate from a component that generates a text string as output such as a Python or Text Input component.
Credentials
You can specify to use your own ElevenLabs credentials
Action
Choose the action: - Text to Speech - Speech to Text
Model
The list of models displayed depends on the customer's credentials.
Voice (With action = Text to Speech)
The list of voices displayed depends on the customer's credentials.
Text (With action = Text to Speech)
This is the input field where you provide the content to be converted into speech.
You can either type plain text directly or refer to the output of previous nodes by using the @
symbol to select dynamic data from earlier steps in your workflow. This allows you to generate speech from content that was extracted, generated, or transformed in previous nodes β for example, scraped text, AI-generated summaries, or user input.
Examples:
Static text: Hello, welcome to our service.
Dynamic text: @web-scraper.output.content
(refers to content scraped from a website)
Make sure the referenced data is in plain text format for optimal speech synthesis quality.
Audio file
This field is used to provide the input audio that you want to transcribe into text.
You can either:
Upload a direct audio file (in supported formats like .mp3
, .wav
, etc.), or
Dynamically reference audio output from previous nodes in the workflow by using the @
symbol β for example, an audio file URL returned by a recording tool, voice assistant, or web scraper.
Example usages:
Static file reference: https://example.com/audio/sample.mp3
Dynamic reference: @recorder.output.audioFileUrl
Make sure the referenced file is accessible via a valid URL or is passed from a previous node that provides audio in a compatible format.
Stability (With action = Text to Speech)
This setting controls how consistent the voice sounds between different runs.
Higher values (closer to 1) produce more stable and predictable speech output.
Lower values introduce more variation and spontaneity in the generated voice.
Similarity boost (With action = Text to Speech)
This adjusts how strongly the generated voice should try to match the original voice profile.
Higher values make the voice more closely resemble the reference voice, which may reduce expressiveness.
Lower values allow for more natural variation but may sound less like the reference voice.
Enable catching
This enables local caching of generated audio to improve performance and reduce repeated API calls for identical input.
When turned on, previously generated audio will be reused for the same input.
You can also configure the Caching time, which determines how long the result stays in cache before being refreshed.
Includes an option to Clear cache manually.
The ElevenLabs component has the following output connections.
To Output
This output connection contains the esult of the ElevenLabs component.
Can be connected to any component that accepts a string input.