When you purchase through links on our site, we may earn an affiliate commission.Heres how it works.
Voice is the future of human-computer interaction.
It is fully customizable, letting you select, design or evenclone the voiceit uses.
it’s possible for you to also add your own knowledge base.
For example, if you’re making a math tutor you could include access to SAT prep guides.
The most useful aspect is being able to set the underlying brain, or language model.
pic.twitter.com/JqBlwVczdXDecember 3, 2024
UnlikeChatGPT Advanced Voicethis is not native speech-to-speech.
The AI responds in text and ElevenLabs voices it up using its existing voice models.
This happens so fast it may as well be speech-to-speech.
With Conversational AI, ElevenLabs is directly competing with OpenAI’s Realtime API offering.
This could be in a call center fielding phone calls or something less obvious like learning products.
Creating a voice assistant
Anyone with an ElevenLabs account can create a conversational agent.
It comes with four default templates that can be fully customized.
The fourth is a video game wizard with a mysterious voice.
It uses Gemini 1.5 flash for speed and price reasons.
Making a call to the agent costs 500 credits per minute during development.
The starter plan gives you 30,000 credits for $4 per month.
Overall it is a simple process to set up.
you’re able to also import Twilio phone numbers and hook it up to your voice assistant.
For fun, I created a customer support agent named Ryan that uses a clone of my own voice.