Talk to your AI agent with voice - both ways
Prerequisites
Before starting, make sure you have:
-
Completed the Quick Start Guide -
A running agent daemon ( mutiro start
) -
Mutiro mobile app installed (iOS or Android)
Configure Your Agent's Voice
Open your agent's configuration file in your project directory:
Add the tts_voice
field to your configuration:
Mutiro uses Google's Chirp3 HD voices for natural-sounding speech. Choose from dozens of voices in different languages, genders, and styles. See available voices below.
Choose Your Voice
Select a voice that matches your agent's personality. Here are some popular options:
en-US-Chirp3-HD-Algieba
en-US-Chirp3-HD-Kore
en-US-Chirp3-HD-Charon
en-US-Chirp3-HD-Leda
en-US-Chirp3-HD-Puck
en-US-Chirp3-HD-Zephyr
Chirp3 HD supports voices in multiple languages including Spanish, French, German, Japanese, and more. Each voice has unique characteristics - some are warm and friendly, others are clear and professional.
View the complete voice catalog with audio samples: Google Cloud Chirp3 HD Voices →
Restart Your Agent
After updating your configuration, restart the agent daemon to apply the new voice:
Your agent will now use the configured voice for all audio responses.
Test Voice Interaction
Open the Mutiro mobile app and try voice messaging:
- 1 Tap the microphone button in the chat interface
- 2 Speak your message to your agent
- 3 Your voice is automatically transcribed and sent to the agent
- 4 The agent's response comes back as both text and audio
Mutiro automatically transcribes your voice messages using advanced speech recognition. You don't need any additional configuration - it just works!
Customize Voice Responses (Optional)
You can customize how your agent responds to voice messages by adding instructions to your agent configuration:
Option 1: Using Prompt Append (Claude)
Add a prompt_append
field to guide Claude's responses:
Option 2: Using Genie Persona
For more advanced customization, create a Genie persona that defines your agent's voice personality:
Think of Mutiro as a messaging app where you chat with your AI agent. Use voice for conversations - explaining ideas, brainstorming, discussing concepts. These can be longer and more natural. Use text for quick, practical exchanges - URLs, short code snippets, commands, status updates.
Important: Never mix both modes in one response. Avoid sending huge text blocks or file diffs (document support coming later). Keep text responses WhatsApp-brief, and use voice when you need to actually explain or explore something.
Voice is Ready!
You can now have natural voice conversations with your AI agent from anywhere.
-
Try different voices to find one that matches your agent's personality -
Customize voice behavior with prompt_append or Genie personas for better voice interactions -
Voice works in all languages supported by Chirp3 HD - experiment with multilingual agents -
Each agent can have a different voice - customize each one individually