Frequently Asked Questions (FAQ) about TTS-MCP
Q: What is TTS-MCP?
A: TTS-MCP (Text-to-Speech Model Context Protocol) is a server and command-line tool that generates speech from text using the OpenAI TTS API. The server works with Model Context Protocol (MCP) compatible clients such as Claude Desktop.
Q: What are the main features of TTS-MCP?
A: TTS-MCP offers multiple voices (alloy, nova, echo, and others), several output formats (including MP3, WAV, OPUS, and AAC), adjustable speech speed, and configurable voice character settings. It ships as both an MCP server and a command-line tool for text-to-speech conversion.
Q: How do I install TTS-MCP?
A: You can install TTS-MCP by cloning the repository and running npm install to install its dependencies, or you can skip installation and run it directly with npx.
Q: How do I start the TTS-MCP server?
A: You can start the server using npm run server with optional arguments to customize settings like voice and model. Alternatively, you can use node bin/tts-mcp-server.js with the desired options.
Q: How do I integrate TTS-MCP with Claude Desktop?
A: To integrate with Claude Desktop, add a configuration block to the Claude Desktop configuration file (on macOS, ~/Library/Application Support/Claude/claude_desktop_config.json). The block specifies the command and arguments for running the TTS-MCP server, along with your OpenAI API key.
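As a rough illustration of that configuration block, the sketch below builds one in Python and prints it as JSON. It assumes the usual Claude Desktop layout (an mcpServers object whose entries have command, args, and env keys), launches the server with node bin/tts-mcp-server.js, and passes the key through the OPENAI_API_KEY environment variable. The server label "tts-mcp", the script path, and the key value are placeholders, not values taken from the project.

```python
import json


def build_claude_config(server_script: str, api_key: str) -> dict:
    """Return a Claude Desktop config fragment that launches the TTS-MCP server.

    server_script: path to bin/tts-mcp-server.js in your local clone
                   (placeholder; adjust to your machine).
    api_key:       your OpenAI API key (placeholder).
    """
    return {
        "mcpServers": {
            # The entry name is an arbitrary label chosen for this sketch.
            "tts-mcp": {
                "command": "node",
                "args": [server_script],
                # The key is passed via the environment instead of --api-key.
                "env": {"OPENAI_API_KEY": api_key},
            }
        }
    }


if __name__ == "__main__":
    config = build_claude_config(
        "/path/to/tts-mcp/bin/tts-mcp-server.js",  # hypothetical path
        "your-openai-api-key",                     # placeholder value
    )
    print(json.dumps(config, indent=2))
```

The printed JSON would be merged into the existing configuration file by hand. Passing the key with --api-key in the args list is the alternative the FAQ mentions.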
Q: What voice characters are supported by TTS-MCP?
A: TTS-MCP supports several voice characters, including alloy, ash, coral, echo, fable, onyx, nova, sage, and shimmer.
Q: What output formats are supported by TTS-MCP?
A: TTS-MCP supports multiple output formats, including mp3, opus, aac, flac, wav, and pcm.
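If you script around the tool, a small pre-flight check against the formats listed above can catch typos before any API call is made. The sketch below is only a convenience helper. Whether tts-mcp itself infers the format from the file extension is not stated here, so treat that link as an assumption. The function name output_format is hypothetical.

```python
from pathlib import Path

# Formats listed in this FAQ.
SUPPORTED_FORMATS = {"mp3", "opus", "aac", "flac", "wav", "pcm"}


def output_format(path: str) -> str:
    """Return the lowercase extension of path if it is a listed format."""
    ext = Path(path).suffix.lstrip(".").lower()
    if ext not in SUPPORTED_FORMATS:
        raise ValueError(
            f"unsupported output format {ext!r}; "
            f"expected one of {sorted(SUPPORTED_FORMATS)}"
        )
    return ext
```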
Q: How do I use the TTS-MCP CLI tool?
A: You can use the TTS-MCP CLI tool to convert text directly using the command tts-mcp -t "Hello, world" -o hello.mp3. You can also convert from a text file using tts-mcp -f speech.txt -o speech.mp3.
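When calling the CLI from a script, it can help to build the argument list programmatically instead of concatenating strings. The following sketch uses only the -t, -f, and -o options shown above. The helper name build_tts_command is hypothetical, and actually running the command assumes tts-mcp is on your PATH and an OpenAI API key is configured.

```python
import shlex
from typing import List, Optional


def build_tts_command(
    output: str,
    text: Optional[str] = None,
    text_file: Optional[str] = None,
) -> List[str]:
    """Build the argv list for a tts-mcp call from inline text or a text file."""
    # Exactly one input source must be given.
    if (text is None) == (text_file is None):
        raise ValueError("provide exactly one of text or text_file")
    source = ["-t", text] if text is not None else ["-f", text_file]
    return ["tts-mcp", *source, "-o", output]


if __name__ == "__main__":
    cmd = build_tts_command("hello.mp3", text="Hello, world")
    # Print a shell-safe rendering of the command for inspection.
    print(shlex.join(cmd))
    # To execute it: subprocess.run(cmd, check=True)
```

Passing a list to subprocess.run avoids shell quoting problems when the text contains spaces or quotation marks.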
Q: How do I specify the OpenAI API key?
A: You can specify the OpenAI API key either with the --api-key parameter in the arguments array of the server configuration, or by setting the OPENAI_API_KEY environment variable.
Q: Can I customize the speech speed?
A: Yes, you can customize the speech speed using the -s or --speed option in the CLI tool, with values ranging from 0.25 to 4.0.
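A wrapper script can validate the speed before invoking the CLI. The sketch below checks the 0.25 to 4.0 range stated above and returns the matching --speed arguments. The helper name speed_args is hypothetical, and the bounds are treated as inclusive, which is an assumption.

```python
from typing import List

MIN_SPEED = 0.25  # lower bound stated in this FAQ
MAX_SPEED = 4.0   # upper bound stated in this FAQ


def speed_args(speed: float) -> List[str]:
    """Return ["--speed", "<value>"] after checking the documented range."""
    if not MIN_SPEED <= speed <= MAX_SPEED:
        raise ValueError(
            f"speed must be between {MIN_SPEED} and {MAX_SPEED}, got {speed}"
        )
    return ["--speed", str(speed)]
```

The returned list can be appended to a tts-mcp argument list, for example after the -o option.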
Q: What models are supported by TTS-MCP?
A: TTS-MCP supports models such as tts-1, tts-1-hd, and gpt-4o-mini-tts (default).
Q: Where can I find the full list of options for the CLI tool?
A: You can find the full list of options by running tts-mcp --help.
Q: What is the purpose of UBOS in relation to TTS-MCP?
A: UBOS is a full-stack AI Agent Development Platform that helps you orchestrate AI Agents, connect them with your enterprise data, build custom AI Agents with your LLM model, and manage Multi-Agent Systems. Integrating TTS-MCP with UBOS enhances voice-based interactions and automates content creation within AI Agent workflows.
Q: Is TTS-MCP free to use?
A: TTS-MCP is an open-source tool, but you will need an OpenAI API key, which may incur costs depending on your usage of the OpenAI TTS API.
Q: What license is TTS-MCP released under?
A: TTS-MCP is released under the MIT license.
Text-to-Speech MCP Server
Project Details
- nakamurau1/tts-mcp
- MIT License
- Last Updated: 4/17/2025