Question 1

What is mcp_voice_identify?

Accepted Answer

mcp_voice_identify is a service available on the UBOS Asset Marketplace that provides voice recognition and text extraction capabilities for MCP (Model Context Protocol) servers. It allows AI models to understand and interact with spoken language.

Question 2

What is MCP (Model Context Protocol)?

Accepted Answer

MCP is an open protocol that standardizes how applications provide context to Large Language Models (LLMs). It acts as a bridge, allowing AI models to access and interact with external data sources and tools.

Question 3

What are the key features of mcp_voice_identify?

Accepted Answer

Key features include voice recognition from audio files and base64-encoded data, text extraction, support for both stdio and MCP modes, and structured voice recognition results.
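As a rough illustration of the base64 input path, the sketch below turns raw audio bytes into base64 text and back using Python's standard library. The helper names and the placeholder bytes are assumptions made for this example; how the service expects the encoded string to be passed is not covered here.

```python
import base64


def encode_audio_base64(audio_bytes: bytes) -> str:
    """Return the base64 text form of raw audio bytes (hypothetical helper)."""
    return base64.b64encode(audio_bytes).decode("ascii")


def decode_audio_base64(encoded: str) -> bytes:
    """Reverse the encoding; validate=True rejects non-base64 characters."""
    return base64.b64decode(encoded, validate=True)


# Placeholder bytes standing in for the contents of an audio file.
sample = b"RIFF\x00\x00\x00\x00WAVE"
encoded = encode_audio_base64(sample)
print(encoded)
```

In practice the bytes would come from reading an audio file in binary mode, for example `open(path, "rb").read()`.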

Question 4

What kind of audio formats does mcp_voice_identify support?

Accepted Answer

The service supports a variety of audio formats. Refer to the documentation for a complete list of supported formats.

Question 5

How does the structured voice recognition result work?

Accepted Answer

The service returns results as structured JSON, including the language code (`lan`), emotion state (`emo`), audio type (`type`), speaker identifier (`speaker`), and recognized text content (`text`).
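A minimal sketch of consuming such a result in Python follows. It assumes the result arrives as a flat JSON object whose keys match the field names listed on this page (`lan`, `emo`, `type`, `speaker`, `text`); the actual response may wrap or nest these fields differently. `VoiceResult` and `parse_voice_result` are hypothetical names used only for illustration.

```python
import json
from dataclasses import dataclass


@dataclass
class VoiceResult:
    lan: str      # language code
    emo: str      # emotion state
    type: str     # audio type
    speaker: str  # speaker identifier
    text: str     # recognized text content


FIELDS = ("lan", "emo", "type", "speaker", "text")


def parse_voice_result(raw: str) -> VoiceResult:
    """Parse a JSON string into a VoiceResult; raises KeyError if a field is missing."""
    data = json.loads(raw)
    return VoiceResult(**{name: data[name] for name in FIELDS})


# Example values taken from the field table on this page.
raw = '{"lan": "en", "emo": "unknown", "type": "speech", "speaker": "woitn", "text": "test test test"}'
result = parse_voice_result(raw)
print(result.lan, result.text)
```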

Question 6

What special labels does mcp_voice_identify process?

Accepted Answer

Question 7

How do I install and set up mcp_voice_identify?

Accepted Answer

Clone the repository, install the dependencies using `pip install -r requirements.txt`, and set up your API credentials in a `.env` file.
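To check that a `.env` file contains the credentials you expect, a small standard-library sketch like the one below can help. The variable names `API_KEY` and `API_URL` are hypothetical placeholders, since the actual names required by the service are defined in its documentation; the parser only handles simple `KEY=value` lines.

```python
def parse_env(text: str) -> dict:
    """Parse simple KEY=value lines, skipping blanks and # comments."""
    values = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        # Drop optional surrounding quotes around the value.
        values[key.strip()] = value.strip().strip('"').strip("'")
    return values


# Hypothetical variable names; replace with the ones the project documents.
example = """
# API credentials
API_KEY=your-key-here
API_URL="https://example.invalid/recognize"
"""

settings = parse_env(example)
missing = [name for name in ("API_KEY", "API_URL") if not settings.get(name)]
print("missing:", missing)
```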

Question 8

What are the differences between stdio and MCP modes?

Accepted Answer

stdio mode is for simple command-line interactions, while MCP mode integrates the service with MCP-enabled AI systems.

Question 9

How do I run the service in stdio mode?

Accepted Answer

Run `python stdio_server.py` and send JSON-RPC requests via stdin.
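For illustration, a JSON-RPC 2.0 request can be built in Python as sketched below. The envelope (`jsonrpc`, `id`, `method`, `params`) follows the JSON-RPC 2.0 specification, but the method name `recognize_file`, the `path` parameter, and the one-request-per-line framing are assumptions; check the service's help output for the real method names and parameters.

```python
import json


def build_request(method: str, params: dict, request_id: int = 1) -> str:
    """Serialize a JSON-RPC 2.0 request as a single newline-terminated line."""
    payload = {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": method,
        "params": params,
    }
    return json.dumps(payload) + "\n"


# Hypothetical method and parameter names, used only to show the shape.
line = build_request("recognize_file", {"path": "sample.wav"})
print(line, end="")
```

The resulting line could then be written to the stdin of `python stdio_server.py`, for example with `subprocess.run(..., input=line, text=True, capture_output=True)`.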

Question 10

How do I run the service in MCP mode?

Accepted Answer

Run `python mcp_server.py`.

Question 11

How do I build the executables?

Accepted Answer

Make the `build_exec.sh` script executable (`chmod +x build_exec.sh`) and then run it using `./build_exec.sh` (for stdio) or `./build_exec.sh mcp` (for MCP).

Question 12

Where are the executables created?

Accepted Answer

The executables are created in the `dist/` directory as `voice_stdio` (stdio mode) and `voice_mcp` (MCP mode).

Question 13

How do I run the tests?

Accepted Answer

Make the test scripts executable (`chmod +x test_*.sh`) and then run them using `./test_help.sh`, `./test_voice_file.sh`, and `./test_voice_base64.sh`.

Question 14

What is the license for mcp_voice_identify?

Accepted Answer

This project is licensed under the MIT License. See the LICENSE file for details.

Question 15

How does mcp_voice_identify integrate with the UBOS platform?

Accepted Answer

UBOS allows you to orchestrate the service with other AI models, connect it to enterprise data sources, build custom AI agents leveraging the service, and develop multi-agent systems with voice-based communication.

Label Result Fields

| Field | Description | Example Value |
| --- | --- | --- |
| `lan` | Language code | "en" |
| `emo` | Emotion state | "unknown" |
| `type` | Audio type | "speech" |
| `speaker` | Speaker identifier | "woitn" |
| `text` | Recognized text content | "test test test" |
