Computer Control MCP
MCP server that provides computer control capabilities, like mouse, keyboard, OCR, etc. using PyAutoGUI, RapidOCR, ONNXRuntime. Similar to ‘computer-use’ by Anthropic. With Zero External Dependencies.
- Only tested on Windows. Should work on other platforms.
Quick Usage (MCP Setup Using uvx
)
{
"mcpServers": {
"computer-control-mcp": {
"command": "uvx",
"args": ["computer-control-mcp@latest"]
}
}
}
OR install globally with pip
:
pip install computer-control-mcp
Then run the server with:
computer-control-mcp # instead of uvx computer-control-mcp, so you can use the latest version, also you can `uv cache clean` to clear the cache and `uvx` again to use latest version.
Features
- Control mouse movements and clicks
- Type text at the current cursor position
- Take screenshots of the entire screen or specific windows with optional saving to downloads directory
- Extract text from screenshots using OCR (Optical Character Recognition)
- List and activate windows
- Press keyboard keys
- Drag and drop operations
Available Tools
Mouse Control
click_screen(x: int, y: int)
: Click at specified screen coordinatesmove_mouse(x: int, y: int)
: Move mouse cursor to specified coordinatesdrag_mouse(from_x: int, from_y: int, to_x: int, to_y: int, duration: float = 0.5)
: Drag mouse from one position to another
Keyboard Control
type_text(text: str)
: Type the specified text at current cursor positionpress_key(key: str)
: Press a specified keyboard key
Screen and Window Management
take_screenshot(title_pattern: str = None, use_regex: bool = False, threshold: int = 60, with_ocr_text_and_coords: bool = False, scale_percent_for_ocr: int = 100, save_to_downloads: bool = False)
: Capture screen or window with optional OCRget_screen_size()
: Get current screen resolutionlist_windows()
: List all open windowsactivate_window(title_pattern: str, use_regex: bool = False, threshold: int = 60)
: Bring specified window to foreground
Development
Setting up the Development Environment
# Clone the repository
git clone https://github.com/AB498/computer-control-mcp.git
cd computer-control-mcp
# Install in development mode
pip install -e .
Running Tests
python -m pytest
API Reference
See the API Reference for detailed information about the available functions and classes.
License
MIT
For more information or help
- Email (abcd49800@gmail.com)
- Discord (CodePlayground)
Computer Control
Project Details
- AB498/computer-control-mcp
- MIT License
- Last Updated: 4/18/2025
Recomended MCP Servers
A Model Context Protocol (MCP) server for Apache Seatunnel. This provides access to your Apache Seatunnel RESTful API...
The Ultimate Model Context Protocol (MCP) Server, providing unified access to a wide variety of useful and powerful...
The ultimate toolkit for working with APIs.
An MCP server implementation that integrates the Rhombus API to provide Chatbot tools.
Code2Flow MCP服务器,用于生成代码调用图并通过MCP协议提供服务
A Slack MCP server
Agentic tool that looks for statistical variations in conversation structure and logs unusual events to a SQLite database.
Trabalho de NLP - PUC-RIO
A powerful Model Context Protocol (MCP) server that provides an access to serverless browser for AI agents and...
mcp server for todoist