MCP Screenshot
An MCP server that captures screenshots and performs OCR text recognition.
Features
- Screenshot capture (left half, right half, full screen)
- OCR text recognition (supports Japanese and English)
- Multiple output formats (JSON, Markdown, vertical, horizontal)
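As a rough illustration of the three capture regions, the sketch below computes a pixel rectangle for each one. The `ScreenRegion` and `Rect` types and the `regionToRect` helper are hypothetical names chosen for this example and are not taken from the package; how the server actually splits the screen (for instance on odd widths or multi-monitor setups) may differ.

```typescript
// Hypothetical types and helper for illustration; not the package's own code.
type ScreenRegion = "left" | "right" | "full";

interface Rect {
  x: number;
  y: number;
  width: number;
  height: number;
}

// Compute the rectangle to capture for a region of a screen of the given size.
function regionToRect(
  region: ScreenRegion,
  screenWidth: number,
  screenHeight: number,
): Rect {
  const half = Math.floor(screenWidth / 2);
  switch (region) {
    case "left":
      return { x: 0, y: 0, width: half, height: screenHeight };
    case "right":
      // Assumption: the right half absorbs the extra pixel when the width is odd.
      return { x: half, y: 0, width: screenWidth - half, height: screenHeight };
    case "full":
      return { x: 0, y: 0, width: screenWidth, height: screenHeight };
  }
}
```

Keeping the geometry in a pure function like this makes it easy to check without taking a real screenshot.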
OCR Engines
This server uses two OCR engines:
yomitoku
- Primary OCR engine
- High-accuracy Japanese text recognition
- Runs as an API server
Tesseract.js
- Fallback OCR engine
- Used when yomitoku is unavailable
- Supports both Japanese and English recognition
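The primary/fallback arrangement described above can be sketched as a small wrapper that tries one engine and switches to the other on failure. This is only a minimal sketch: `OcrEngine`, `OcrResult`, and `recognizeWithFallback` are hypothetical names, and the real server may detect that yomitoku is unavailable in a different way (for example with a health check rather than a caught error).

```typescript
// Hypothetical interface: an OCR engine takes image bytes and resolves to text.
type OcrEngine = (image: Uint8Array) => Promise<string>;

interface OcrResult {
  engine: "primary" | "fallback";
  text: string;
}

// Try the primary engine first; if it rejects, use the fallback engine.
async function recognizeWithFallback(
  primary: OcrEngine,
  fallback: OcrEngine,
  image: Uint8Array,
): Promise<OcrResult> {
  try {
    return { engine: "primary", text: await primary(image) };
  } catch {
    // Assumption: any error from the primary engine means it is unavailable.
    return { engine: "fallback", text: await fallback(image) };
  }
}
```

In this sketch `primary` would wrap a request to the yomitoku API and `fallback` would wrap Tesseract.js; both wrappers are left out because their exact calls are not documented here.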
Installation
npx -y @kazuph/mcp-screenshot
Claude Desktop Configuration
Add the following configuration to your claude_desktop_config.json. OCR_API_URL is the yomitoku API base URL:
{
  "mcpServers": {
    "screenshot": {
      "command": "npx",
      "args": ["-y", "@kazuph/mcp-screenshot"],
      "env": {
        "OCR_API_URL": "http://localhost:8000"
      }
    }
  }
}
Environment Variables
| Variable Name | Description | Default Value |
|---|---|---|
| OCR_API_URL | yomitoku API base URL | http://localhost:8000 |
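To show how the default in the table could be applied, here is a minimal sketch of resolving the setting from an environment object. `resolveOcrApiUrl` and `DEFAULT_OCR_API_URL` are hypothetical names, and treating an empty or whitespace-only value as unset is an assumption of this example, not documented behavior.

```typescript
// Default taken from the table above.
const DEFAULT_OCR_API_URL = "http://localhost:8000";

// Hypothetical helper: return the configured URL, or the default when unset.
function resolveOcrApiUrl(env: Record<string, string | undefined>): string {
  const value = env["OCR_API_URL"]?.trim();
  // Assumption: an empty or whitespace-only value counts as "not set".
  return value ? value : DEFAULT_OCR_API_URL;
}
```

In a Node.js process this would typically be called as `resolveOcrApiUrl(process.env)`.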
Usage Example
You can use it by instructing Claude like this:
Please take a screenshot of the left half of the screen and recognize the text in it.
Tool Specification
capture
Takes a screenshot and performs OCR.
Options:
- region: Screenshot area ('left'/'right'/'full', default: 'left')
- format: Output format ('json'/'markdown'/'vertical'/'horizontal', default: 'markdown')
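The options and defaults listed above can be sketched as a small validation step. The names `CaptureOptions` and `normalizeCaptureOptions` are hypothetical, and throwing on an unknown value is an assumption; the actual tool may report invalid options differently.

```typescript
// Allowed values and defaults as listed in the tool specification above.
const REGIONS = ["left", "right", "full"] as const;
const FORMATS = ["json", "markdown", "vertical", "horizontal"] as const;

type CaptureRegion = (typeof REGIONS)[number];
type CaptureFormat = (typeof FORMATS)[number];

interface CaptureOptions {
  region: CaptureRegion;
  format: CaptureFormat;
}

// Hypothetical helper: fill in defaults and reject values outside the lists.
function normalizeCaptureOptions(
  input: { region?: string; format?: string } = {},
): CaptureOptions {
  const region = input.region ?? "left";
  const format = input.format ?? "markdown";
  if (!(REGIONS as readonly string[]).includes(region)) {
    throw new Error(`Unknown region: ${region}`);
  }
  if (!(FORMATS as readonly string[]).includes(format)) {
    throw new Error(`Unknown format: ${format}`);
  }
  return { region: region as CaptureRegion, format: format as CaptureFormat };
}
```

For example, calling `normalizeCaptureOptions({ region: "full" })` keeps the requested region and fills in the default `markdown` format.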
License
MIT
Author
kazuph
Project Details
- kazuph/mcp-screenshot
- @kazuph/mcp-screenshot
- MIT License
- Last Updated: 4/21/2025





