MCP Screenshot
An MCP server that captures screenshots and performs OCR text recognition.
Features
- Screenshot capture (left half, right half, full screen)
- OCR text recognition (supports Japanese and English)
- Multiple output formats (JSON, Markdown, vertical, horizontal)
OCR Engines
This server uses two OCR engines:
yomitoku
- Primary OCR engine
- High-accuracy Japanese text recognition
- Runs as an API server
Tesseract.js
- Fallback OCR engine
- Used when yomitoku is unavailable
- Supports both Japanese and English recognition
Installation
npx -y @kazuph/mcp-screenshot
Claude Desktop Configuration
Add the following configuration to your claude_desktop_config.json:
{
"mcpServers": {
"screenshot": {
"command": "npx",
"args": ["-y", "@kazuph/mcp-screenshot"],
"env": {
"OCR_API_URL": "http://localhost:8000" // yomitoku API base URL
}
}
}
}
Environment Variables
| Variable Name | Description | Default Value |
|---|---|---|
| OCR_API_URL | yomitoku API base URL | http://localhost:8000 |
Usage Example
You can use it by instructing Claude like this:
Please take a screenshot of the left half of the screen and recognize the text in it.
Tool Specification
capture
Takes a screenshot and performs OCR.
Options:
region: Screenshot area (‘left’/‘right’/‘full’, default: ‘left’)format: Output format (‘json’/‘markdown’/‘vertical’/‘horizontal’, default: ‘markdown’)
License
MIT
Author
kazuph
Screenshot
Project Details
- kazuph/mcp-screenshot
- @kazuph/mcp-screenshot
- MIT License
- Last Updated: 4/21/2025
Categories
Recomended MCP Servers
A Model Context Protocol (MCP) server that provides web search functionality using Perplexity AI's API.
MCP Server for the Slidespeak API. Create PowerPoint Presentations using MCP.
Lightweight MCP server to give your Cursor Agent access to the Cloudflare API.
MCP Server for Hackernews
A Model Context Protocol (MCP) server for interacting with DaVinci Resolve and Fusion
A Model Context Protocol (MCP) server for querying the CVE-Search API
An (eventually) secure open-source MCP Server to turn any REST endpoint into MCP resources automatically
A server implementation for Wikidata API using the Model Context Protocol (MCP).
R MCP Server
MCP server for Qwen Max model





