Simple Document Processing MCP Server
A Model Context Protocol (MCP) server that provides tools for reading, converting, and processing documents.
Features
Document Reader
- Read DOCX, PDF, TXT, HTML, CSV
Document Conversion
- DOCX to HTML/PDF conversion
- HTML to TXT/Markdown conversion
- PDF manipulation (merge, split)
Text Processing
- Encoding conversion (UTF-8, Big5, GBK)
- Text formatting and cleaning
- Text comparison and diff generation
- Text splitting by lines or delimiter
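To illustrate what splitting by lines or by a delimiter can look like, here is a minimal TypeScript sketch. The function names `splitByLines` and `splitByDelimiter` are hypothetical and are not taken from this server's code; the server's own tool may use different names, parameters, and edge-case behavior.

```typescript
// Hypothetical helpers illustrating the two splitting modes.
// These are NOT the server's API; they only sketch the idea.

/** Split text into chunks of at most `linesPerChunk` lines each. */
function splitByLines(text: string, linesPerChunk: number): string[] {
  if (!Number.isInteger(linesPerChunk) || linesPerChunk < 1) {
    throw new RangeError("linesPerChunk must be a positive integer");
  }
  // Accept both Unix (\n) and Windows (\r\n) line endings.
  const lines = text.split(/\r?\n/);
  const chunks: string[] = [];
  for (let i = 0; i < lines.length; i += linesPerChunk) {
    chunks.push(lines.slice(i, i + linesPerChunk).join("\n"));
  }
  return chunks;
}

/** Split text on a literal delimiter, trimming each piece and dropping empty ones. */
function splitByDelimiter(text: string, delimiter: string): string[] {
  if (delimiter === "") {
    throw new RangeError("delimiter must not be empty");
  }
  return text
    .split(delimiter)
    .map((part) => part.trim())
    .filter((part) => part.length > 0);
}
```

Trimming and dropping empty pieces is a design choice made for this sketch; a tool that must preserve whitespace exactly would skip both steps.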
HTML Processing
- HTML cleaning and formatting
- Resource extraction (images, links, videos)
- Structure-preserving conversion
Installation
Installing via Smithery
To install Document Processing Server for Claude Desktop automatically via Smithery:
npx -y @smithery/cli install @cablate/mcp-doc-forge --client claude
Manual Installation
npm install -g @cablate/mcp-doc-forge
Usage
CLI
mcp-doc-forge
With Dive Desktop
- Click “+ Add MCP Server” in Dive Desktop
- Copy and paste this configuration:
{
  "mcpServers": {
    "doc-forge": {
      "command": "npx",
      "args": [
        "-y",
        "@cablate/mcp-doc-forge"
      ],
      "enabled": true
    }
  }
}
- Click “Save” to install the MCP server
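If other MCP servers are already configured, the new entry needs to be merged under `mcpServers` rather than replacing the whole object. The sketch below shows one way to do that merge in TypeScript. It assumes the configuration has the shape shown in the snippet above and that the key under `mcpServers` is a label you choose; the type names and the `addDocForgeServer` function are hypothetical and are not part of Dive Desktop or this package.

```typescript
// Hypothetical types mirroring the configuration snippet; not an official schema.
type McpServerEntry = {
  command: string;
  args: string[];
  enabled?: boolean;
};

type McpConfig = {
  mcpServers: Record<string, McpServerEntry>;
};

/**
 * Return a copy of `config` with an entry that launches
 * @cablate/mcp-doc-forge through npx, stored under the key `name`.
 * Existing servers are kept; the input object is not mutated.
 */
function addDocForgeServer(config: McpConfig, name: string): McpConfig {
  return {
    ...config,
    mcpServers: {
      ...config.mcpServers,
      [name]: {
        command: "npx",
        args: ["-y", "@cablate/mcp-doc-forge"],
        enabled: true,
      },
    },
  };
}

// Usage: merge into a config that already lists another (made-up) server.
const existing: McpConfig = {
  mcpServers: { other: { command: "node", args: ["other-server.js"] } },
};
const merged = addDocForgeServer(existing, "doc-forge");
console.log(JSON.stringify(merged, null, 2));
```

Returning a new object instead of editing `existing` in place keeps the original configuration available if the result needs to be discarded.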
License
MIT
Contributing
Community participation and contributions are welcome! Here are some ways to contribute:
Star the project if you find it helpful
Submit Issues: Report problems or provide suggestions
Create Pull Requests: Submit code improvements
Contact
If you have any questions or suggestions, feel free to reach out:
Email: reahtuoo310109@gmail.com
GitHub: CabLate
Collaboration: Proposals for project cooperation are welcome
Technical Guidance: Suggestions and guidance are sincerely welcome
Document Processing Server
Project Details
- cablate/mcp-doc-forge
- @cablate/mcp-doc-forge
- MIT License
- Last Updated: 4/16/2025