A Model Context Protocol server that provides unstructured document processing capabilities. This server enables LLMs to extract and use content from an unstructured document.
This repo is work in progress, proceed with caution :)
Supported file types:
{".abw", ".bmp", ".csv", ".cwk", ".dbf", ".dif", ".doc", ".docm", ".docx", ".dot",
".dotm", ".eml", ".epub", ".et", ".eth", ".fods", ".gif", ".heic", ".htm", ".html",
".hwp", ".jpeg", ".jpg", ".md", ".mcw", ".mw", ".odt", ".org", ".p7s", ".pages",
".pbd", ".pdf", ".png", ".pot", ".potm", ".ppt", ".pptm", ".pptx", ".prn", ".rst",
".rtf", ".sdp", ".sgl", ".svg", ".sxg", ".tiff", ".txt", ".tsv", ".uof", ".uos1",
".uos2", ".web", ".webp", ".wk2", ".xls", ".xlsb", ".xlsm", ".xlsx", ".xlw", ".xml",
".zabw"}
Prerequisites: You’ll need:
- Unstructured API key. Learn how to obtain one here
- Claude Desktop installed locally
Quick TLDR on how to add this MCP to your Claude Desktop:
- Clone the repo and set up the UV environment.
- Create a
.env
file in the root directory and add the following env variable:UNSTRUCTURED_API_KEY
. - Run the MCP server:
uv run doc_processor.py
- Go to
~/Library/Application Support/Claude/
and create aclaude_desktop_config.json
. In that file add:
{
"mcpServers": {
"unstructured_doc_processor": {
"command": "PATH/TO/YOUR/UV",
"args": [
"--directory",
"ABSOLUTE/PATH/TO/YOUR/unstructured-mcp/",
"run",
"doc_processor.py"
],
"disabled": false
}
}
}
- Restart Claude Desktop. You should now be able to use the MCP.
Unstructured Document Processor
Project Details
- MKhalusova/unstructured-mcp
- Last Updated: 3/30/2025
Recomended MCP Servers
Full access postgres mcp server
Model Context Protocol server to allow for reading and writing from Pinecone. Rudimentary RAG
A MCP server to search for accurate academic articles.
MCP server for interacting with esa API
A simple note-taking MCP server for recording and managing notes with AI models.
A simple MCP server to search for documentation (tutorial)
Virtual traveler library for MCP
A Model Context Protocol (MCP) server for analyzing code dependencies
MCP server for Cursor that leverages Gemini's much larger context window to enhance the capabilities of the AI...
APISIX Model Context Protocol (MCP) server is used to bridge large language models (LLMs) with the APISIX Admin...
A TypeScript implementation of a Model Context Protocol (MCP) server that integrates with PiAPI's API. PiAPI makes user...
This is a Model Context Protocol (MCP) server that provides comprehensive financial data from Yahoo Finance. It allows...