Doc Scraper MCP Server
A Model Context Protocol (MCP) server that provides documentation scraping functionality. This server converts web-based documentation into markdown format using jina.ai’s conversion service.
Features
- Scrapes documentation from any web URL
- Converts HTML documentation to markdown format
- Saves the converted documentation to a specified output path
- Integrates with the Model Context Protocol (MCP)
Installation
Installing via Smithery
To install Doc Scraper for Claude Desktop automatically via Smithery:
npx -y @smithery/cli install @askjohngeorge/mcp-doc-scraper --client claude
- Clone the repository:
git clone https://github.com/askjohngeorge/mcp-doc-scraper.git
cd mcp-doc-scraper
- Create and activate a virtual environment:
python -m venv venv
source venv/bin/activate # On Windows, use: venvScriptsactivate
- Install the dependencies:
pip install -e .
Usage
The server can be run using Python:
python -m mcp_doc_scraper
Tool Description
The server provides a single tool:
- Name:
scrape_docs
- Description: Scrape documentation from a URL and save as markdown
- Input Parameters:
url
: The URL of the documentation to scrapeoutput_path
: The path where the markdown file should be saved
Project Structure
doc_scraper/
├── __init__.py
├── __main__.py
└── server.py
Dependencies
- aiohttp
- mcp
- pydantic
Development
To set up the development environment:
- Install development dependencies:
pip install -r requirements.txt
- The server uses the Model Context Protocol. Make sure to familiarize yourself with MCP documentation.
License
MIT License
Doc Scraper MCP Server
Project Details
- askjohngeorge/mcp-doc-scraper
- Last Updated: 4/18/2025
Recomended MCP Servers
This tool is a cutting-edge memory engine that blends real-time learning, persistent three-tier context awareness, and seamless plug-n-play...
hello-mcp is a tour and guide for beginners to Claude Desktop MCP Config Manager, designed to help them...
MCP Mail Tool - AI-powered email management tool | 基于 MCP 的智能邮件管理工具
Excel数据处理微服务
python MCP NAVER
Query and Summarize your chat messages.
An MCP server that enables communication with users through Telegram. This server provides a tool to ask questions...
MCP Server for the Mapbox API.
AI写的七牛上传MCP,以后各种音频图片上传都可以传上去引用,方便很多。