- Updated: January 18, 2026
- 6 min read
Redd‑Archiver: Open‑Source PostgreSQL‑Backed Reddit Archive Generator Introduces New Features
Redd‑Archiver is an open‑source, PostgreSQL‑backed tool that converts Reddit, Voat, and Ruqqus data dumps into fast, searchable, and fully browsable HTML archives, with optional AI‑enhanced query capabilities.
What Is Redd‑Archiver?
Redd‑Archiver is a community‑driven project that lets developers, data archivists, and tech enthusiasts preserve the full history of link‑aggregator platforms. By ingesting compressed data dumps (e.g., Reddit’s .zst files, Voat SQL exports, or Ruqqus .7z archives) it builds a PostgreSQL database and then renders a static, mobile‑first website that can be hosted anywhere—from a USB stick to a public HTTPS server.
The tool is designed for both casual users who need a quick offline copy of a subreddit and power users who require enterprise‑grade full‑text search across billions of posts. Its modular architecture also includes an MCP (Model‑Control‑Protocol) server that exposes 30+ AI‑ready endpoints, making it a natural fit for projects that want to layer ChatGPT or Claude‑style assistants on top of archived content.
Core Features and Capabilities
Multi‑Platform Archiving
- Unified import of Reddit, Voat, and Ruqqus data in a single archive.
- Automatic platform detection based on file extensions.
- Mixed archives allow cross‑platform search and navigation.
Full‑Text Search Powered by PostgreSQL
- GIN‑indexed search delivers sub‑second results even on billions of rows.
- Google‑style query operators (exact, OR, NOT, sub:, author:, score:).
- Highlighting via
ts_headline()for instant context.
Zero‑JavaScript Offline Browsing
- Pure HTML/CSS output works on any device, even without internet.
- Responsive layout with touch‑friendly navigation.
- WCAG‑compliant keyboard navigation and screen‑reader support.
AI‑Ready MCP Server
- 29 auto‑generated tools expose archive data to Claude, ChatGPT, or custom LLMs.
- Field‑selection and truncation options keep LLM payloads lightweight.
- Ready‑to‑copy JSON snippets for rapid integration.
Supported Platforms and Data Formats
| Platform | Format | Status | Notes |
|---|---|---|---|
| .zst (JSON Lines via Pushshift) | ✅ Full support | 2.38 B posts, 40 k subreddits (through Dec 2024) | |
| Voat | SQL dumps | ✅ Full support | 3.81 M posts, 24 M comments |
| Ruqqus | .7z (JSON Lines) | ✅ Full support | 500 K posts across 6 k guilds |
The table above demonstrates the breadth of data Redd‑Archiver can ingest. Because the importers are modular, adding a new platform is as simple as dropping a new importer module into core/importers/.
Deployment Options: Docker, Tor, HTTPS, and Static Hosting
Redd‑Archiver offers four primary deployment pathways, each tailored to a different security or accessibility requirement.
Docker (Recommended)
- All components (PostgreSQL, search server, Nginx) run in isolated containers.
- One‑line
docker compose up -dspins up a fully functional archive. - Supports both HTTPS (Let’s Encrypt) and Tor hidden services via profiles.
Tor‑Only Homelab
- No public IP or port‑forwarding required.
- Access the archive through a
.onionaddress. - Ideal for privacy‑focused researchers or small community backups.
Production HTTPS
- Automated certificate provisioning with Let’s Encrypt.
- Standard web‑friendly URL structure for SEO.
- Scales to millions of concurrent visitors.
Static Hosting (GitHub / Codeberg Pages)
- Generate pure HTML files and push to a Git repository.
- Zero‑cost hosting for small archives (no search, browse‑only).
- Perfect for demo projects or educational use.
Recent Updates: AI Integration and New Features
The latest 1.0 release introduced a full MCP server that automatically generates OpenAPI‑compatible endpoints for AI assistants. This means you can ask Claude or ChatGPT to “show the top‑10 posts about quantum computing in r/science from 2023” and receive a ready‑to‑display JSON payload.
Additional enhancements include:
- Support for OpenAI ChatGPT integration via the MCP server.
- New Chroma DB integration for vector‑based semantic search.
- Improved Docker images with multi‑stage builds, reducing final size by 30%.
- Enhanced accessibility: ARIA labels and better keyboard navigation.
Benefits and Real‑World Use‑Case Scenarios
By turning massive data dumps into searchable archives, Redd‑Archiver solves several pain points for developers and researchers:
Academic Research
Scholars can query historical discussions without hitting rate‑limited APIs. For example, a linguist studying meme evolution can retrieve every post containing “Doge” across Reddit and Voat in a single SQL‑backed query.
Community Preservation
Moderators of niche subreddits can create a permanent snapshot before a potential ban, then host it on a Tor hidden service for safe, anonymous access.
Enterprise Knowledge Bases
Companies can ingest public tech forums (e.g., r/programming) into an internal knowledge portal, then layer AI marketing agents to surface relevant insights for product teams.
Open‑Source Tooling Ecosystem
Developers building new AI‑driven products can reuse the MCP endpoints as a data source, accelerating time‑to‑market for features like “auto‑summarize top comments” or “generate sentiment heatmaps.”
Get Started with Redd‑Archiver Today
Whether you need a quick offline copy of a favorite subreddit or a full‑scale, AI‑enhanced research archive, Redd‑Archiver provides a ready‑made pipeline. Follow the quick‑start guide to have a working instance in under 15 minutes.
Step 1 – Clone the repository: Redd‑Archiver GitHub repository.
Step 2 – Choose a deployment mode: Use Docker for full‑text search, Tor for privacy, or static hosting for a zero‑cost demo.
Step 3 – Explore AI integration: Connect the MCP server to ChatGPT and Telegram integration or the ElevenLabs AI voice integration to add spoken summaries.
Step 4 – Share your archive: Deploy to a public domain, register your instance on the UBOS partner program, and contribute to the global preservation leaderboard.
Explore related UBOS resources that complement Redd‑Archiver:
- UBOS homepage – the central hub for AI‑powered automation.
- About UBOS – learn about the team behind the platform.
- UBOS platform overview – a deeper dive into the modular architecture.
- Enterprise AI platform by UBOS – scale your archives for corporate use.
- UBOS for startups – fast‑track AI integration for new ventures.
- UBOS solutions for SMBs – affordable AI tools for small businesses.
- Web app editor on UBOS – customize the look of your archive without coding.
- Workflow automation studio – automate data ingestion pipelines.
- UBOS pricing plans – find a plan that matches your budget.
- UBOS portfolio examples – see real‑world deployments.
- UBOS templates for quick start – bootstrap new projects instantly.
Boost Your Archive with UBOS Template Marketplace
UBOS offers a growing library of AI‑enhanced templates that can be plugged into a Redd‑Archiver instance. A few standout examples:
- AI SEO Analyzer – automatically audit your archive’s search engine friendliness.
- AI Article Copywriter – generate summary articles from top posts.
- AI Video Generator – turn popular threads into short video clips.
- AI Chatbot template – let visitors ask natural‑language questions about the archive.
- AI LinkedIn Post Optimization – repurpose archive content for professional networks.
Why Redd‑Archiver Matters
The internet’s collective memory is fragile. Platforms shut down, moderators delete content, and APIs become rate‑limited. Redd‑Archiver gives you control over that history, turning raw dumps into a searchable, shareable, and AI‑ready knowledge base. Whether you’re a researcher preserving digital culture, a developer building the next AI‑driven insight engine, or a community leader safeguarding discussions, this tool provides the foundation you need.
Ready to preserve the past and power the future? Clone the repo, spin up a Docker container, and start exploring the data you thought was lost forever.
Start Archiving Today