What is A comprehensive Model Context Protocol (MCP) server that provides advanced PDF text extraction, search, and analysis functionality.?
MCP PDF Reader is an enhanced server that offers features such as text extraction, search, metadata extraction, page-specific processing, text cleaning, and async processing. It includes multiple tools for different PDF operations and ensures security and performance.
Documentation
MCP PDF Reader Enhanced
A comprehensive Model Context Protocol (MCP) server that provides advanced PDF text extraction, search, and analysis functionality.
Features# Core Functionality
✅ Text Extraction: Extract text content from PDF files with customizable options
✅ Text Search: Search for specific text within PDFs with advanced options
✅ Metadata Extraction: Retrieve comprehensive PDF metadata
✅ Page-specific Processing: Extract content from specific page ranges
✅ Text Cleaning: Normalize and clean extracted text
✅ File Size Limits: Protection against overly large files (50MB limit)
✅ Async Processing: Non-blocking file operations
Advanced Features
🔄 Multiple Tools: 3 specialized tools for different PDF operations
🔍 Smart Search: Case-sensitive, whole-word, and regex search options
📊 Rich Metadata: Extract author, title, creation date, keywords, and more
⚡ Performance: Efficient processing with size limits and error handling
🛡️ Security: File validation and path sanitization
Installation
npm install
Tools Available# 1. read-pdf - Enhanced PDF Reading
Extract text from PDF files with customizable options.