Doc To Markdown MCP Server
by abotapi
An MCP server that converts documents to clean Markdown. Convert PDFs, Word docs, Excel spreadsheets, PowerPoints, HTML, images, and more to AI-friend...
Opens on Apify.com
About Doc To Markdown MCP Server
An MCP server that converts documents to clean Markdown. Convert PDFs, Word docs, Excel spreadsheets, PowerPoints, HTML, images, and more to AI-friendly Markdown format.
What does this actor do?
Doc To Markdown MCP Server is a web scraping and automation tool available on the Apify platform. It's designed to help you extract data and automate tasks efficiently in the cloud.
Key Features
- Cloud-based execution - no local setup required
- Scalable infrastructure for large-scale operations
- API access for integration with your applications
- Built-in proxy rotation and anti-blocking measures
- Scheduled runs and webhooks for automation
How to Use
- Click "Try This Actor" to open it on Apify
- Create a free Apify account if you don't have one
- Configure the input parameters as needed
- Run the actor and download your results
Documentation
Document to Markdown MCP Server An Apify MCP server that converts documents to clean Markdown. Convert PDFs, Word docs, Excel spreadsheets, PowerPoints, HTML, images, and more to AI-friendly Markdown format. ## About this MCP Server To understand how to connect to and utilize this MCP server, please refer to the official Model Context Protocol documentation at mcp.apify.com. ### Connection URL MCP clients can connect to this server at: https://<YOUR_USERNAME>--doc-to-markdown-mcp.apify.actor/mcp ## Standby Mode This Actor uses Standby mode, a new, lightweight method for using Actors. Instead of starting an Actor for each input and waiting for results, the Actor remains ready in the background to handle arbitrary HTTP requests, just like any web or API server. Learn more. > This is a new feature, and we'd love to hear your feedback. ### Actor URL Send an HTTP request to this URL, and wait for the response: https://<YOUR_USERNAME>--doc-to-markdown-mcp.apify.actor?token=YOUR_APIFY_TOKEN The MCP endpoint is available at /mcp: https://<YOUR_USERNAME>--doc-to-markdown-mcp.apify.actor/mcp?token=YOUR_APIFY_TOKEN ## Key Features - Batch Processing: Handle up to 20 documents simultaneously for increased productivity - Extensive Format Support: Convert 15+ file formats with intelligent content extraction - Customizable Output: Tailor markdown formatting to meet specific needs - MCP Integration: Compatible with MCP-compatible AI systems and chatbots - Metadata Preservation: Maintain essential document metadata during conversion - Table and Image Handling: Recognize table structures and manage images effectively ## Supported Formats (15+) | Category | Formats | |----------|---------| | Documents | PDF, Word (.docx), PowerPoint (.pptx), Excel (.xlsx, .xls), RTF | | Web | HTML, XML | | Images | JPEG, PNG, GIF, BMP (with EXIF metadata extraction) | | Audio | WAV, MP3 (with speech transcription) | | Data | CSV, JSON, YAML | | Archives | ZIP (recursive extraction) | ## MCP Tools ### Single Document Conversion #### convert_url_to_markdown Convert a document from a URL to Markdown. json { "url": "https://example.com/document.pdf", "options": { "include_metadata": true, "include_toc": true } } #### convert_file_to_markdown Convert a file from Apify Key-Value Store. json { "key": "my-document.pdf", "options": { "include_metadata": true } } #### convert_base64_to_markdown Convert a base64-encoded document. json { "content": "base64-encoded-content", "filename": "document.pdf" } #### convert_html_to_markdown Convert HTML content directly. json { "html": "<h1>Hello World</h1><p>Content here</p>" } ### Batch Processing #### batch_convert_urls Convert multiple documents from URLs simultaneously (max 20). json { "urls": [ "https://example.com/doc1.pdf", "https://example.com/doc2.docx", "https://example.com/doc3.pptx" ], "options": { "include_metadata": true } } #### batch_convert_files Convert multiple files from Key-Value Store (max 20). json { "keys": ["report1.pdf", "report2.docx", "data.xlsx"] } ### Utility Tools #### extract_metadata Extract metadata without full conversion. json { "url": "https://example.com/document.pdf" } #### get_supported_formats List all supported file formats. #### get_output_options Get available formatting options. ## Output Formatting Options Customize your markdown output with these options: | Option | Type | Default | Description | |--------|------|---------|-------------| | include_metadata | boolean | true | Include file metadata header | | include_toc | boolean | false | Generate table of contents | | heading_style | string | "atx" | "atx" (# Heading) or "setext" (underlined) | | table_format | string | "pipe" | "pipe" (|col|) or "simple" | | image_handling | string | "reference" | "inline", "reference", or "extract" | ### Example with Options json { "url": "https://example.com/report.pdf", "options": { "include_metadata": true, "include_toc": true, "heading_style": "atx", "table_format": "pipe" } } ## Usage ### With Claude Desktop Add to your Claude Desktop config (~/.config/claude/claude_desktop_config.json): json { "mcpServers": { "doc-to-markdown": { "url": "https://<YOUR_USERNAME>--doc-to-markdown-mcp.apify.actor/mcp", "headers": { "Authorization": "Bearer YOUR_APIFY_TOKEN" } } } } ### Uploading Files to Key-Value Store #### Via Apify Console 1. Go to Actor's Storage → Key-Value Store 2. Click Add record 3. Upload your file 4. Use the key name in convert_file_to_markdown #### Via API bash curl -X PUT \ "https://api.apify.com/v2/key-value-stores/YOUR_STORE_ID/records/document.pdf" \ -H "Authorization: Bearer YOUR_APIFY_TOKEN" \ -H "Content-Type: application/pdf" \ --data-binary @document.pdf
Categories
Common Use Cases
Market Research
Gather competitive intelligence and market data
Lead Generation
Extract contact information for sales outreach
Price Monitoring
Track competitor pricing and product changes
Content Aggregation
Collect and organize content from multiple sources
Ready to Get Started?
Try Doc To Markdown MCP Server now on Apify. Free tier available with no credit card required.
Start Free TrialActor Information
- Developer
- abotapi
- Pricing
- Paid
- Total Runs
- 14
- Active Users
- 1
Related Actors
Fast Website Content Crawler
by 6sigmag
Domain Availability, Expiry, WHOIS, DNS, IP, ASN, 70+ TLD
by datascoutapi
🧾 YouTube Extractor (Transcripts + Metadata)
by dz_omar
Email Verifier by Million Verifier - $1/1k emails
by account56
Apify provides a cloud platform for web scraping, data extraction, and automation. Build and run web scrapers in the cloud.
Learn more about ApifyNeed Professional Help?
Couldn't solve your problem? Hire a verified specialist on Fiverr to get it done quickly and professionally.
Trusted by millions | Money-back guarantee | 24/7 Support