Doc To Markdown MCP Server

Doc To Markdown MCP Server

by abotapi

An MCP server that converts documents to clean Markdown. Convert PDFs, Word docs, Excel spreadsheets, PowerPoints, HTML, images, and more to AI-friend...

14 runs
1 users
Try This Actor

Opens on Apify.com

About Doc To Markdown MCP Server

An MCP server that converts documents to clean Markdown. Convert PDFs, Word docs, Excel spreadsheets, PowerPoints, HTML, images, and more to AI-friendly Markdown format.

What does this actor do?

Doc To Markdown MCP Server is a web scraping and automation tool available on the Apify platform. It's designed to help you extract data and automate tasks efficiently in the cloud.

Key Features

  • Cloud-based execution - no local setup required
  • Scalable infrastructure for large-scale operations
  • API access for integration with your applications
  • Built-in proxy rotation and anti-blocking measures
  • Scheduled runs and webhooks for automation

How to Use

  1. Click "Try This Actor" to open it on Apify
  2. Create a free Apify account if you don't have one
  3. Configure the input parameters as needed
  4. Run the actor and download your results

Documentation

Document to Markdown MCP Server An Apify MCP server that converts documents to clean Markdown. Convert PDFs, Word docs, Excel spreadsheets, PowerPoints, HTML, images, and more to AI-friendly Markdown format. ## About this MCP Server To understand how to connect to and utilize this MCP server, please refer to the official Model Context Protocol documentation at mcp.apify.com. ### Connection URL MCP clients can connect to this server at: https://<YOUR_USERNAME>--doc-to-markdown-mcp.apify.actor/mcp ## Standby Mode This Actor uses Standby mode, a new, lightweight method for using Actors. Instead of starting an Actor for each input and waiting for results, the Actor remains ready in the background to handle arbitrary HTTP requests, just like any web or API server. Learn more. > This is a new feature, and we'd love to hear your feedback. ### Actor URL Send an HTTP request to this URL, and wait for the response: https://<YOUR_USERNAME>--doc-to-markdown-mcp.apify.actor?token=YOUR_APIFY_TOKEN The MCP endpoint is available at /mcp: https://<YOUR_USERNAME>--doc-to-markdown-mcp.apify.actor/mcp?token=YOUR_APIFY_TOKEN ## Key Features - Batch Processing: Handle up to 20 documents simultaneously for increased productivity - Extensive Format Support: Convert 15+ file formats with intelligent content extraction - Customizable Output: Tailor markdown formatting to meet specific needs - MCP Integration: Compatible with MCP-compatible AI systems and chatbots - Metadata Preservation: Maintain essential document metadata during conversion - Table and Image Handling: Recognize table structures and manage images effectively ## Supported Formats (15+) | Category | Formats | |----------|---------| | Documents | PDF, Word (.docx), PowerPoint (.pptx), Excel (.xlsx, .xls), RTF | | Web | HTML, XML | | Images | JPEG, PNG, GIF, BMP (with EXIF metadata extraction) | | Audio | WAV, MP3 (with speech transcription) | | Data | CSV, JSON, YAML | | Archives | ZIP (recursive extraction) | ## MCP Tools ### Single Document Conversion #### convert_url_to_markdown Convert a document from a URL to Markdown. json { "url": "https://example.com/document.pdf", "options": { "include_metadata": true, "include_toc": true } } #### convert_file_to_markdown Convert a file from Apify Key-Value Store. json { "key": "my-document.pdf", "options": { "include_metadata": true } } #### convert_base64_to_markdown Convert a base64-encoded document. json { "content": "base64-encoded-content", "filename": "document.pdf" } #### convert_html_to_markdown Convert HTML content directly. json { "html": "<h1>Hello World</h1><p>Content here</p>" } ### Batch Processing #### batch_convert_urls Convert multiple documents from URLs simultaneously (max 20). json { "urls": [ "https://example.com/doc1.pdf", "https://example.com/doc2.docx", "https://example.com/doc3.pptx" ], "options": { "include_metadata": true } } #### batch_convert_files Convert multiple files from Key-Value Store (max 20). json { "keys": ["report1.pdf", "report2.docx", "data.xlsx"] } ### Utility Tools #### extract_metadata Extract metadata without full conversion. json { "url": "https://example.com/document.pdf" } #### get_supported_formats List all supported file formats. #### get_output_options Get available formatting options. ## Output Formatting Options Customize your markdown output with these options: | Option | Type | Default | Description | |--------|------|---------|-------------| | include_metadata | boolean | true | Include file metadata header | | include_toc | boolean | false | Generate table of contents | | heading_style | string | "atx" | "atx" (# Heading) or "setext" (underlined) | | table_format | string | "pipe" | "pipe" (|col|) or "simple" | | image_handling | string | "reference" | "inline", "reference", or "extract" | ### Example with Options json { "url": "https://example.com/report.pdf", "options": { "include_metadata": true, "include_toc": true, "heading_style": "atx", "table_format": "pipe" } } ## Usage ### With Claude Desktop Add to your Claude Desktop config (~/.config/claude/claude_desktop_config.json): json { "mcpServers": { "doc-to-markdown": { "url": "https://<YOUR_USERNAME>--doc-to-markdown-mcp.apify.actor/mcp", "headers": { "Authorization": "Bearer YOUR_APIFY_TOKEN" } } } } ### Uploading Files to Key-Value Store #### Via Apify Console 1. Go to Actor's StorageKey-Value Store 2. Click Add record 3. Upload your file 4. Use the key name in convert_file_to_markdown #### Via API bash curl -X PUT \ "https://api.apify.com/v2/key-value-stores/YOUR_STORE_ID/records/document.pdf" \ -H "Authorization: Bearer YOUR_APIFY_TOKEN" \ -H "Content-Type: application/pdf" \ --data-binary @document.pdf

Common Use Cases

Market Research

Gather competitive intelligence and market data

Lead Generation

Extract contact information for sales outreach

Price Monitoring

Track competitor pricing and product changes

Content Aggregation

Collect and organize content from multiple sources

Ready to Get Started?

Try Doc To Markdown MCP Server now on Apify. Free tier available with no credit card required.

Start Free Trial

Actor Information

Developer
abotapi
Pricing
Paid
Total Runs
14
Active Users
1
Apify Platform

Apify provides a cloud platform for web scraping, data extraction, and automation. Build and run web scrapers in the cloud.

Learn more about Apify

Need Professional Help?

Couldn't solve your problem? Hire a verified specialist on Fiverr to get it done quickly and professionally.

Find a Specialist

Trusted by millions | Money-back guarantee | 24/7 Support