- 📁 references/
- 📁 scripts/
- 📄 SKILL.md
Convert Markdown documents to professionally typeset PDF files with reportlab. Handles CJK/Latin mixed text, fenced code blocks, tables, blockquotes, cover pages, clickable TOC, PDF bookmarks, watermarks, and page numbers. Supports multiple color themes (Warm Academic, Nord, GitHub Light, Solarized, etc.) and is battle-tested for Chinese technical reports. Use this skill whenever the user wants to turn a .md file into a styled PDF, generate a report PDF from markdown, or create a print-ready document from markdown content — especially if CJK characters, code blocks, or tables are involved. Also trigger when the user mentions "markdown to PDF", "md2pdf", "any2pdf", "md转pdf", "报告生成", or asks for a "typeset" or "professionally formatted" PDF from markdown source.
- 📁 modules/
- 📁 shared/
- 📄 README.md
- 📄 README_EN.md
- 📄 SKILL.md
论文全能处理工具。每当用户上传论文 PDF、粘贴论文文本/摘要,或提到"论文"、"paper"、"文献"、"arxiv"时必须使用此 skill。功能包括:深度分析报告、思维导图、审稿意见、宣传推广脚本、播客音频生成。即使用户只说"帮我看这篇论文"或直接上传 PDF 也必须触发。
- 📁 references/
- 📁 scripts/
- 📁 tests/
- 📄 .gitignore
- 📄 convert.bat
- 📄 convert.ps1
双向文档转换工具,将 Word (.docx)、Excel (.xlsx)、PowerPoint (.pptx) 和 PDF (.pdf) 转换为 AI 友好的 Markdown 格式,或将 Markdown (.md) 转换为 Word (.docx) 格式。当用户请求以下操作时使用:(1) 明确请求文档转换,包括任何包含"转换"、"转为"、"转成"、"convert"、"导出"、"export"等词汇的请求(例如:"转换文档"、"把这个文件转为docx"、"convert to markdown"、"导出为Word");(2) 需要 AI 理解文档内容("帮我分析这个 Word 文件"、"读取这个 PDF"、"总结这个 Excel");(3) 上传文档文件并询问内容("这是什么"、"帮我看看");(4) 任何涉及 .docx、.xlsx、.pptx、.pdf、.md 文件格式转换的请求。
REQUIRED for any task that involves reading or understanding a PDF's contents. When a user mentions a .pdf and asks what's inside — to read specific pages, answer questions, compare tables or data, extract facts or numbers, count figures, or locate topics — this skill MUST be used instead of generic PDF tools. Provides targeted search, page-level navigation, and precise extraction so you never need to dump an entire PDF into context. Only skip this skill for PDF file operations: merge, split, watermark, create, form-fill, or encrypt. --- # MinerU Document Explorer PDF reading toolkit via `doc-search` CLI. Workflow: **init → search/outline → read pages → (optionally) extract elements**. ## Commands | Command | Purpose | |---|---| | `init` | Upload PDF, start processing, get `doc_id` | | `outline` | Browse TOC/structure | | `pages` | Read specific pages (images or text) | | `search-keyword` | Find pages by regex pattern | | `search-semantic` | Find pages by semantic query | | `elements` | Extract evidence with bboxes and cropped images | All commands output JSON to stdout. Pipe `2>/dev/null` to suppress stderr logs. ```typescript // Shared types used across all commands interface Page {