ocr-and-documents

Name: ocr-and-documents
Author: NousResearch

NousResearch · Development

从 PDF 及扫描文档中提取文本。使用 web_extract 处理远程 URL，pymupdf 处理本地文本型 PDF，marker-pdf 处理 OCR/扫描文档。DOCX 使用 python-docx，PPTX 参见 PowerPoint 技能。

Extract text from PDFs and scanned documents. Use web_extract for remote URLs, pymupdf for local text-based PDFs, marker-pdf for OCR/scanned docs. For DOCX use python-docx, for PPTX see the powerpoint skill.

npx skills add https://github.com/NousResearch/hermes-agent --skill ocr-and-documents

星标 191964 · 安装量 3

GitHub · SkillBox 全部技能