Category: Development & Engineering · No API key required

libtransform

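libtransform is a resource transformation tool. pdfToHtml uses LLM vision capabilities to convert PDF documents to HTML by splitting pages into images and processing them with a multimodal model. It can be used for document conversion, PDF processing, and extracting knowledge from documents.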

Author: jakexiaohub (GitHub)

libtransform Skill

When to Use

  • Converting PDF documents to HTML
  • Extracting content from scanned documents
  • Processing documents with LLM vision models
  • Building document transformation pipelines

Key Concepts

pdfToHtml: Splits a PDF into page images, sends them to a vision-capable LLM, and assembles the responses into HTML output with semantic structure.
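
The split, send, and assemble flow described above can be sketched roughly as follows. This is an illustration under stated assumptions, not libtransform's actual implementation: `convertPdfSketch`, `PipelineDeps`, `splitPages`, and `pageToHtml` are hypothetical names, and the rendering and model calls are injected as functions so the sketch does not depend on any particular PDF library or LLM client.

```typescript
// Hypothetical sketch: none of these names come from libtransform itself.
type PageImage = Uint8Array;

interface PipelineDeps {
  // Assumed step 1: rasterize the PDF into one image per page.
  splitPages: (pdf: Uint8Array) => Promise<PageImage[]>;
  // Assumed step 2: send one page image to a vision-capable model
  // and get back an HTML fragment for that page.
  pageToHtml: (image: PageImage, pageNumber: number) => Promise<string>;
}

async function convertPdfSketch(
  pdf: Uint8Array,
  deps: PipelineDeps,
  maxPages = 50,
): Promise<string> {
  // Cap the number of pages that are sent to the model.
  const images = (await deps.splitPages(pdf)).slice(0, maxPages);
  const sections: string[] = [];

  // Process pages one at a time so the output keeps document order.
  for (let i = 0; i < images.length; i++) {
    const fragment = await deps.pageToHtml(images[i], i + 1);
    sections.push(`<section data-page="${i + 1}">${fragment}</section>`);
  }

  // Step 3: assemble the per-page fragments into a single HTML document.
  return `<!DOCTYPE html>\n<html><body>\n${sections.join("\n")}\n</body></html>`;
}
```

Injecting the two steps as functions keeps the sketch testable with stubs. Whether the real pdfToHtml processes pages sequentially or interprets `maxPages` as a simple cap is not stated in the skill description, so both are assumptions here.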

Usage Patterns

Pattern 1: Convert PDF to HTML

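import fs from "node:fs/promises";
import { pdfToHtml } from "@copilot-ld/libtransform";

// Read the PDF into a Buffer, then convert it using a vision-capable model.
const pdfBuffer = await fs.readFile("document.pdf");
const html = await pdfToHtml(pdfBuffer, {
  model: "gpt-4-vision-preview",
  maxPages: 50,
});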

Integration

Used by the libingest pipeline for document processing. Requires an LLM with vision capabilities.