Category: Other (no API key required)

OCR Local V2

Extracts text from images using Tesseract.js OCR (runs 100% locally, no API key required). Supports Chinese (Simplified and Traditional) and English.

Author: 15914355527hubclawhub

OCR - Image Text Recognition (Local)

Extract text from images using Tesseract.js. Everything runs locally, so no API key is required. Supported languages are Simplified Chinese, Traditional Chinese, and English.

Quick start

node {baseDir}/scripts/ocr.js /path/to/image.jpg
node {baseDir}/scripts/ocr.js /path/to/image.png --lang chi_sim
node {baseDir}/scripts/ocr.js /path/to/image.jpg --lang chi_tra+eng

Options

  • --lang <langs>: Language codes (default: chi_sim+eng)

    • chi_sim - Simplified Chinese
    • chi_tra - Traditional Chinese
    • eng - English
    • Combine several codes with +, for example chi_sim+eng
  • --json: Output as JSON instead of plain text
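To make the option semantics concrete, here is a minimal sketch of how a script like this might interpret its arguments. It is not the actual scripts/ocr.js: the names `parseOcrArgs`, `SUPPORTED_LANGS`, and `DEFAULT_LANG` are hypothetical, and rejecting codes outside the three listed above is an assumption (the real script may accept other Tesseract language codes).

```javascript
// Hypothetical sketch; the real scripts/ocr.js may parse its arguments differently.
const SUPPORTED_LANGS = new Set(['chi_sim', 'chi_tra', 'eng']); // codes documented above
const DEFAULT_LANG = 'chi_sim+eng'; // documented default

function parseOcrArgs(argv) {
  const opts = { imagePath: null, lang: DEFAULT_LANG, json: false };
  for (let i = 0; i < argv.length; i++) {
    const arg = argv[i];
    if (arg === '--lang') {
      const value = argv[++i]; // the value follows the flag
      if (value === undefined) throw new Error('--lang requires a value');
      opts.lang = value;
    } else if (arg === '--json') {
      opts.json = true;
    } else if (opts.imagePath === null) {
      opts.imagePath = arg; // first positional argument is the image
    } else {
      throw new Error(`Unexpected argument: ${arg}`);
    }
  }
  if (opts.imagePath === null) throw new Error('Missing image path');

  // Validate every code in a "+"-joined list such as chi_tra+eng.
  const unknown = opts.lang.split('+').filter((code) => !SUPPORTED_LANGS.has(code));
  if (unknown.length > 0) {
    throw new Error(`Unsupported language code(s): ${unknown.join(', ')}`);
  }
  return opts;
}

// Example: the arguments that would follow "node ocr.js" on the command line.
console.log(parseOcrArgs(['mixed.png', '--lang', 'chi_tra+eng', '--json']));
```

Validating the codes before any recognition starts means a typo such as chi-sim fails immediately instead of triggering a language data download that cannot succeed.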

Examples

# Recognize a Chinese screenshot (uses the default chi_sim+eng)
node {baseDir}/scripts/ocr.js screenshot.png

# Recognize an English document
node {baseDir}/scripts/ocr.js document.jpg --lang eng

# Mixed Chinese + English
node {baseDir}/scripts/ocr.js mixed.png --lang chi_sim+eng
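The same commands can also be issued from another Node.js program. The sketch below assumes the script prints its result to stdout, which the --json option suggests but the documentation does not state outright. `buildOcrCommand` and `runOcr` are hypothetical helper names, and `baseDir` stands for whatever directory {baseDir} resolves to in your installation.

```javascript
const { execFile } = require('node:child_process');
const path = require('node:path');

// Build the argument list for: node <baseDir>/scripts/ocr.js <image> [--lang ...] [--json]
function buildOcrCommand(baseDir, imagePath, { lang, json = false } = {}) {
  const args = [path.join(baseDir, 'scripts', 'ocr.js'), imagePath];
  if (lang) args.push('--lang', lang); // omit to use the script's default (chi_sim+eng)
  if (json) args.push('--json');
  return args;
}

// Run the script with the current Node binary and resolve with its stdout.
// Assumption: the recognized text (or JSON) is written to stdout.
function runOcr(baseDir, imagePath, options) {
  return new Promise((resolve, reject) => {
    const args = buildOcrCommand(baseDir, imagePath, options);
    execFile(process.execPath, args, (err, stdout) => {
      if (err) reject(err);
      else resolve(stdout);
    });
  });
}
```

A call such as `runOcr(baseDir, 'document.jpg', { lang: 'eng' })` mirrors the second example above. Passing arguments as an array to `execFile` avoids shell quoting problems with image paths that contain spaces.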

Notes

  • The first run downloads language data (~20MB per language)
  • The downloaded data is cached locally, so subsequent runs skip the download
  • Works best with clear, high-contrast images
  • For handwritten text, accuracy may vary