返回 Skill 列表
extension
分类: 开发与工程需要 API Key

ElevenLabs STT OpenClaw

使用 ElevenLabs Speech‑to‑Text(Scribe v2)在本地 CLI 转录音频文件,支持说话人分离、事件、JSON 输出、webhooks 及高级 STT 选项。

person作者: xhunxhubclawhub

ElevenLabs Speech-to-Text (Local CLI)

Use

Run the script in scripts/transcribe.sh with an audio file path or URL.

Examples:

scripts/transcribe.sh /path/to/audio.mp3
scripts/transcribe.sh /path/to/audio.mp3 --diarize --lang en
scripts/transcribe.sh /path/to/audio.mp3 --json
scripts/transcribe.sh /path/to/audio.mp3 --webhook --webhook-metadata '{"job":"call-001"}'
scripts/transcribe.sh --url https://example.com/audio.mp3 --lang en

Environment

Set ELEVENLABS_API_KEY in your shell or OpenClaw env before running.

Notes

  • Defaults to scribe_v2 (the Speech-to-Text model) and uses a filesystem lock to avoid parallel requests.
  • Requires curl and jq.
  • For async workflows, use --webhook with optional --webhook-id and --webhook-metadata.
  • Realtime streaming is available via scripts/realtime.sh (requires ffmpeg + websocat) and uses the scribe_v2_realtime model.
  • Live listener mode is available via scripts/live_listen.sh with toggle/always-on modes and optional TTS response.