返回 Skill 列表
extension
分类: 开发与工程需要 API Key

Cloudflare Whisper Worker

使用已部署的Cloudflare Worker Whisper端点转录音频。用于将语音/音频文件(wav、mp3、m4a、ogg、webm)转换为文本。

person作者: lotfinityhubclawhub

Cloudflare Whisper Worker

Use this skill to transcribe audio through the deployed Whisper Worker API.

Endpoint

  • Base URL: https://lotfi-whisper-worker.medtouradmin.workers.dev
  • Route: POST /transcribe
  • Auth: Authorization: Bearer <API_TOKEN>
  • Body: raw audio bytes (--data-binary @file)

Required environment variable

Set token once per shell:

export WHISPER_WORKER_TOKEN="<your_token>"

Transcribe a file (JSON response)

curl -sS -X POST "https://lotfi-whisper-worker.medtouradmin.workers.dev/transcribe" \
  -H "content-type: audio/wav" \
  -H "authorization: Bearer $WHISPER_WORKER_TOKEN" \
  --data-binary "@audio.wav"

Transcribe and return only text

curl -sS -X POST "https://lotfi-whisper-worker.medtouradmin.workers.dev/transcribe" \
  -H "content-type: audio/wav" \
  -H "authorization: Bearer $WHISPER_WORKER_TOKEN" \
  --data-binary "@audio.wav" \
| jq -r '.result.text // .text // .result.response // empty'

Content-Type guide

  • WAV: audio/wav
  • MP3: audio/mpeg
  • M4A: audio/mp4
  • OGG/OPUS: audio/ogg
  • WEBM: audio/webm

Common errors

  • 401 Unauthorized: missing/invalid bearer token
  • 400 Empty audio body: file path wrong or empty file
  • 400 Send raw audio...: invalid content-type header
  • 500: worker/runtime/model error; retry and inspect full JSON