返回 Skill 列表
extension
分类: 内容与媒体无需 API Key

alicloud-ai-audio-asr-realtime

当需要低延迟的实时语音识别时使用,包括流式麦克风输入、实时字幕或双工语音代理,适用于阿里云Model Studio Qwen ASR实时模型。

person作者: jakexiaohubgithub

Category: provider

Model Studio Qwen ASR Realtime

Validation

mkdir -p output/alicloud-ai-audio-asr-realtime
python -m py_compile skills/ai/audio/alicloud-ai-audio-asr-realtime/scripts/prepare_realtime_asr_request.py && echo "py_compile_ok" > output/alicloud-ai-audio-asr-realtime/validate.txt

Pass criteria: command exits 0 and output/alicloud-ai-audio-asr-realtime/validate.txt is generated.

Output And Evidence

  • Save session payloads and response samples under output/alicloud-ai-audio-asr-realtime/.

Critical model names

Use one of these exact model strings:

  • qwen3-asr-flash-realtime
  • qwen3-asr-flash-realtime-2026-02-10

Use cases

  • Realtime subtitles and captions
  • Voice-agent duplex input
  • Streaming speech-to-text in browser or terminal clients

Prerequisites

  • Set DASHSCOPE_API_KEY in your environment, or add dashscope_api_key to ~/.alibabacloud/credentials.
  • Realtime sessions generally require WebSocket or streaming session handling in the client.

Normalized interface (asr.realtime)

Request

  • model (string, optional): default qwen3-asr-flash-realtime
  • language_hints (array<string>, optional)
  • format (string, optional): e.g. pcm, wav
  • sample_rate (int, optional): e.g. 16000
  • chunk_ms (int, optional): frame size in milliseconds

Response

  • text (string): recognized transcript fragment
  • is_final (bool): finalization marker
  • usage (object, optional)

Quick start

Generate a request template:

python skills/ai/audio/alicloud-ai-audio-asr-realtime/scripts/prepare_realtime_asr_request.py \
  --output output/alicloud-ai-audio-asr-realtime/request.json

Operational guidance

  • Prefer 16kHz mono PCM unless your client stack requires another format.
  • Keep chunks small enough for responsive partial results.
  • If you only have recorded files, use skills/ai/audio/alicloud-ai-audio-asr/ instead.

References

  • references/sources.md