返回 Skill 列表
extension
分类: 开发与工程需要 API Key

Elevenlabs

通过ElevenLabs API实现文本转语音、音效、音乐生成、语音管理及配额查询。适用于使用ElevenLabs生成音频或管理音频任务。

person作者: odrobnikhubclawhub

ElevenLabs Skill

Core tools for interacting with the ElevenLabs API for sound generation, music, and voice management.

Setup

See SETUP.md for prerequisites and setup instructions.

Models

| Model | ID | Use Case | |-------|----|----------| | Eleven v3 | eleven_v3 | ⭐ Best for expressive/creative audio. Supports audio tags (square brackets): [laughs], [sighs], [whispers], [excited], [grumpy voice], [clears throat], etc. Use for storytelling, characters, demos. | | Multilingual v2 | eleven_multilingual_v2 | Stable multilingual. No audio tags. Good for straightforward narration. | | Turbo v2.5 | eleven_turbo_v2_5 | Low-latency, good for non-English (German TTS). Required for realtime/conversational. | | Flash v2.5 | eleven_flash_v2_5 | Fastest, lowest cost. |

v3 Audio Tags (square brackets, NOT XML/SSML)

[laughs], [chuckles], [sighs], [clears throat], [whispers], [shouts]
[excited], [sad], [angry], [warmly], [deadpan], [sarcastic]
[grumpy voice], [philosophical], [whiny voice], [resigned]
[laughs hard], [sighs deeply], [pause]

Tags can be placed anywhere in text. Combine freely. v3 understands emotional context deeply.

Output Formats

All scripts support multiple output formats via --format:

| Format | Description | |--------|-------------| | mp3_44100_128 | MP3, 44.1kHz, 128kbps (default) | | mp3_44100_192 | MP3, 44.1kHz, 192kbps | | mp3_44100_96 | MP3, 44.1kHz, 96kbps | | mp3_44100_64 | MP3, 44.1kHz, 64kbps | | mp3_44100_32 | MP3, 44.1kHz, 32kbps | | mp3_24000_48 | MP3, 24kHz, 48kbps | | mp3_22050_32 | MP3, 22.05kHz, 32kbps | | opus_48000_192 | Opus, 48kHz, 192kbps ⭐ best for AirPlay | | opus_48000_128 | Opus, 48kHz, 128kbps | | opus_48000_96 | Opus, 48kHz, 96kbps | | opus_48000_64 | Opus, 48kHz, 64kbps | | opus_48000_32 | Opus, 48kHz, 32kbps | | pcm_16000 | Raw PCM, 16kHz | | pcm_22050 | Raw PCM, 22.05kHz | | pcm_24000 | Raw PCM, 24kHz | | alaw_8000 | A-law, 8kHz (telephony) |

Tools

1. Speech (speech.py)

Text-to-speech using ElevenLabs voices.

# Basic usage
python3 {baseDir}/scripts/speech.py "Hello world" -v <voice_id> -o output.mp3

# With format option
python3 {baseDir}/scripts/speech.py "Hello world" -v <voice_id> -o output.pcm --format pcm_44100

# With voice settings
python3 {baseDir}/scripts/speech.py "Hello" -v <voice_id> -o out.mp3 --stability 0.7 --similarity 0.8

2. Sound Effects (sfx.py)

Generate sound effects and short audio clips.

# Generate a sound
python3 {baseDir}/scripts/sfx.py "Cinematic boom" -o boom.mp3

# Generate a loop
python3 {baseDir}/scripts/sfx.py "Lo-fi hip hop beat" --duration 10 --loop -o beat.mp3

# Different format
python3 {baseDir}/scripts/sfx.py "Whoosh" -o whoosh.pcm --format pcm_44100

3. Music Generation (music.py)

Generate full musical compositions or instrumental tracks.

# Generate instrumental intro
python3 {baseDir}/scripts/music.py --prompt "Upbeat 6s news intro sting, instrumental" --length-ms 6000 -o intro.mp3

# Generate background bed
python3 {baseDir}/scripts/music.py --prompt "Soft ambient synth pad" --length-ms 30000 -o bed.mp3

# High quality MP3
python3 {baseDir}/scripts/music.py --prompt "Jazz piano" --length-ms 10000 -o jazz.mp3 --output-format mp3_44100_192

4. Voices (voices.py)

List available voices and their IDs.

# List voices
python3 {baseDir}/scripts/voices.py

# JSON output
python3 {baseDir}/scripts/voices.py --json

5. Voice Cloning (voiceclone.py)

Create instant voice clones from audio samples.

Security: by default this script will only read files from:

  • ~/.openclaw/elevenlabs/voiceclone-samples/

Copy your samples there (or pass --sample-dir). Reading files outside the sample directory is blocked.

# Clone from audio files (put samples into ~/.openclaw/elevenlabs/voiceclone-samples)
python3 {baseDir}/scripts/voiceclone.py --name "MyVoice" --files sample1.mp3 sample2.mp3

# Use a custom sample dir
python3 {baseDir}/scripts/voiceclone.py --name "Andi" --sample-dir ./samples --files a.m4a b.m4a --language de --gender male

# With description and noise removal
python3 {baseDir}/scripts/voiceclone.py --name "Andi" --files a.m4a b.m4a --description "German male" --denoise

6. Quota & Usage (quota.py)

Check subscription quota and usage statistics.

# Show current quota
python3 {baseDir}/scripts/quota.py

# Include usage breakdown by voice
python3 {baseDir}/scripts/quota.py --usage

# Last 7 days usage
python3 {baseDir}/scripts/quota.py --usage --days 7

# JSON output
python3 {baseDir}/scripts/quota.py --json

Output:

📊 ElevenLabs Quota
=======================================
Plan:      pro (active) — annual
Characters: 66.6K / 500.0K (13.3%)
           [███░░░░░░░░░░░░░░░░░░░░░░░░░░░]
Resets:    2026-02-18 (29 days)
Voices:    22 / 160 (IVC: ✓)
Pro Voice: 0 / 1 (PVC: ✓)