dynamic-budget-orchestrator

Dynamically scale model token budgets using resource telemetry, prompt size, and profile presets. Use when token limits must adapt to hardware constraints, per-request size, or safe/fast/quality modes.

Dynamic Budget Orchestrator

Use this skill to make LLM token limits elastic to local resources and request size.

Workflow

Capture RAM and VRAM snapshot (best effort).
Estimate prompt size and compute a prompt-based cap.
Apply a profile factor (safe, fast, balanced, quality).
Clamp to min/max and emit an effective token budget.

Scripts

Run: python skills/automation/dynamic-budget-orchestrator/scripts/probe_budget.py

References

references/profile_presets.json