Dynamic Budget Orchestrator
Use this skill to make LLM token limits elastic to local resources and request size.
Workflow
- Capture RAM and VRAM snapshot (best effort).
- Estimate prompt size and compute a prompt-based cap.
- Apply a profile factor (safe, fast, balanced, quality).
- Clamp to min/max and emit an effective token budget.
Scripts
- Run: python skills/automation/dynamic-budget-orchestrator/scripts/probe_budget.py
References
- references/profile_presets.json
Scan to join WeChat group