Add idle_stop_after_s to ServiceProfile (default 0 = never stop). Set 600s (10 min) timeout on vllm slot in all single-GPU profiles. Backward compatible; non-vllm services inherit default 0 (no auto-stop). |
||
|---|---|---|
| .. | ||
| config | ||
| db | ||
| llm | ||
| pipeline | ||
| resources | ||
| tasks | ||
| tiers | ||
| vision | ||
| wizard | ||
| __init__.py | ||