max_mb // 2 was too loose — Qwen2.5-3B needs ~5.9 GB on an 8 GB card but the threshold only required 3.25 GB free, allowing Ollama to hold 4.5 GB while a load attempt was still dispatched (causing OOM crash). - node_selector: can_fit = free_mb >= service_max_mb (was // 2) - coordinator /start: same threshold fix + updated error message - tests: two new node_selector tests pin the full-ceiling semantics; updated stale docstring in coordinator app test |
||
|---|---|---|
| .. | ||
| test_resources | ||
| test_tasks | ||
| __init__.py | ||
| test_config.py | ||
| test_db.py | ||
| test_llm_router.py | ||
| test_stubs.py | ||
| test_tiers.py | ||