When the coordinator returns 429 (all nodes at the max_concurrent limit), the previous code fell back to LLMRouter, which is also overloaded at high concurrency. The request then hung for ~60s before nginx returned a 504. Now the code detects 429/max_concurrent in the RuntimeError message and returns "" immediately, so the caller gets an empty RecipeResult (graceful degradation) rather than a timeout. |
||
|---|---|---|
| api | ||
| core | ||
| db | ||
| models | ||
| services | ||
| staples | ||
| static | ||
| styles | ||
| tasks | ||
| utils | ||
| __init__.py | ||
| cloud_session.py | ||
| main.py | ||
| tiers.py | ||
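
The short-circuit described in the commit message at the top of this listing can be sketched roughly as follows. This is a minimal illustration, not the repository's actual code: `is_capacity_error`, `generate_text`, `call_coordinator`, and `call_fallback` are hypothetical names, and the exact wording of the RuntimeError message is an assumption.

```python
from typing import Callable


def is_capacity_error(exc: RuntimeError) -> bool:
    """Return True if the error message indicates a 429 / max_concurrent rejection.

    Assumption: the coordinator client surfaces the HTTP status or the
    limit name in the exception text. A plain substring check on "429"
    can also match unrelated numbers, so a real implementation may want
    a stricter pattern.
    """
    msg = str(exc).lower()
    return "429" in msg or "max_concurrent" in msg


def generate_text(
    prompt: str,
    call_coordinator: Callable[[str], str],  # hypothetical coordinator client
    call_fallback: Callable[[str], str],     # hypothetical stand-in for LLMRouter
) -> str:
    """Try the coordinator first; skip the fallback when the cluster is saturated."""
    try:
        return call_coordinator(prompt)
    except RuntimeError as exc:
        if is_capacity_error(exc):
            # Every node is busy, so the fallback is likely overloaded too.
            # Return an empty string right away instead of waiting for a timeout.
            return ""
        # Any other failure still goes through the fallback path.
        return call_fallback(prompt)
```

Under these assumptions, a caller that receives `""` can build an empty result object straight away, which matches the graceful degradation the commit message describes.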