From 27d4b0e732aac991aaea88c34f803720ac9db226 Mon Sep 17 00:00:00 2001 From: pyr0ball Date: Sun, 15 Mar 2026 05:01:24 -0700 Subject: [PATCH] =?UTF-8?q?feat:=20LLM=20queue=20optimizer=20complete=20?= =?UTF-8?q?=E2=80=94=20closes=20#2?= MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit Resource-aware batch scheduler for LLM tasks: - scripts/task_scheduler.py (new): TaskScheduler singleton with VRAM-aware batch scheduling, durability, thread-safe singleton, memory safety - scripts/task_runner.py: submit_task() routes LLM types through scheduler - scripts/db.py: reset_running_tasks() for durable restart behavior - app/app.py: _startup() preserves queued tasks on restart - config/llm.yaml.example: scheduler VRAM budget config documented - tests/test_task_scheduler.py (new): 24 tests covering all behaviors Pre-existing failure: test_generate_calls_llm_router (issue #12, unrelated)