Circuit-Forge/circuitforge-core

pyr0ball 383897f990

feat: platforms module + docs + scripts

- platforms/: eBay platform adapter (snipe integration layer)
- docs/: developer guide, module reference, getting-started docs
- scripts/: utility scripts for development and deployment

2026-04-24 15:23:16 -07:00

tasks

VRAM-aware background task scheduler. Manages a queue of LLM inference jobs and coordinates VRAM allocation with the cf-orch coordinator before executing each task.

from circuitforge_core.tasks import TaskScheduler, get_scheduler, reset_scheduler

Why VRAM-aware scheduling

Running multiple LLM inference jobs concurrently on a single GPU can exhaust VRAM, causing out-of-memory (OOM) errors and corrupted outputs. The scheduler serializes LLM work per service and negotiates with the cf-orch coordinator so that tasks from different products don't compete for the same VRAM budget.

Core API

`get_scheduler() -> TaskScheduler`

Returns the singleton scheduler for the current process. Creates it on first call.

`reset_scheduler()`

Tears down the scheduler: releases VRAM leases and cancels pending tasks. Call it during FastAPI lifespan teardown.

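# In FastAPI lifespan
from contextlib import asynccontextmanager

from fastapi import FastAPI

from circuitforge_core.tasks import get_scheduler, reset_scheduler

@asynccontextmanager
async def lifespan(app: FastAPI):
    scheduler = get_scheduler()   # create the singleton at startup
    yield
    reset_scheduler()             # release VRAM leases, cancel pending tasks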
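
The get-or-create and teardown behavior described above can be pictured with a small, self-contained sketch. It is not the library's implementation: `DemoScheduler`, `get_demo_scheduler`, and `reset_demo_scheduler` are hypothetical stand-ins, and the `service_name` parameter is only assumed from the shim example later in this page.

```python
import threading
from typing import Optional


class DemoScheduler:
    """Hypothetical stand-in for TaskScheduler; not the real class."""

    def __init__(self, service_name: str) -> None:
        self.service_name = service_name
        self.closed = False

    def shutdown(self) -> None:
        # A real scheduler would release VRAM leases and cancel pending tasks here.
        self.closed = True


_scheduler: Optional[DemoScheduler] = None
_lock = threading.Lock()


def get_demo_scheduler(service_name: str = "default") -> DemoScheduler:
    """Return the process-wide instance, creating it on the first call."""
    global _scheduler
    with _lock:
        if _scheduler is None:
            _scheduler = DemoScheduler(service_name)
        return _scheduler


def reset_demo_scheduler() -> None:
    """Shut down the current instance so a later get creates a fresh one."""
    global _scheduler
    with _lock:
        if _scheduler is not None:
            _scheduler.shutdown()
            _scheduler = None
```

In this sketch, repeated calls to `get_demo_scheduler()` return the same object until `reset_demo_scheduler()` runs, which is why teardown belongs in the lifespan exit path.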

`scheduler.submit(task_type, payload, vram_gb) -> str`

Enqueues a task and returns its task ID. Before executing the task, the scheduler acquires a VRAM lease from the coordinator.

task_id = await scheduler.submit(
    task_type="recipe_llm",
    payload={"pantry_ids": [1, 2, 3]},
    vram_gb=4.0,
)

`scheduler.result(task_id) -> TaskResult | None`

Polls for a completed result. Returns `None` if the task is still running.
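
Because `result` returns `None` until the task finishes, callers usually wrap it in a polling loop. The helper below is a minimal sketch of one way to do that; `wait_for_result`, `interval`, and `timeout` are hypothetical names that are not part of `circuitforge_core`. It relies only on the `result(task_id)` method documented above and accepts either a plain or an awaitable return value, since this page does not say which one the scheduler uses.

```python
import asyncio
import inspect
import time
from typing import Any


async def wait_for_result(
    scheduler: Any,
    task_id: str,
    interval: float = 0.5,
    timeout: float = 60.0,
) -> Any:
    """Poll scheduler.result(task_id) until it returns something other than None."""
    deadline = time.monotonic() + timeout
    while True:
        result = scheduler.result(task_id)
        if inspect.isawaitable(result):
            # Support an async result() as well as a synchronous one.
            result = await result
        if result is not None:
            return result
        if time.monotonic() >= deadline:
            raise TimeoutError(f"task {task_id} did not finish within {timeout}s")
        # Yield to the event loop between polls instead of busy-waiting.
        await asyncio.sleep(interval)
```

Inside a request handler this might be called as `result = await wait_for_result(scheduler, task_id, timeout=30.0)`. A short interval keeps latency low at the cost of more polling calls.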

VRAM budgets

Each product defines its VRAM budgets in `compose.yml` / `compose.override.yml`:

environment:
  VRAM_BUDGET_RECIPE_LLM: "4.0"
  VRAM_BUDGET_EXPIRY_LLM: "2.0"

These map to task types in the scheduler. If the coordinator is unavailable (local dev without cf-orch), the scheduler falls back to sequential local execution.
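
As an illustration of how such variables could be turned into per-task budgets, the sketch below collects every `VRAM_BUDGET_*` variable into a dictionary. The naming convention it applies (strip the prefix and lowercase the rest, so `VRAM_BUDGET_RECIPE_LLM` becomes `recipe_llm`) is an assumption inferred from the examples on this page, and `load_vram_budgets` is a hypothetical helper, not a `circuitforge_core` function.

```python
import os
from typing import Mapping, Optional

PREFIX = "VRAM_BUDGET_"


def load_vram_budgets(env: Optional[Mapping[str, str]] = None) -> dict[str, float]:
    """Collect VRAM_BUDGET_* variables into {task_type: gigabytes}."""
    env = os.environ if env is None else env
    budgets: dict[str, float] = {}
    for name, value in env.items():
        if name.startswith(PREFIX) and len(name) > len(PREFIX):
            # Assumed convention: VRAM_BUDGET_RECIPE_LLM -> "recipe_llm"
            budgets[name[len(PREFIX):].lower()] = float(value)
    return budgets


# Example with an explicit mapping instead of the real process environment.
budgets = load_vram_budgets({
    "VRAM_BUDGET_RECIPE_LLM": "4.0",
    "VRAM_BUDGET_EXPIRY_LLM": "2.0",
    "PATH": "/usr/bin",
})
```

Passing the environment in as a mapping keeps the helper easy to test without mutating `os.environ`.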

Shim pattern

Products that need to re-export scheduler functions for backward compatibility use a shim:

# myproduct/app/tasks/scheduler.py
from circuitforge_core.tasks.scheduler import (
    get_scheduler as _base_get_scheduler,
    reset_scheduler,          # re-export for lifespan teardown
)

def get_scheduler():
    """Product-specific scheduler with service name injected."""
    return _base_get_scheduler(service_name="myproduct")

Always re-export `reset_scheduler` from the shim so the FastAPI lifespan can import it from one place.
