- compose.yml: pass STREAMLIT_SERVER_BASE_URL_PATH from .env into container
Streamlit prefixes all asset URLs with this path so Caddy's handle_path routing works.
Without it, /static/* requests bypass the /peregrine* route and fail with a 503 / text/plain MIME error.
- config/server.yaml.example: document base_url_path + server_port settings
- .gitignore: ignore config/server.yaml (local instance of server.yaml.example)
- app/pages/2_Settings.py: add Deployment/Server expander under System tab
Shows the active base URL path from the environment; saves edits to config/server.yaml + .env;
prompts the user to run ./manage.sh restart to apply (save path sketched below).
Refs: https://docs.streamlit.io/develop/api-reference/configuration/config.toml#server.baseUrlPath
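
A minimal sketch of the Settings save path, assuming PyYAML and illustrative helper/key names (save_base_url_path, the server.yaml key layout); the actual page may structure this differently:

```python
# Hypothetical sketch: persist the edited value to the gitignored
# config/server.yaml and mirror it into .env so compose forwards it to the
# container as STREAMLIT_SERVER_BASE_URL_PATH on the next restart.
from pathlib import Path

import yaml

CONFIG = Path("config/server.yaml")
ENV_FILE = Path(".env")

def save_base_url_path(base_url_path: str, server_port: int) -> None:
    # Update (or create) the local server.yaml instance.
    data = yaml.safe_load(CONFIG.read_text()) if CONFIG.exists() else {}
    data = data or {}
    data.update({"base_url_path": base_url_path, "server_port": server_port})
    CONFIG.write_text(yaml.safe_dump(data, sort_keys=False))

    # Rewrite the matching .env line; compose.yml passes this variable
    # through to the app container.
    key = "STREAMLIT_SERVER_BASE_URL_PATH"
    lines = ENV_FILE.read_text().splitlines() if ENV_FILE.exists() else []
    lines = [ln for ln in lines if not ln.startswith(f"{key}=")]
    lines.append(f"{key}={base_url_path}")
    ENV_FILE.write_text("\n".join(lines) + "\n")
```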
- preflight.py now writes PEREGRINE_GPU_COUNT and PEREGRINE_GPU_NAMES to
.env so the app container gets GPU info without needing nvidia-smi access
- compose.yml passes PEREGRINE_GPU_COUNT, PEREGRINE_GPU_NAMES, and
RECOMMENDED_PROFILE as env vars to the app service
- 0_Setup.py _detect_gpus() reads the PEREGRINE_GPU_NAMES env var first,
falling back to nvidia-smi (bare-metal / GPU-passthrough environments); see the sketch below
- 0_Setup.py _suggest_profile() reads RECOMMENDED_PROFILE env var first
- requirements.txt: add pdfplumber (needed for resume PDF parsing)
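
A sketch of the fallback chain described above; the comma-separated format of PEREGRINE_GPU_NAMES is an assumption:

```python
import os
import subprocess

def _detect_gpus() -> list[str]:
    # Preflight writes PEREGRINE_GPU_NAMES into .env and compose.yml forwards
    # it, so inside the app container the env var is the primary source.
    names = os.environ.get("PEREGRINE_GPU_NAMES", "")
    if names:
        return [n.strip() for n in names.split(",") if n.strip()]
    # Fallback for bare-metal or GPU-passthrough environments where
    # nvidia-smi is directly reachable.
    try:
        out = subprocess.run(
            ["nvidia-smi", "--query-gpu=name", "--format=csv,noheader"],
            capture_output=True, text=True, timeout=5, check=True,
        )
        return [ln.strip() for ln in out.stdout.splitlines() if ln.strip()]
    except (FileNotFoundError, subprocess.CalledProcessError,
            subprocess.TimeoutExpired):
        return []
```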
- preflight: ollama was incorrectly marked docker_owned=False. The compose file
does define an ollama service, so external detection now correctly disables it
via compose.override.yml when a host Ollama is already running
- compose.yml: finetune moves from the [cpu,single-gpu,dual-gpu] profiles to the
[finetune] profile so it is never built during 'make start' (the pytorch/cuda
base image is 3.7 GB+ and unnecessary for the UI)
- compose.yml: remove depends_on ollama from finetune; it reaches Ollama via
the OLLAMA_URL env var, which works whether Ollama runs in Docker or on the host
- Makefile: finetune target uses --profile finetune + compose.gpu.yml overlay
preflight.py now detects when a managed service (ollama, vllm, vision,
searxng) is already running on its configured port and adopts it rather
than reassigning or conflicting:
- Generates compose.override.yml disabling the Docker containers for adopted
services (profiles: [_external_], a profile never passed via --profile);
see the sketch after this list
- Rewrites config/llm.yaml base_url entries to host.docker.internal:<port>
so the app container can reach host-side services through Docker's
host-gateway mapping
- compose.yml: adds extra_hosts host.docker.internal:host-gateway to the
app service (required on Linux; no-op on macOS Docker Desktop)
- .gitignore: excludes compose.override.yml (auto-generated, host-specific)
Only streamlit is non-adoptable and continues to reassign on conflict.
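
A condensed sketch of the adoption step, assuming illustrative port defaults and a flat service → base_url layout in config/llm.yaml:

```python
# Sketch of the adoption logic in preflight.py. The override parks each
# adopted service in a profile that is never passed via --profile, so
# docker compose skips it; llm.yaml is rewritten so the app container
# reaches the host-side service through the host-gateway mapping added
# via extra_hosts. ADOPTABLE ports and the llm.yaml shape are assumptions.
from pathlib import Path

import yaml

ADOPTABLE = {"ollama": 11434, "vllm": 8000, "vision": 9100, "searxng": 8080}

def write_override(adopted: dict[str, int]) -> None:
    override = {
        "services": {name: {"profiles": ["_external_"]} for name in adopted}
    }
    Path("compose.override.yml").write_text(yaml.safe_dump(override))

def rewrite_llm_yaml(adopted: dict[str, int]) -> None:
    cfg_path = Path("config/llm.yaml")
    cfg = yaml.safe_load(cfg_path.read_text())
    for name, port in adopted.items():
        entry = cfg.get(name)
        if entry and "base_url" in entry:
            # host.docker.internal resolves inside the app container thanks
            # to the extra_hosts host-gateway entry (required on Linux).
            entry["base_url"] = f"http://host.docker.internal:{port}"
    cfg_path.write_text(yaml.safe_dump(cfg, sort_keys=False))
```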
- Remove ollama-gpu service (was colliding with ollama on port 11434)
- Strip inline deploy.resources GPU blocks from vision and vllm
- Add compose.gpu.yml: Docker NVIDIA overlay for ollama (GPU 0),
vision (GPU 0), vllm (GPU 1), finetune (GPU 0)
- Fix compose.podman-gpu.yml: rename ollama-gpu → ollama to match the
service name now that ollama-gpu is gone
- Update Makefile: apply compose.gpu.yml for Docker + GPU profiles
(was only applying podman-gpu.yml for Podman + GPU profiles)
- Dockerfile.finetune: PyTorch 2.3/CUDA 12.1 base + unsloth + training stack
- finetune_local.py: auto-register the model via the Ollama HTTP API after GGUF
export; translate paths between the finetune container's mount and Ollama's view
of the same volume; update config/llm.yaml automatically; DOCS_DIR env override
for Docker (registration sketched after this group)
- prepare_training_data.py: DOCS_DIR env override so make prepare-training
works correctly inside the app container
- compose.yml: add finetune service (cpu/single-gpu/dual-gpu profiles);
DOCS_DIR=/docs injected into app + finetune containers
- compose.podman-gpu.yml: CDI device override for finetune service
- Makefile: make prepare-training + make finetune targets
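
A minimal sketch of the post-export registration, using the classic Ollama POST /api/create payload with an inline Modelfile (newer Ollama releases use a different schema); the mount pair constants are illustrative:

```python
import json
import os
import urllib.request

OLLAMA_URL = os.environ.get("OLLAMA_URL", "http://localhost:11434")
FINETUNE_MOUNT = "/models"             # illustrative: finetune container's view
OLLAMA_MOUNT = "/root/.ollama/models"  # illustrative: Ollama's view of the volume

def register_gguf(name: str, gguf_path: str) -> None:
    # Translate the container-local GGUF path into the path Ollama sees.
    ollama_path = gguf_path.replace(FINETUNE_MOUNT, OLLAMA_MOUNT, 1)
    payload = json.dumps({
        "name": name,
        "modelfile": f"FROM {ollama_path}",
    }).encode()
    req = urllib.request.Request(
        f"{OLLAMA_URL}/api/create",
        data=payload,
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        for line in resp:  # Ollama streams JSON status lines
            print(json.loads(line).get("status", ""))
```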
scripts/preflight.py (stdlib-only, no psutil):
- Port probing: owned services auto-reassign to next free port; external
services (Ollama) show ✓ reachable / ⚠ not responding
- System resources: CPU cores, RAM (total + available), GPU VRAM via
nvidia-smi; works on Linux + macOS
- Profile recommendation: remote / cpu / single-gpu / dual-gpu
- vLLM KV cache offload: calculates CPU_OFFLOAD_GB when free VRAM < 10 GB
and RAM headroom > 4 GB (uses up to 25% of the available headroom); sketched below
- Writes resolved values to .env for docker compose; single-service mode
(--service streamlit) for scripted port queries
- Exit 0 unless an owned port genuinely can't be resolved
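
A condensed, stdlib-only sketch of the two core calculations (port resolution and the offload heuristic); constants and helper names are illustrative, not the script's actual identifiers:

```python
import socket

def port_free(port: int, host: str = "127.0.0.1") -> bool:
    # connect_ex returns 0 when something is already listening.
    with socket.socket(socket.AF_INET, socket.SOCK_STREAM) as s:
        return s.connect_ex((host, port)) != 0

def resolve_owned_port(preferred: int, limit: int = 50) -> int:
    # Owned services auto-reassign to the next free port; the resolved
    # value is written to .env for docker compose.
    for port in range(preferred, preferred + limit):
        if port_free(port):
            return port
    raise RuntimeError(f"no free port near {preferred}")  # nonzero-exit case

def cpu_offload_gb(free_vram_gb: float, ram_headroom_gb: float) -> float:
    # Offload the vLLM KV cache only when free VRAM is tight (< 10 GB) and
    # there is more than 4 GB of RAM headroom; use at most 25% of it.
    if free_vram_gb >= 10.0 or ram_headroom_gb <= 4.0:
        return 0.0
    return round(ram_headroom_gb * 0.25, 1)
```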
scripts/manage-ui.sh:
- Calls preflight.py --service streamlit before binding; falls back to a
pure-bash port scan if Python or PyYAML is unavailable
compose.yml:
- vllm command: adds --cpu-offload-gb ${CPU_OFFLOAD_GB:-0}
Makefile:
- start / restart depend on preflight target
- PYTHON variable for env portability
- test target uses PYTHON variable
- Add moondream2 vision service to compose.yml (single-gpu + dual-gpu profiles)
- Create scripts/vision_service/Dockerfile for the vision container
- Add VISION_PORT, VISION_MODEL, VISION_REVISION vars to .env.example
- Add Vision Service entry to the SERVICES list in Settings (hidden unless a GPU profile is active)
- Add Fine-Tune Wizard tab (Task 10) to Settings with 3-step upload→preview→train flow
- Tab is always rendered; shows an info message when a non-GPU profile is active (gating sketched below)
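
A hypothetical sketch of the profile gating, assuming the RECOMMENDED_PROFILE names listed earlier and an illustrative VISION_PORT default:

```python
import os

import streamlit as st

profile = os.environ.get("RECOMMENDED_PROFILE", "cpu")
gpu_active = profile.endswith("gpu")  # matches single-gpu / dual-gpu

# Vision Service only appears under a GPU profile; ports other than
# Ollama's 11434 are illustrative defaults.
SERVICES = [("Ollama", 11434), ("SearXNG", 8080)]
if gpu_active:
    SERVICES.append(("Vision Service", int(os.environ.get("VISION_PORT", "9100"))))

system_tab, finetune_tab = st.tabs(["System", "Fine-Tune Wizard"])

with finetune_tab:
    if not gpu_active:
        # Tab always renders, but degrades gracefully on non-GPU profiles.
        st.info("Fine-tuning requires a GPU profile.")
    else:
        # 3-step upload -> preview -> train flow
        step = st.radio("Step", ["Upload", "Preview", "Train"], horizontal=True)
```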