peregrine

Author	SHA1	Message	Date
pyr0ball	9fe9c6234d	fix: RerankerAdapter falls back to label name when no LABEL_DESCRIPTIONS entry	2026-02-27 14:54:31 -08:00
pyr0ball	23828520f0	feat: label_tool — 9 labels, wildcard Other, InvalidCharacterError fix; sync with avocet canonical	2026-02-27 14:34:24 -08:00
pyr0ball	a316f110c8	feat: add health mission category, trim-to-sign-off, max_tokens cap for cover letters - _MISSION_SIGNALS: add health category (pharma, clinical, patient care, etc.) listed last so music/animals/education/social_impact take priority - _MISSION_DEFAULTS: health note steers toward people-first framing, not industry enthusiasm — focuses on patients navigating rare/invisible journeys - _trim_to_letter_end(): cuts output at first sign-off + first name to prevent fine-tuned models from looping into repetitive garbage after completing letter - generate(): pass max_tokens=1200 to router (prevents runaway output) - user.yaml.example: add health + social_impact to mission_preferences, add candidate_voice field for per-user voice/personality context	2026-02-27 12:31:06 -08:00
pyr0ball	94734ad584	feat: benchmark_classifier — MODEL_REGISTRY, --list-models, --score, --compare modes	2026-02-27 06:19:32 -08:00
pyr0ball	889c55702e	feat: inject DUAL_GPU_MODE sub-profile in Makefile; update manage.sh help	2026-02-27 06:18:34 -08:00
pyr0ball	d626b20470	feat: add ollama_research service and update profiles for dual-gpu sub-profiles	2026-02-27 06:16:17 -08:00
pyr0ball	8e88a99a8e	feat: assign ollama_research to GPU 1 in Docker and Podman GPU overlays	2026-02-27 06:16:04 -08:00
pyr0ball	6ca5893b1c	feat: add DUAL_GPU_MODE default, VRAM warning, and download size report to preflight - Add _mixed_mode_vram_warning() to flag low VRAM on GPU 1 in mixed mode - Wire download size report block into main() before closing border line - Wire mixed-mode VRAM warning into report if triggered - Write DUAL_GPU_MODE=ollama default to .env for new 2-GPU setups (no override if already set) - Promote import os to top-level (was local import inside get_cpu_cores)	2026-02-27 00:17:00 -08:00
pyr0ball	5ab3e2dc39	feat: add _download_size_mb() pure function for preflight size warning	2026-02-27 00:15:26 -08:00
pyr0ball	e79404d316	feat: add ollama_research to preflight service table and LLM backend map	2026-02-27 00:14:04 -08:00
pyr0ball	1c421afbd9	test: add failing tests for dual-gpu preflight additions	2026-02-27 00:11:39 -08:00
pyr0ball	5497674b34	feat: ZeroShotAdapter, GLiClassAdapter, RerankerAdapter with full mock test coverage	2026-02-27 00:10:43 -08:00
pyr0ball	3e47afd953	feat: ClassifierAdapter ABC + compute_metrics() with full test coverage	2026-02-27 00:09:45 -08:00
pyr0ball	f9a329fb57	feat: add vllm_research backend and update research_fallback_order	2026-02-27 00:09:00 -08:00
pyr0ball	96bb1222a6	feat: add scoring JSONL example and gitignore for benchmark data files	2026-02-26 23:46:29 -08:00
pyr0ball	52e972fd69	feat: add job-seeker-classifiers conda env for HF classifier benchmark	2026-02-26 23:43:41 -08:00
pyr0ball	3c0e8e75f7	fix: remove lib-resume-builder-aihawk from Docker requirements The package is never imported in the app — it was pulling torch + CUDA (~7GB) into the main app container for no reason. AIHawk runs in its own conda env (aihawk-env) outside Docker per design.	2026-02-26 22:16:28 -08:00
pyr0ball	e0e7717b56	fix: auto-configure git safe.directory in setup.sh for /opt-style installs Git 2.35.2+ rejects repos where directory owner != current user, which is the common case when cloned as root into /opt. setup.sh now detects this and calls git config --global --add safe.directory automatically. When run via sudo, it writes into SUDO_USER's config rather than root's. README updated with both fixes: git safe.directory and chown for preflight.	2026-02-26 22:07:39 -08:00
pyr0ball	ae29996a8a	docs: add install notes for /opt ownership, Podman rootless, Docker group	2026-02-26 21:15:42 -08:00
pyr0ball	c88b25d1f8	fix: skip --profile for remote profile; fixes podman-compose compat podman-compose 1.0.6 has no --profile flag, causing a fatal parse error. 'remote' profile means base services only — no service in compose.yml is tagged 'remote', so --profile remote was always a no-op with Docker too. Introduce PROFILE_ARG that only adds --profile for cpu/gpu profiles where it actually activates optional services.	2026-02-26 21:12:12 -08:00
pyr0ball	995e9f6aea	fix: render banner link as clickable page_link instead of italic text	2026-02-26 20:53:54 -08:00
pyr0ball	9719de5c43	fix: install make in setup.sh; guard manage.sh against missing make setup.sh now installs make (via apt/dnf/pacman/brew) before git and Docker so that manage.sh commands work out of the box on minimal server installs. manage.sh adds a preflight guard that catches a missing make early and redirects the user to ./manage.sh setup. Also fixes the post-setup next-steps hint to use ./manage.sh instead of bare make.	2026-02-26 20:51:34 -08:00
pyr0ball	a8bee0dc0c	feat: show version tag in sidebar footer	2026-02-26 14:39:47 -08:00
pyr0ball	4a8910540b	feat: multiselect tags for job titles & locations; remove duplicate Notion section; docker detection for services panel - Job titles and locations: replaced text_area with st.multiselect + ＋ add button + paste-list expander - ✨ Suggest now populates the titles dropdown (not auto-selected) — user picks what they want - Suggested exclusions still use click-to-add chip buttons - Removed duplicate Notion expander from System Settings (handled by Integrations tab) - Services panel: show host terminal copy-paste command when docker CLI unavailable (app runs inside container)	2026-02-26 14:26:58 -08:00
pyr0ball	f823f665d1	fix: add address field to Resume Profile — was hidden, triggering false FILL_IN banner	2026-02-26 14:03:55 -08:00
pyr0ball	49513cc081	fix: port drift on restart — down before preflight, read port from .env Makefile restart target now runs compose down before preflight so ports are free when preflight assigns them; previously preflight ran first while the old container still held 8502, causing it to bump to 8503. manage.sh start/restart/open now read STREAMLIT_PORT from .env instead of re-running preflight after startup (which would see the live container and bump the reported port again).	2026-02-26 13:57:12 -08:00
pyr0ball	bf33a584b4	feat: resume upload in Settings + improved config hints - Resume Profile tab: upload widget replaces error+stop when YAML missing; collapsed "Replace Resume" expander when profile exists; saves parsed data and raw text (for LLM context) in one step - FILL_IN banner with clickable link to Setup wizard when incomplete fields detected - Ollama not reachable hint references Services section below - Fine-tune hint clarifies "My Profile tab above" with inference profile names - vLLM no-models hint links to Fine-Tune tab	2026-02-26 13:53:01 -08:00
pyr0ball	6ff26a0c49	refactor: replace sidebar LLM generate panel with inline field buttons Removed the dropdown-based sidebar panel in favour of ✨ Generate buttons placed directly below Career Summary, Voice & Personality, and each Mission & Values row. Prompts now incorporate the live field value as a draft to improve, plus resume experience bullets as context for Career Summary.	2026-02-26 13:40:52 -08:00
pyr0ball	f1decdf89c	feat: searchable tag UI for skills/domains/keywords Replace chip-button tag management with st.multiselect backed by bundled suggestions. Existing user tags are preserved as custom options alongside the suggestion list. Custom tag input validates through filter_tag() before adding — rejects URLs, profanity, overlong strings, and bad characters. Changes auto-save on multiselect interaction; custom tags append on + click.	2026-02-26 13:14:55 -08:00
pyr0ball	cda980da62	feat: bundled skills suggestion list and content filter utility - config/skills_suggestions.yaml: 168 curated tags across skills (77), domains (40), keywords (51) covering CS/TAM/ops and common tech roles; structured for future community aggregate (paid tier backlog) - scripts/skills_utils.py: filter_tag() rejects blanks, URLs, profanity, overlong strings, disallowed chars, and repeated-char runs; load_suggestions() reads bundled YAML per category	2026-02-26 13:09:32 -08:00
pyr0ball	db127848a1	fix: resume CID glyphs, resume YAML path, PyJWT dep, candidate voice & mission UI - resume_parser: add _clean_cid() to strip (cid:NNN) glyph refs from ATS PDFs; CIDs 127/149/183 become bullets, unknowns are stripped; applied to PDF/DOCX/ODT - resume YAML: canonicalize plain_text_resume.yaml path to config/ across all references (Settings, Apply, Setup, company_research, migrate); was pointing at unmounted aihawk/data_folder/ in Docker - requirements/environment: add PyJWT>=2.8 (was missing; broke Settings page) - user_profile: add candidate_voice field - generate_cover_letter: inject candidate_voice into SYSTEM_CONTEXT; add social_impact mission signal category (nonprofit, community, equity, etc.) - Settings: add Voice & Personality textarea to Identity expander; add Mission & Values expander with editable fields for all 4 mission categories - .gitignore: exclude CLAUDE.md, config/plain_text_resume.yaml, config/user.yaml.working - search_profiles: add default profile	2026-02-26 12:32:28 -08:00
pyr0ball	07bdac6302	feat: ODT support, two-column PDF column-split extraction, title/company layout detection hardening	2026-02-26 10:33:28 -08:00
pyr0ball	5af2b20d82	fix: harden resume section detection — anchor patterns to full line, expand header synonyms, fix name heuristic for hyphenated/middle-initial names, add parse diagnostics UI	2026-02-26 09:28:31 -08:00
pyr0ball	b9f5dd1fc3	refactor: replace LLM-based resume parser with section regex parser Primary parse path is now fully deterministic — no LLM, no token limits, no JSON generation. Handles two-column experience headers, institution-before- or-after-degree education layouts, and header bleed prevention via looks_like_header detection. LLM path retained as optional career_summary enhancement only (1500 chars, falls back silently). structure_resume() now returns tuple[dict, str]. Tests updated to match the new API.	2026-02-26 07:34:25 -08:00
pyr0ball	9297477ba0	fix: resume parser — max_tokens, json-repair fallback, logging, PYTHONUNBUFFERED	2026-02-26 00:00:23 -08:00
pyr0ball	4cee76211e	fix: add python-docx to container requirements	2026-02-25 23:43:30 -08:00
pyr0ball	5ac42e4c02	fix: add /v1 prefix to all license server API paths	2026-02-25 23:35:58 -08:00
pyr0ball	4f6d652889	feat: License tab in Settings (activate/deactivate UI) + startup refresh	2026-02-25 23:08:20 -08:00
pyr0ball	58ebd57c49	feat: wire license.effective_tier into tiers.py; add dev_override priority	2026-02-25 23:05:55 -08:00
pyr0ball	bf2d0f81c7	feat: license.py client — verify_local, effective_tier, activate, refresh, report_usage	2026-02-25 22:53:11 -08:00
pyr0ball	30542808c7	fix: GPU detection + pdfplumber + pass GPU env vars into app container - preflight.py now writes PEREGRINE_GPU_COUNT and PEREGRINE_GPU_NAMES to .env so the app container gets GPU info without needing nvidia-smi access - compose.yml passes PEREGRINE_GPU_COUNT, PEREGRINE_GPU_NAMES, and RECOMMENDED_PROFILE as env vars to the app service - 0_Setup.py _detect_gpus() reads PEREGRINE_GPU_NAMES env var first; falls back to nvidia-smi (bare / GPU-passthrough environments) - 0_Setup.py _suggest_profile() reads RECOMMENDED_PROFILE env var first - requirements.txt: add pdfplumber (needed for resume PDF parsing)	2026-02-25 21:58:28 -08:00
pyr0ball	578a4c819a	fix: add app/__init__.py so wizard submodule is importable inside Docker Without __init__.py, Python treats app/ as a namespace package that doesn't resolve correctly when running from WORKDIR /app inside the container. 'from app.wizard.step_hardware import ...' raises ModuleNotFoundError: No module named 'app.wizard'; 'app' is not a package	2026-02-25 21:41:09 -08:00
pyr0ball	1dcf9d47a4	fix: stub-port adoption — stubs bind free ports, app routes to external via host.docker.internal Three inter-related fixes for the service adoption flow: - preflight: stub_port field — adopted services get a free port for their no-op container (avoids binding conflict with external service on real port) while update_llm_yaml still uses the real external port for host.docker.internal URLs - preflight: write_env now uses stub_port (not resolved) for adopted services so SEARXNG_PORT etc point to the stub's harmless port, not the occupied one - preflight: stub containers use sleep infinity + CMD true healthcheck so depends_on: service_healthy is satisfied without holding any real port - Makefile: finetune profile changed from [cpu,single-gpu,dual-gpu] to [finetune] so the pytorch/cuda base image is not built during make start	2026-02-25 21:38:23 -08:00
pyr0ball	010abe6339	fix: ollama docker_owned=True; finetune gets own profile to avoid build on start - preflight: ollama was incorrectly marked docker_owned=False — Docker does define an ollama service, so external detection now correctly disables it via compose.override.yml when host Ollama is already running - compose.yml: finetune moves from [cpu,single-gpu,dual-gpu] profiles to [finetune] profile so it is never built during 'make start' (pytorch/cuda base is 3.7GB+ and unnecessary for the UI) - compose.yml: remove depends_on ollama from finetune — it reaches Ollama via OLLAMA_URL env var which works whether Ollama is Docker or host - Makefile: finetune target uses --profile finetune + compose.gpu.yml overlay	2026-02-25 21:24:33 -08:00
pyr0ball	3518d63ec2	feat: smart service adoption in preflight — use external services instead of conflicting preflight.py now detects when a managed service (ollama, vllm, vision, searxng) is already running on its configured port and adopts it rather than reassigning or conflicting: - Generates compose.override.yml disabling Docker containers for adopted services (profiles: [_external_] — a profile never passed via --profile) - Rewrites config/llm.yaml base_url entries to host.docker.internal:<port> so the app container can reach host-side services through Docker's host-gateway mapping - compose.yml: adds extra_hosts host.docker.internal:host-gateway to the app service (required on Linux; no-op on macOS Docker Desktop) - .gitignore: excludes compose.override.yml (auto-generated, host-specific) Only streamlit is non-adoptable and continues to reassign on conflict.	2026-02-25 19:23:02 -08:00
pyr0ball	cf6dfdbf8a	docs: use ./manage.sh setup in quickstart	2026-02-25 17:18:03 -08:00
pyr0ball	f14483b8ae	docs: update README — manage.sh CLI reference + correct Forgejo clone URL	2026-02-25 16:59:34 -08:00
pyr0ball	4ffcf610b7	feat: add manage.sh — single CLI entry point for beta testers	2026-02-25 16:51:30 -08:00
pyr0ball	cc01f67b04	fix: fix dual-gpu port conflict + move GPU config to overlay files - Remove ollama-gpu service (was colliding with ollama on port 11434) - Strip inline deploy.resources GPU blocks from vision and vllm - Add compose.gpu.yml: Docker NVIDIA overlay for ollama (GPU 0), vision (GPU 0), vllm (GPU 1), finetune (GPU 0) - Fix compose.podman-gpu.yml: rename ollama-gpu → ollama to match service name after removal of ollama-gpu - Update Makefile: apply compose.gpu.yml for Docker + GPU profiles (was only applying podman-gpu.yml for Podman + GPU profiles)	2026-02-25 16:44:59 -08:00
pyr0ball	f38f0c2007	feat: wire fine-tune UI end-to-end + harden setup.sh - setup.sh: replace docker-image-based NVIDIA test with nvidia-ctk validate (faster, no 100MB pull, no daemon required); add check_docker_running() to auto-start the Docker service on Linux or warn on macOS - prepare_training_data.py: also scan training_data/uploads/*.{md,txt} so web-uploaded letters are included in training data - task_runner.py: add prepare_training task type (calls build_records + write_jsonl inline; reports pair count in task result) - Settings fine-tune tab: Step 1 accepts .md/.txt uploads; Step 2 Extract button submits prepare_training background task + shows status; Step 3 shows make finetune command + live Ollama model status poller	2026-02-25 16:31:53 -08:00

1 2 3

102 commits