peregrine

Author	SHA1	Message	Date
pyr0ball	e982fa7a8b	fix: resume CID glyphs, resume YAML path, PyJWT dep, candidate voice & mission UI - resume_parser: add _clean_cid() to strip (cid:NNN) glyph refs from ATS PDFs; CIDs 127/149/183 become bullets, unknowns are stripped; applied to PDF/DOCX/ODT - resume YAML: canonicalize plain_text_resume.yaml path to config/ across all references (Settings, Apply, Setup, company_research, migrate); was pointing at unmounted aihawk/data_folder/ in Docker - requirements/environment: add PyJWT>=2.8 (was missing; broke Settings page) - user_profile: add candidate_voice field - generate_cover_letter: inject candidate_voice into SYSTEM_CONTEXT; add social_impact mission signal category (nonprofit, community, equity, etc.) - Settings: add Voice & Personality textarea to Identity expander; add Mission & Values expander with editable fields for all 4 mission categories - .gitignore: exclude CLAUDE.md, config/plain_text_resume.yaml, config/user.yaml.working - search_profiles: add default profile	2026-02-26 12:32:28 -08:00
pyr0ball	7ca20eec42	feat: ODT support, two-column PDF column-split extraction, title/company layout detection hardening	2026-02-26 10:33:28 -08:00
pyr0ball	1775c7fa36	fix: harden resume section detection — anchor patterns to full line, expand header synonyms, fix name heuristic for hyphenated/middle-initial names, add parse diagnostics UI	2026-02-26 09:28:31 -08:00
pyr0ball	26563a0990	refactor: replace LLM-based resume parser with section regex parser Primary parse path is now fully deterministic — no LLM, no token limits, no JSON generation. Handles two-column experience headers, institution-before- or-after-degree education layouts, and header bleed prevention via looks_like_header detection. LLM path retained as optional career_summary enhancement only (1500 chars, falls back silently). structure_resume() now returns tuple[dict, str]. Tests updated to match the new API.	2026-02-26 07:34:25 -08:00
pyr0ball	c8d8434371	fix: resume parser — max_tokens, json-repair fallback, logging, PYTHONUNBUFFERED	2026-02-26 00:00:23 -08:00
pyr0ball	af53f0a5eb	fix: add python-docx to container requirements	2026-02-25 23:43:30 -08:00
pyr0ball	70b385f3fd	fix: add /v1 prefix to all license server API paths	2026-02-25 23:35:58 -08:00
pyr0ball	6585d9ec75	feat: License tab in Settings (activate/deactivate UI) + startup refresh	2026-02-25 23:08:20 -08:00
pyr0ball	fa1f36dc8d	feat: wire license.effective_tier into tiers.py; add dev_override priority	2026-02-25 23:05:55 -08:00
pyr0ball	7d5a706202	feat: license.py client — verify_local, effective_tier, activate, refresh, report_usage	2026-02-25 22:53:11 -08:00
pyr0ball	4da5e0a2a4	fix: GPU detection + pdfplumber + pass GPU env vars into app container - preflight.py now writes PEREGRINE_GPU_COUNT and PEREGRINE_GPU_NAMES to .env so the app container gets GPU info without needing nvidia-smi access - compose.yml passes PEREGRINE_GPU_COUNT, PEREGRINE_GPU_NAMES, and RECOMMENDED_PROFILE as env vars to the app service - 0_Setup.py _detect_gpus() reads PEREGRINE_GPU_NAMES env var first; falls back to nvidia-smi (bare / GPU-passthrough environments) - 0_Setup.py _suggest_profile() reads RECOMMENDED_PROFILE env var first - requirements.txt: add pdfplumber (needed for resume PDF parsing)	2026-02-25 21:58:28 -08:00
pyr0ball	c7fe1626a7	fix: add app/__init__.py so wizard submodule is importable inside Docker Without __init__.py, Python treats app/ as a namespace package that doesn't resolve correctly when running from WORKDIR /app inside the container. 'from app.wizard.step_hardware import ...' raises ModuleNotFoundError: No module named 'app.wizard'; 'app' is not a package	2026-02-25 21:41:09 -08:00
pyr0ball	1d228b293b	fix: stub-port adoption — stubs bind free ports, app routes to external via host.docker.internal Three inter-related fixes for the service adoption flow: - preflight: stub_port field — adopted services get a free port for their no-op container (avoids binding conflict with external service on real port) while update_llm_yaml still uses the real external port for host.docker.internal URLs - preflight: write_env now uses stub_port (not resolved) for adopted services so SEARXNG_PORT etc point to the stub's harmless port, not the occupied one - preflight: stub containers use sleep infinity + CMD true healthcheck so depends_on: service_healthy is satisfied without holding any real port - Makefile: finetune profile changed from [cpu,single-gpu,dual-gpu] to [finetune] so the pytorch/cuda base image is not built during make start	2026-02-25 21:38:23 -08:00
pyr0ball	7c62935371	fix: ollama docker_owned=True; finetune gets own profile to avoid build on start - preflight: ollama was incorrectly marked docker_owned=False — Docker does define an ollama service, so external detection now correctly disables it via compose.override.yml when host Ollama is already running - compose.yml: finetune moves from [cpu,single-gpu,dual-gpu] profiles to [finetune] profile so it is never built during 'make start' (pytorch/cuda base is 3.7GB+ and unnecessary for the UI) - compose.yml: remove depends_on ollama from finetune — it reaches Ollama via OLLAMA_URL env var which works whether Ollama is Docker or host - Makefile: finetune target uses --profile finetune + compose.gpu.yml overlay	2026-02-25 21:24:33 -08:00
pyr0ball	9c1f894446	feat: smart service adoption in preflight — use external services instead of conflicting preflight.py now detects when a managed service (ollama, vllm, vision, searxng) is already running on its configured port and adopts it rather than reassigning or conflicting: - Generates compose.override.yml disabling Docker containers for adopted services (profiles: [_external_] — a profile never passed via --profile) - Rewrites config/llm.yaml base_url entries to host.docker.internal:<port> so the app container can reach host-side services through Docker's host-gateway mapping - compose.yml: adds extra_hosts host.docker.internal:host-gateway to the app service (required on Linux; no-op on macOS Docker Desktop) - .gitignore: excludes compose.override.yml (auto-generated, host-specific) Only streamlit is non-adoptable and continues to reassign on conflict.	2026-02-25 19:23:02 -08:00
pyr0ball	e3fbdd5502	docs: use ./manage.sh setup in quickstart	2026-02-25 17:18:03 -08:00
pyr0ball	04915d33be	docs: update README — manage.sh CLI reference + correct Forgejo clone URL	2026-02-25 16:59:34 -08:00
pyr0ball	ca278d5b41	feat: add manage.sh — single CLI entry point for beta testers	2026-02-25 16:51:30 -08:00
pyr0ball	775d54d605	fix: fix dual-gpu port conflict + move GPU config to overlay files - Remove ollama-gpu service (was colliding with ollama on port 11434) - Strip inline deploy.resources GPU blocks from vision and vllm - Add compose.gpu.yml: Docker NVIDIA overlay for ollama (GPU 0), vision (GPU 0), vllm (GPU 1), finetune (GPU 0) - Fix compose.podman-gpu.yml: rename ollama-gpu → ollama to match service name after removal of ollama-gpu - Update Makefile: apply compose.gpu.yml for Docker + GPU profiles (was only applying podman-gpu.yml for Podman + GPU profiles)	2026-02-25 16:44:59 -08:00
pyr0ball	bcde4c960e	feat: wire fine-tune UI end-to-end + harden setup.sh - setup.sh: replace docker-image-based NVIDIA test with nvidia-ctk validate (faster, no 100MB pull, no daemon required); add check_docker_running() to auto-start the Docker service on Linux or warn on macOS - prepare_training_data.py: also scan training_data/uploads/*.{md,txt} so web-uploaded letters are included in training data - task_runner.py: add prepare_training task type (calls build_records + write_jsonl inline; reports pair count in task result) - Settings fine-tune tab: Step 1 accepts .md/.txt uploads; Step 2 Extract button submits prepare_training background task + shows status; Step 3 shows make finetune command + live Ollama model status poller	2026-02-25 16:31:53 -08:00
pyr0ball	740b0ea45a	feat: containerize fine-tune pipeline (Dockerfile.finetune + make finetune) - Dockerfile.finetune: PyTorch 2.3/CUDA 12.1 base + unsloth + training stack - finetune_local.py: auto-register model via Ollama HTTP API after GGUF export; path-translate between finetune container mount and Ollama's view; update config/llm.yaml automatically; DOCS_DIR env override for Docker - prepare_training_data.py: DOCS_DIR env override so make prepare-training works correctly inside the app container - compose.yml: add finetune service (cpu/single-gpu/dual-gpu profiles); DOCS_DIR=/docs injected into app + finetune containers - compose.podman-gpu.yml: CDI device override for finetune service - Makefile: make prepare-training + make finetune targets	2026-02-25 16:22:48 -08:00
pyr0ball	cfbe1cdf1a	feat: prompt for model weights directory during install Interactive prompt lets users with split-drive setups point Ollama and vLLM model dirs at a dedicated storage drive. Reads current .env value as default so re-runs are idempotent. Skips prompts in non-interactive (piped) mode. Creates the target directory immediately and updates .env in-place via portable awk (Linux + macOS). Also simplifies next-steps output since model paths are now configured at install time.	2026-02-25 16:08:14 -08:00
pyr0ball	57a05417dc	fix: repair beta installer path for Docker-first deployment - llm.yaml + example: replace localhost URLs with Docker service names (ollama:11434, vllm:8000, vision:8002); replace personal model names (alex-cover-writer, llama3.1:8b) with llama3.2:3b - user.yaml.example: update service hosts to Docker names (ollama, vllm, searxng) and searxng port from 8888 (host-mapped) to 8080 (internal) - wizard step 5: fix hardcoded localhost defaults — wizard runs inside Docker, so service name defaults are required for connection tests to pass - scrapers/companyScraper.py: bundle scraper so Dockerfile COPY succeeds - setup.sh: remove host Ollama install (conflicts with Docker Ollama on port 11434); Docker entrypoint handles model download automatically - README + setup.sh banner: add Circuit Forge mission statement	2026-02-25 16:03:10 -08:00
pyr0ball	4f67be4020	feat: add Ollama install + service start + model pull to setup.sh	2026-02-25 15:42:56 -08:00
pyr0ball	7b53e6fd75	feat: Podman support — auto-detect COMPOSE, CDI GPU override, podman-compose in setup.sh	2026-02-25 15:36:36 -08:00
pyr0ball	6be0566335	docs: fix license server paths — dev under CircuitForge/, live at /devl/	2026-02-25 15:28:32 -08:00
pyr0ball	1e6950893a	docs: CircuitForge license server implementation plan (11 tasks)	2026-02-25 15:27:39 -08:00
pyr0ball	b40dda3b91	docs: CircuitForge license server design doc RS256 JWT, FastAPI + SQLite, multi-product schema, offline-capable client integration. Covers server, Peregrine client, deployment, admin workflow, and testing strategy.	2026-02-25 15:21:07 -08:00
pyr0ball	8ceb1d2ebc	docs: mark cover letter refinement complete in backlog + changelog	2026-02-25 14:44:50 -08:00
pyr0ball	7fab2a0cd3	feat: cover letter iterative refinement — feedback UI + backend params - generate() accepts previous_result + feedback; appends both to LLM prompt - task_runner cover_letter handler parses params JSON, passes fields through - Apply Workspace: "Refine with Feedback" expander with text area + Regenerate button; only shown when a draft exists; clears feedback after submitting - 8 new tests (TestGenerateRefinement + TestTaskRunnerCoverLetterParams)	2026-02-25 14:44:20 -08:00
pyr0ball	37ce9fb1f8	docs: finalise Circuit Forge product suite naming + product brief	2026-02-25 14:16:56 -08:00
pyr0ball	76e97bd1cb	docs: backlog — Circuit Forge product expansion (heinous tasks platform)	2026-02-25 14:02:07 -08:00
pyr0ball	c1381c65ba	docs: mark email sync test checklist complete	2026-02-25 13:56:55 -08:00
pyr0ball	f3cfd258c6	test: complete email sync test coverage — 44 new tests across all checklist sections	2026-02-25 13:55:55 -08:00
pyr0ball	3076f051d7	chore: mkdocs deps, CHANGELOG, remove dead Resume Editor page, backlog gap items	2026-02-25 13:51:13 -08:00
pyr0ball	ddabf85a3d	docs: LICENSE-MIT + LICENSE-BSL + updated README for 7-step wizard and current feature set	2026-02-25 12:06:28 -08:00
pyr0ball	0ba27c3939	docs: mkdocs wiki — installation, user guide, developer guide, reference Adds a full MkDocs documentation site under docs/ with Material theme. Getting Started: installation walkthrough, 7-step first-run wizard guide, Docker Compose profile reference with GPU memory guidance and preflight.py description. User Guide: job discovery (search profiles, custom boards, enrichment), job review (sorting, match scores, batch actions), apply workspace (cover letter gen, PDF export, mark applied), interviews (kanban stages, company research auto-trigger, survey assistant), email sync (IMAP, Gmail App Password, classification labels, stage auto-updates), integrations (all 13 drivers with tier requirements), settings (every tab documented). Developer Guide: contributing (dev env setup, code style, branch naming, PR checklist), architecture (ASCII layer diagram, design decisions), adding scrapers (full scrape() interface, registration, search profile config, test patterns), adding integrations (IntegrationBase full interface, auto- discovery, tier gating, test patterns), testing (patterns, fixtures, what not to test). Reference: tier system (full FEATURES table, can_use/tier_label API, dev override, adding gates), LLM router (backend types, complete() signature, fallback chains, vision routing, __auto__ resolution, adding backends), config files (every file with field-level docs and gitignore status). Also adds CONTRIBUTING.md at repo root pointing to the docs site.	2026-02-25 12:05:49 -08:00
pyr0ball	ad7a56dca5	docs: backlog — Ultra tier managed applications concept	2026-02-25 11:40:55 -08:00
pyr0ball	2163f428dc	feat: Integrations tab in Settings — connect/test/disconnect all 12 integration drivers	2026-02-25 11:30:44 -08:00
pyr0ball	bbb3eda747	refactor: move HF token to Developer tab — hidden from standard user UI	2026-02-25 11:04:13 -08:00
pyr0ball	a0164814e9	feat: expanded first-run wizard — complete implementation 13-task implementation covering: - UserProfile wizard fields (wizard_complete, wizard_step, tier, dev_tier_override, dismissed_banners, effective_tier) + params column in background_tasks - Tier system: FEATURES gate, can_use(), tier_label() (app/wizard/tiers.py) - Six pure validate() step modules (hardware, tier, identity, resume, inference, search) - Resume parser: PDF (pdfplumber) + DOCX (python-docx) extraction + LLM structuring - Integration base class + auto-discovery registry (scripts/integrations/) - 13 integration drivers (Notion, Google Sheets, Airtable, Google Drive, Dropbox, OneDrive, MEGA, Nextcloud, Google Calendar, Apple Calendar, Slack, Discord, Home Assistant) + config/integrations/*.yaml.example files - wizard_generate task type with 8 LLM generation sections + iterative refinement (previous_result + feedback support) - step_integrations module: validate(), get_available(), is_connected() - Wizard orchestrator rewrite (0_Setup.py): 7 steps, crash recovery, LLM polling - app.py gate: checks wizard_complete flag in addition to file existence - Home page: 13 dismissible contextual setup banners (wizard_complete-gated) - Settings: Developer tab — tier override selectbox + wizard reset button 219 tests passing.	2026-02-25 10:54:24 -08:00
pyr0ball	ea4f6a9160	feat: Developer tab in Settings — tier override + wizard reset button	2026-02-25 10:50:14 -08:00
pyr0ball	928825b9b9	feat: dismissible setup banners on Home page (13 contextual prompts)	2026-02-25 09:53:34 -08:00
pyr0ball	2a09b40a1d	feat: app.py checks wizard_complete flag to gate main app	2026-02-25 09:43:53 -08:00
pyr0ball	daf8e4a382	feat: wizard orchestrator — 7 steps, LLM generation polling, crash recovery Replaces the old 5-step wizard with a 7-step orchestrator that uses the step modules built in Tasks 2-8. Steps 1-6 are mandatory (hardware, tier, identity, resume, inference, search); step 7 (integrations) is optional. Each Next click validates, writes wizard_step to user.yaml for crash recovery, and resumes at the correct step on page reload. LLM generation buttons submit wizard_generate tasks and poll via @st.fragment(run_every=3). Finish sets wizard_complete=True, removes wizard_step, and calls apply_service_urls. Adds tests/test_wizard_flow.py (7 tests) covering validate() chain, yaml persistence helpers, and wizard state inference.	2026-02-25 09:10:51 -08:00
pyr0ball	6b093522bf	feat: step_integrations module with validate() + tier-filtered available list	2026-02-25 08:35:16 -08:00
pyr0ball	eb0e7883b8	docs: backlog — cover letter iterative refinement feedback loop	2026-02-25 08:30:24 -08:00
pyr0ball	9fdb95e17f	feat: wizard_generate — feedback + previous_result support for iterative refinement	2026-02-25 08:29:56 -08:00
pyr0ball	6156aebd3a	feat: wizard_generate task type — 8 LLM generation sections	2026-02-25 08:25:17 -08:00
pyr0ball	445917cbd6	docs: backlog — Podman support + FastAPI migration path	2026-02-25 08:22:24 -08:00

1 2

78 commits