peregrine

Author	SHA1	Message	Date
pyr0ball	9c87ed1cf2	docs: add Jobgether integration design spec	2026-03-15 09:45:50 -07:00
pyr0ball	a1a1141616	Merge pull request 'feat: LLM queue optimizer — resource-aware batch scheduler (closes #2 )' (#13 ) from feature/llm-queue-optimizer into main Reviewed-on: #13	2026-03-15 05:11:29 -07:00
pyr0ball	27d4b0e732	feat: LLM queue optimizer complete — closes #2 Resource-aware batch scheduler for LLM tasks: - scripts/task_scheduler.py (new): TaskScheduler singleton with VRAM-aware batch scheduling, durability, thread-safe singleton, memory safety - scripts/task_runner.py: submit_task() routes LLM types through scheduler - scripts/db.py: reset_running_tasks() for durable restart behavior - app/app.py: _startup() preserves queued tasks on restart - config/llm.yaml.example: scheduler VRAM budget config documented - tests/test_task_scheduler.py (new): 24 tests covering all behaviors Pre-existing failure: test_generate_calls_llm_router (issue #12, unrelated)	2026-03-15 05:01:24 -07:00
pyr0ball	95378c106e	feat(app): use reset_running_tasks() on startup to preserve queued tasks	2026-03-15 04:57:49 -07:00
pyr0ball	07c627cdb0	feat(task_runner): route LLM tasks through scheduler in submit_task() Replaces the spawn-per-task model for LLM task types with scheduler routing: cover_letter, company_research, and wizard_generate are now enqueued via the TaskScheduler singleton for VRAM-aware batching. Non-LLM tasks (discovery, email_sync, etc.) continue to spawn daemon threads directly. Adds autouse clean_scheduler fixture to test_task_runner.py to prevent singleton cross-test contamination.	2026-03-15 04:52:42 -07:00
pyr0ball	bcd918fb67	feat(scheduler): add durability — re-queue surviving LLM tasks on startup	2026-03-15 04:24:11 -07:00
pyr0ball	207d3816b3	feat(scheduler): implement thread-safe singleton get_scheduler/reset_scheduler	2026-03-15 04:19:23 -07:00
pyr0ball	3984a9c743	feat(scheduler): implement scheduler loop and batch worker with VRAM-aware scheduling	2026-03-15 04:14:56 -07:00
pyr0ball	4d055f6bcd	feat(scheduler): implement enqueue() with depth guard and ghost-row cleanup	2026-03-15 04:05:22 -07:00
pyr0ball	28e66001a3	refactor(scheduler): use module-level _get_gpus directly in __init__	2026-03-15 04:01:01 -07:00
pyr0ball	535c0ae9e0	feat(scheduler): implement TaskScheduler.__init__ with budget loading and VRAM detection	2026-03-15 03:32:11 -07:00
pyr0ball	3d7f6f7ff1	feat(scheduler): add task_scheduler.py skeleton with constants and TaskSpec	2026-03-15 03:28:43 -07:00
pyr0ball	52470759a4	docs(config): add scheduler VRAM budget config to llm.yaml.example	2026-03-15 03:28:26 -07:00
pyr0ball	d51066e8c2	refactor(tests): remove unused imports from test_task_scheduler	2026-03-15 03:27:17 -07:00
pyr0ball	905db2f147	feat(db): add reset_running_tasks() for durable scheduler restart	2026-03-15 03:22:45 -07:00
pyr0ball	eef2478948	docs: add LLM queue optimizer implementation plan 11-task TDD plan across 3 reviewed chunks. Covers: - reset_running_tasks() db helper - TaskScheduler skeleton + __init__ + enqueue + loop + workers - Thread-safe singleton, durability, submit_task routing shim - app.py startup change + full suite verification	2026-03-14 17:11:49 -07:00
pyr0ball	beb1553821	docs: revise queue optimizer spec after review Addresses 16 review findings across two passes: - Clarify _active.pop/double-decrement non-issue - Fix app.py change target (inline SQL, not kill_stuck_tasks) - Scope durability to LLM types only - Add _budgets to state table with load logic - Fix singleton safety explanation (lock, not GIL) - Ghost row fix: mark dropped tasks failed in DB - Document static _available_vram as known limitation - Fix test_llm_tasks_batch_by_type description - Eliminate circular import via routing split in submit_task() - Add missing budget warning at construction	2026-03-14 16:46:38 -07:00
pyr0ball	61dc2122e4	docs: add LLM queue optimizer design spec Resource-aware batch scheduler for LLM tasks. Closes #2.	2026-03-14 16:38:47 -07:00
pyr0ball	0f80b698ff	chore: add .worktrees/ to .gitignore Prevents worktree directories from being tracked.	2026-03-14 16:30:38 -07:00
pyr0ball	097def4bba	fix(linkedin): update selectors for 2025 public DOM; surface login-wall limitation in UI LinkedIn's unauthenticated public profile only exposes name, summary (truncated), current employer name, and certifications. Past roles, education, and skills are blurred server-side behind a login wall — not a scraper limitation. - Update selectors: data-section='summary' (was 'about'), .profile-section-card for certs, .visible-list for current experience entry - Strip login-wall noise injected into summary text after 'see more' - Skip aria-hidden blurred placeholder experience items - Add info callout in UI directing users to data export zip for full history	2026-03-13 19:47:21 -07:00
pyr0ball	1a50bc1392	chore: update changelog for v0.4.0 release	2026-03-13 11:28:03 -07:00
pyr0ball	d1fb4abd56	docs: update backlog with LinkedIn import follow-up items	2026-03-13 11:24:55 -07:00
pyr0ball	6c7499752c	fix(cloud): use per-user config dir for wizard gate; redirect on invalid session - app.py: wizard gate now reads get_config_dir()/user.yaml instead of hardcoded repo-level config/ — fixes perpetual onboarding loop in cloud mode where per-user wizard_complete was never seen - app.py: page title corrected to "Peregrine" - cloud_session.py: add get_config_dir() returning per-user config path in cloud mode, repo config/ locally - cloud_session.py: replace st.error() with JS redirect on missing/invalid session token so users land on login page instead of error screen - Home.py, 4_Apply.py, migrate.py: remove remaining AIHawk UI references	2026-03-13 11:24:42 -07:00
pyr0ball	42f0e6261c	fix(linkedin): conservative settings merge, mkdir guard, split dockerfile playwright layer	2026-03-13 10:58:58 -07:00
pyr0ball	1e12da45f1	fix(linkedin): move session state pop before tabs; add rerun after settings merge - Pop _linkedin_extracted before st.tabs() so tab_builder sees the freshly populated _parsed_resume in the same render pass (no extra rerun needed) - Fix tab label capitalisation: "Build Manually" (capital M) per spec - Add st.rerun() after LinkedIn merge in Settings so form fields refresh immediately to show the newly applied data	2026-03-13 10:55:25 -07:00
pyr0ball	b80e4de050	feat(linkedin): install Playwright Chromium in Docker image	2026-03-13 10:44:03 -07:00
pyr0ball	7489c1c12a	feat(linkedin): add LinkedIn import expander to Settings Resume Profile tab	2026-03-13 10:44:02 -07:00
pyr0ball	97ab8b94e5	feat(linkedin): add LinkedIn tab to wizard resume step	2026-03-13 10:43:53 -07:00
pyr0ball	bd0e9240eb	feat(linkedin): add shared LinkedIn import Streamlit widget	2026-03-13 10:32:23 -07:00
pyr0ball	5344dc8e7a	feat(linkedin): add staging file parser with re-parse support	2026-03-13 10:18:01 -07:00
pyr0ball	fba6796b8a	fix(linkedin): improve scraper error handling, current-job date range, add missing tests	2026-03-13 06:02:03 -07:00
pyr0ball	f759f5fbc0	feat(linkedin): add scraper (Playwright + export zip) with URL validation	2026-03-13 01:06:39 -07:00
pyr0ball	530f4346d1	feat(linkedin): add HTML parser utils with fixture tests	2026-03-13 01:01:05 -07:00
pyr0ball	db26b9aaf9	feat(cloud): add Heimdall tier resolution to cloud_session Calls /admin/cloud/resolve after JWT validation to inject the user's current subscription tier (free/paid/premium/ultra) into session_state as cloud_tier. Cached 5 minutes via st.cache_data to avoid Heimdall spam on every Streamlit rerun. Degrades gracefully to free on timeout or missing token. New env vars: HEIMDALL_URL, HEIMDALL_ADMIN_TOKEN (added to .env.example and compose.cloud.yml). HEIMDALL_URL defaults to http://cf-license:8000 for internal Docker network access. New helper: get_cloud_tier() — returns tier string in cloud mode, "local" in local-first mode, so pages can distinguish self-hosted from cloud.	2026-03-10 12:31:14 -07:00
pyr0ball	97b695c3e3	fix(cloud): extract cf_session cookie by name from X-CF-Session header	2026-03-10 09:22:08 -07:00
pyr0ball	72320315e2	docs: add cloud architecture + cloud-deployment.md architecture.md: updated Docker Compose table (3 compose files), database layer (Postgres platform + SQLite-per-user), cloud session middleware, telemetry system, and cloud design decisions. cloud-deployment.md (new): full operational runbook — env vars, data root layout, GDPR deletion, platform DB queries, telemetry, backup/restore, Caddy routing, demo instance, and onboarding a new app to the cloud.	2026-03-09 23:02:29 -07:00
pyr0ball	37dcdec754	feat(cloud): fix backup/restore for cloud mode — SQLCipher encrypt/decrypt T13: Three fixes: 1. backup.py: _decrypt_db_to_bytes() decrypts SQLCipher DB before archiving so the zip is portable to any local Docker install (plain SQLite). 2. backup.py: _encrypt_db_from_bytes() re-encrypts on restore in cloud mode so the app can open the restored DB normally. 3. 2_Settings.py: _base_dir uses get_db_path().parent in cloud mode (user's per-tenant data dir) instead of the hardcoded app root; db_key wired through both create_backup() and restore_backup() calls. 6 new cloud backup tests + 2 unit tests for SQLCipher helpers (pysqlcipher3 mocked — not available in the local conda test env). 419/419 total passing.	2026-03-09 22:41:44 -07:00
pyr0ball	ce19e00cfe	feat(cloud): Privacy & Telemetry tab in Settings + update_consent() T11: Add CLOUD_MODE-gated Privacy tab to Settings with full telemetry consent UI — hard kill switch, anonymous usage toggle, de-identified content sharing toggle, and time-limited support access grant. All changes persist to telemetry_consent table via new update_consent() in telemetry.py. Tab and all DB calls are completely no-op in local mode (CLOUD_MODE=false).	2026-03-09 22:14:22 -07:00
pyr0ball	8f9955fa96	feat(cloud): add compose.cloud.yml and telemetry consent middleware T8: compose.cloud.yml — multi-tenant cloud stack on port 8505, CLOUD_MODE=true, per-user encrypted data at /devl/menagerie-data, joins caddy-proxy_caddy-internal network; .env.example extended with five cloud-only env vars. T10: app/telemetry.py — log_usage_event() is the ONLY entry point to usage_events table; hard kill switch (all_disabled) checked before any DB write; complete no-op in local mode; swallows all exceptions so telemetry never crashes the app; psycopg2-binary added to requirements.txt. Event calls wired into 4_Apply.py at cover_letter_generated and job_applied. 5 tests, 413/413 total passing.	2026-03-09 22:10:18 -07:00
pyr0ball	5a1fceda84	feat(peregrine): wire cloud_session into pages for multi-tenant db path routing resolve_session() is a no-op in local mode — no behavior change for existing users. In cloud mode, injects user-scoped db_path into st.session_state at page load.	2026-03-09 20:22:17 -07:00
pyr0ball	634e31968f	feat(peregrine): add cloud_session middleware + SQLCipher get_connection() cloud_session.py: no-op in local mode; in cloud mode resolves Directus JWT from X-CF-Session header to per-user db_path in st.session_state. get_connection() in scripts/db.py: transparent SQLCipher/sqlite3 switch — uses encrypted driver when CLOUD_MODE=true and key provided, vanilla sqlite3 otherwise. libsqlcipher-dev added to Dockerfile for Docker builds. 6 new cloud_session tests + 1 new get_connection test — 34/34 db tests pass.	2026-03-09 19:43:42 -07:00
pyr0ball	2fdf6f725e	fix(peregrine): correct port comment in compose.demo.yml, update CLAUDE.md	2026-03-09 15:22:10 -07:00
pyr0ball	fbd47368ff	chore(peregrine): rename compose.menagerie.yml to compose.demo.yml Public demo instances moving to demo.circuitforge.tech; menagerie.circuitforge.tech reserved for cloud-hosted managed instances.	2026-03-09 14:55:38 -07:00
pyr0ball	2124b24e3d	docs: update features table to reflect BYOK tier policy AI features (cover letter gen, research, interview prep, survey assistant) are now correctly shown as unlockable at the free tier with any local LLM or user-supplied API key. Paid tier value prop is managed cloud inference + integrations + email sync, not AI feature gating. Also fixes circuitforge.io → circuitforge.tech throughout.	2026-03-07 22:17:18 -08:00
pyr0ball	88f28c2b41	chore: move internal plans to circuitforge-plans repo All docs/plans/ files migrated to pyr0ball/circuitforge-plans. Keeping docs/ for future user-facing documentation.	2026-03-07 15:38:47 -08:00
pyr0ball	28cc03ba70	chore: expand peregrine .gitleaks.toml allowlists for history scan Suppress false positives found during pre-push history scan: - Path allowlists: docs/plans/, tests/, Streamlit app files, SearXNG default config, apple_calendar.py placeholder - Regex allowlists: Unix epoch timestamps, localhost ports, 555-area-code variants, CFG-* example license key patterns - All 164 history commits now scan clean	2026-03-07 13:24:18 -08:00
pyr0ball	7de630e065	chore: activate circuitforge-hooks, add peregrine .gitleaks.toml - Wire core.hooksPath → circuitforge-hooks/hooks via install.sh - Add .gitleaks.toml extending shared base config with Peregrine-specific allowlists (Craigslist/LinkedIn IDs, localhost port patterns) - Remove .githooks/pre-commit (superseded by gitleaks hook) - Update setup.sh activate_git_hooks() to call circuitforge-hooks/install.sh with .githooks/ as fallback if hooks repo not present	2026-03-07 13:20:52 -08:00
pyr0ball	1cf6e370b1	docs: circuitforge-hooks implementation plan (8 tasks, TDD)	2026-03-07 12:27:47 -08:00
pyr0ball	9d2ed1d00d	docs: circuitforge-hooks design — gitleaks-based secret + PII scanning Centralised pre-commit/pre-push hook repo design covering the token leak root causes: unactivated hooksPath and insufficient regex coverage.	2026-03-07 12:23:54 -08:00
pyr0ball	1b500b9f26	docs: update changelog for v0.3.0 release - Add v0.3.0 section: feedback button, BYOK warning, LLM suggest, backup/restore, privacy scrub - Retroactively document v0.2.0 (was in [Unreleased]) - Clear [Unreleased] for future work	2026-03-06 16:04:28 -08:00

1 2 3 4 5

212 commits