turnstone

Author	SHA1	Message	Date
pyr0ball	6039ab2464	feat: incident ticket export — Notion and Jira integration (#12) - app/services/ticket_export.py: plugin-dispatch architecture; Notion exporter (Notion API v1, blocks-based, 50 entry cap, 2000-char truncation per block); Jira exporter (REST API v3, Basic Auth, ADF description, configurable issue type defaulting to Bug) - app/rest.py: POST /api/incidents/{id}/export endpoint; Notion/Jira credential fields added to SettingsBody and PATCH /api/settings handler - web/src/views/IncidentsView.vue: "Export ticket ▾" dropdown in incident detail drawer — click-outside close, inline URL link on success - web/src/views/SettingsView.vue: Ticket Trackers section with Notion token + database ID, Jira URL/email/token/project/issue-type; show/hide for secret fields - tests/test_ticket_export.py: 17 tests covering dispatch, Notion success/error/config/payload/truncation paths, Jira success/error/auth/project/summary/default-issue-type	2026-06-14 15:46:11 -07:00
pyr0ball	b8f766fb74	feat: SSH target manager — GUI editor for remote host configuration (#24) - app/services/ssh_targets.py: full CRUD service with lazy paramiko import, key-path validation, permission warning, and test_connection - app/db/schema.py: ssh_targets table (id, label, host, port, user, key_path, last_tested, last_ok, last_error, timestamps) - app/rest.py: GET/POST /api/ssh-targets, PATCH/DELETE /{id}, POST /{id}/test — key contents never returned in any response - web/src/views/SettingsView.vue: Remote Hosts section with add/edit form, inline connection status badges, test-connection flow, delete with confirmation; new Set() pattern for reactive sshTesting state - tests/test_ssh_targets.py: 22 tests — schema, CRUD, validation, key-warning, serialization, paramiko-absent path	2026-06-14 15:27:12 -07:00
pyr0ball	7a2ab0bb46	feat(orchard): auto-enrollment API for branch node provisioning (#27) Implements the Orchard branch grafting system for harvest.circuitforge.tech: - POST /api/orchard/graft: provisions data dir, starts a new turnstone-submissions-<slug> Docker container on the next free port (ORCHARD_PORT_BASE=8538+), injects a handle_path block into the Caddyfile dynamic-branches marker section, restarts caddy-proxy, returns {submit_endpoint, api_key} - GET /api/orchard/branches: list active/inactive branches (admin-only) - DELETE /api/orchard/branches/<slug>: deactivate branch + stop container - POST /api/orchard/branches/<slug>/anonymize: HMAC-based IP/username pseudonymization worker over a branch DB - POST /api/glean/batch: optional TURNSTONE_BRANCH_KEY auth guard - anonymized column added to log_entries schema (migration-safe) - Updated Caddyfile with /huginn/* route (port 8536), /node2/* (8537), and dynamic-branch marker section - All endpoints admin-gated via TURNSTONE_ORCHARD_ADMIN_KEY Closes: #27	2026-06-14 14:30:18 -07:00
pyr0ball	600e5a9eac	feat(sources): context-aware filesystem log scanner (#23) Add scan_log_directories() to discover.py that recursively walks /var/log and /opt, filters to readable log files, and scores each candidate by recency (mtime, 0.7 weight), file size (0.3), and keyword match against an optional problem-context query (shifts weights to 0.4/0.2/0.4 when a query is provided). - GET /api/setup/scan?query=...&max_results=N — new API endpoint - SourcesView: "Scan" button opens a panel with ranked candidates, checkboxes, and "Add selected" to write to sources.yaml - 13 new unit tests, 466 passing total Closes: #23	2026-06-14 14:01:45 -07:00
pyr0ball	f3d807d991	feat(diagnose): conversational chat mode + NL source discovery - New ChatDiagnose.vue: multi-turn chat UI in the Diagnose tab - Textarea input (auto-grows) for long free-form problem descriptions - Source suggestion pre-flight: debounced POST /api/sources/suggest identifies relevant log sources from the query text and shows them as interactive chips (deselect to exclude before searching) - Conversation history preserved across turns with LLM reasoning, collapsible log entries, and "Save as incident" per turn - Reuses existing /api/diagnose/stream — no new pipeline - DiagnoseView.vue: Chat is now default tab; viewport-height layout - POST /api/sources/suggest: token-overlap source ranking, no LLM - Fix: add missing 'import re' causing 500 on suggest route	2026-06-11 22:04:53 -07:00
pyr0ball	b6b69e2150	feat(incidents): auto-incident detection + example-node Podman setup Auto-incident detector: - New app/tasks/incident_detector.py: post-glean error cluster detector - Sliding window algorithm: source + N errors within window_s seconds - Deduplication via issue_type='auto:{source_id}' + interval overlap check - Respects TURNSTONE_AUTO_INCIDENT_THRESHOLD (default 5) and TURNSTONE_AUTO_INCIDENT_WINDOW (default 600s) env vars - 20 tests all passing - Wired into glean_scheduler.run_once() and scheduler_loop() - TURNSTONE_AUTO_INCIDENT env var to disable (default enabled) Podman standalone improvements: - REPO_DIR auto-detected from script location (no longer hardcoded to /opt/turnstone) - DATA_DIR/PATTERNS_DIR/HF_CACHE_DIR configurable via env vars - Bootstrap step copies host-specific sources-<hostname>.yaml on first run - Auto-incident env vars passed through example-node sources: - patterns/sources-example-node.yaml: Sonarr, Radarr, Bazarr, Prowlarr, Tautulli, autoscan, organizr, nextcloud, journal export	2026-06-11 18:37:53 -07:00
pyr0ball	4dcc1a441a	feat(incidents): incident timeline visualizer + fix entry lookup using wrong DB path Adds IncidentTimeline.vue — a pure SVG time-axis component rendered inside the incident detail drawer when entries are present: - Horizontal strip scaled to incident window (preserveAspectRatio=none) - Event ticks colored by severity, height proportional to severity level - 50-bin density shading shows burst periods as blue bands - Gap markers (dashed lines) for silence > 10% of window or > 60s - Hover tooltip showing nearest entry's severity, time, and truncated text - Click-to-scroll: clicking a tick highlights and scrolls to its entry in the list below - Legend showing only severity levels present in the incident Also fixes a pre-existing bug: get_incident_endpoint and both build_bundle callers were passing INCIDENTS_DB_PATH to get_incident_entries/build_bundle, causing all incident entry lookups to silently search the empty incidents DB instead of the main log DB. This made all incident detail views show "No log entries found". Closes: #57	2026-06-10 16:02:24 -07:00
pyr0ball	cffe6bcd31	feat: cybersec zero-shot scoring pipeline (#9) Second-pass cybersec classifier using DeBERTa-v3-base-mnli (already cached — no download required). Runs after each anomaly scoring pass on entries flagged by the anomaly scorer or with pattern matches. Architecture: - app/services/cybersec.py: zero-shot-classification pipeline with 5 cybersec candidate labels (auth failure, privilege escalation, network intrusion, malware, data exfiltration). Writes ml_score/ml_label/ml_scored_at to log_entries; inserts high-confidence hits into detections with scorer='cybersec'. - app/tasks/cybersec_scorer.py: async background task (same shape as anomaly_scorer.py). - REST: GET/POST /turnstone/api/cybersec/status|run|detections. GET /turnstone/api/anomaly/detections now accepts scorer= filter. Schema: ml_score, ml_label, ml_scored_at added to log_entries; scorer column added to detections (idempotent migrations + DDL for both SQLite and Postgres). UI: Security Alerts view gains Source dropdown (All / Anomaly / Cybersec) and cybersec scorer status badge. Label dropdown split into optgroups. Deployment: TURNSTONE_CYBERSEC_MODEL/DEVICE/THRESHOLD vars added to .env.example, docker-compose.yml, docker-standalone.sh. Tests: 10 new tests — no model, no eligible entries, scoring, detection creation, normal label suppression, threshold filtering, pattern-tag filtering, idempotency, list filtering, scorer column filter. 416/416 passing. Closes: #9	2026-06-10 01:03:25 -07:00
pyr0ball	0693e1fd54	feat: anomaly scoring pipeline (#10) - Add app/services/anomaly.py: batch scorer using HF text-classification pipeline; rewrites anomaly_score/anomaly_label/anomaly_scored_at on log_entries; inserts high-confidence hits into detections table - Add app/tasks/anomaly_scorer.py: background task (same shape as glean_scheduler); triggered after each glean cycle when TURNSTONE_ANOMALY_MODEL is set - DB schema: add anomaly_score/anomaly_label/anomaly_scored_at columns to log_entries (idempotent ALTER TABLE migration); add detections table - Wire scorer into scheduler_loop and glean_scheduler.run_once; no-op when model env var is empty (safe to leave unconfigured) - REST endpoints: GET/POST /api/anomaly/status, /api/anomaly/run, GET /api/anomaly/detections, POST /api/anomaly/detections/{id}/acknowledge - Reuses Hybrid-BERT label map from diagnose/classifier.py; works with any HF text-classification model - 12 new tests; 406/406 passing Closes: #10	2026-06-09 11:15:13 -07:00
pyr0ball	0311d72e53	feat: dual-backend SQLite/Postgres + multi-tenant source namespacing - Add app/db/ abstraction layer: Backend enum, DbConn wrapper, dialect helper (q() for ? vs %s paramstyle), get_conn(), tenant_id() - Auto-detect backend from DATABASE_URL; SQLite remains default when unset — no config change for local deployments - Add tenant_id column to all three logical DBs (main, context, incidents); idempotent ALTER TABLE migration runs before schema scripts on existing DBs - All INSERTs inject tenant_id; SELECTs use (tenant_id = ? OR tenant_id = '') for backward compat with pre-namespacing rows - Add docker-compose.yml with named volume turnstone_pgdata (survives rebuilds) and optional external Postgres support via DATABASE_URL override - Add scripts/migrate_sqlite_to_postgres.py — one-shot idempotent migration for existing SQLite data; ON CONFLICT DO NOTHING for safe re-runs - Fix SSH glean path in pipeline.py to use ensure_schema + get_conn (was still using raw sqlite3.connect + old _SCHEMA without tenant_id) - Fix FTS5 JOIN ambiguity: qualify repeat_count as f.repeat_count in search - Update all tests to use ensure_*_schema fixtures; add row_factory where needed - 394/394 tests passing Closes: #42 Closes: #50	2026-06-08 08:37:54 -07:00
pyr0ball	1de156ebde	fix: reset browser UA button chrome for dark mode HTML buttons get a ~#efefef background and 2px outset border from the browser UA stylesheet. In light mode these blend in; in dark mode they render as stark white boxes. Adding a global button reset in theme.css clears the UA defaults — explicit bg-* utility classes still win. Affects: theme toggle, hamburger nav button, dashboard diagnose buttons, and all other icon/text buttons that had no explicit bg class. Bumps version to 0.6.2.	2026-06-05 09:55:08 -07:00
pyr0ball	876cfb9a63	fix: group journal sources by prefix:host stem in source health source_ids with 3+ colon segments (e.g. muninn-journal:Muninn:ssh.service) are now aggregated by their prefix:host key at the SQL level in both list_sources() and stats_summary(). This collapses ~19K transient systemd unit rows (crash-loop scope entries from Muninn) into ~24 grouped rows. - list_sources: SQL CASE/INSTR group-by stem + unit_count field - stats_summary: same stem grouping for dashboard source health table - delete endpoint: LIKE-based cascade delete covers grouped stems - SourcesView: unit_count badge (e.g. "2686 units") on grouped rows; delete confirmation names the unit count when deleting a group - Bump version to v0.6.1	2026-06-02 04:35:26 -07:00
pyr0ball	9cd7450591	chore: bump version to 0.6.0 Release summary: - #60 split incidents tables to turnstone-incidents.db (eliminates FTS5 write lock starvation) - #41 Hybrid-BERT label mapping shim (7-class vocabulary support in classifier) - #15 hybrid BM25 + vector re-ranking for diagnose search (semantic=True, alpha=0.6/beta=0.4) - #32 domain-view mapping: 42 patterns annotated across 10 domains, by_domain in diagnose summary	2026-06-01 20:52:35 -07:00
pyr0ball	ce2a2b55a6	Merge feat/32-domain-view: domain-view mapping for patterns and diagnose output (#32)	2026-06-01 20:01:19 -07:00
pyr0ball	eac9a4ba28	Merge feat/15-hybrid-rag: hybrid BM25 + vector re-ranking for diagnose search (#15)	2026-06-01 20:00:02 -07:00
pyr0ball	b1f3d68724	feat: domain-view mapping for patterns and diagnose output (#32) Adds a domain: field to the pattern taxonomy and surfaces per-domain hit counts in diagnose summaries for faster triage. Changes: - LogPattern gains domain: str = "" (backward-compatible default) - load_patterns() reads domain from YAML via p.get("domain", "") - All 42 patterns in default.yaml annotated across 10 domains: service_health | networking | auth | storage | memory | kernel | power | web_proxy | media | gpu - _pattern_domain dict built at startup from compiled patterns - _domain_counts() helper: maps matched_patterns tags to domains, counts hits per domain across a result set - diagnose POST: summary includes by_domain: {domain: count} - diagnose stream: summary SSE event includes by_domain when pattern_domain is provided (passed from rest.py at startup) - /api/search gains ?domain= filter: post-filters results to entries whose matched_patterns include at least one tag in the given domain Test fixtures: patch _pattern_domain={} and CONTEXT_DB_PATH in test_blocklist_endpoints.py and test_glean_tautulli.py (worktree has no data/ dir; same fix as feat/60-incidents-db). 372 tests passing. Closes: #32	2026-06-01 19:57:16 -07:00
pyr0ball	1abdcfb1f3	feat: hybrid BM25 + vector re-ranking for diagnose search (#15) Adds late-fusion hybrid search to Turnstone's log retrieval layer: hybrid_score = 0.6 * bm25_normalized + 0.4 * cosine_similarity Implementation: - _bm25_search() extracts the existing FTS5 BM25 path as a named helper - _hybrid_search() fetches an oversized BM25 candidate pool (5x limit, min 100), embeds the query and each candidate text in-process via the existing embeddings service, normalizes BM25 rank to [0,1], combines with cosine similarity, and re-ranks - search() gets semantic=False param that dispatches to _hybrid_search() when True; pure BM25 remains the default for all existing call sites - diagnose_stream() enables semantic=True so symptom-based queries ("database connection failed") surface semantically equivalent entries ("ECONNREFUSED", "backend gone away", "max retries exceeded") - /api/search REST endpoint exposes ?semantic=true query param Graceful degradation: falls back silently to pure BM25 when the embedding backend is unavailable (EMBEDDING_AVAILABLE=False) or when embed_batch raises an exception. No new infra — in-process numpy cosine, no vector DB. 11 new tests: BM25 helper, hybrid re-ranking, fallback paths, dispatcher. 372 + 11 = 383 tests passing. Closes: #15	2026-06-01 18:13:09 -07:00
pyr0ball	bd3923e163	fix: split incidents tables to dedicated turnstone-incidents.db (#60) FTS5 bulk-insert write locks starved the incident API and bundle endpoints during log bursts (sonarr/radarr, high-volume docker sources). Fix mirrors the context_facts split (context -> turnstone-context.db): - Add INCIDENTS_DB_PATH / TURNSTONE_INCIDENTS_DB env var in rest.py - Add _INCIDENTS_SCHEMA, ensure_incidents_schema(), and migrate_incidents_to_dedicated_db() in glean/pipeline.py - Stub out incidents/received_bundles/sent_bundles in _SCHEMA (no-op CREATE IF NOT EXISTS) so legacy single-file deployments still open - Thread incidents_db_path through diagnose_stream -> run_pipeline -> FalsePositiveSuppressor.suppress -> _fetch_resolved_incidents - One-shot migration on startup: copy existing rows from main DB to incidents DB via INSERT OR IGNORE (idempotent, safe to re-run) - Fix test_blocklist_endpoints fixtures to patch CONTEXT_DB_PATH and INCIDENTS_DB_PATH alongside DB_PATH (worktree has no data/ dir) 372 tests passing. Closes: #60	2026-06-01 15:54:23 -07:00
pyr0ball	1131816666	feat: bundle PII sanitization, onboarding wizard, NL source addition (#51, #52, #53) Bundle export (#51): - _redact_text() with 5 compiled regex patterns (IPv4, email, user=, host=, password=) - build_bundle(sanitize=False) — per-entry redaction at export time - sent_bundles table tracks every outgoing export (GET and POST /send) - GET /api/sent-bundles exposes history; SentBundle model added - BundlesView: Received/Sent tabs, sanitized badge, 5-entry preview, re-download - IncidentsView: Sanitize PII checkbox next to Send Bundle Onboarding wizard (#52): - app/services/discover.py: journald/Docker/file detection (best-effort, safe in containers) - GET /api/setup/status, /discover, POST /api/setup/write (additive, appends to existing) - SetupWizard.vue: 3-step Detect → Select → Confirm - Step 1 shows grouped summary (journald/file/docker counts) - Step 2: collapsible groups with All/None section toggles - journald + file: pre-selected; docker: collapsed, none pre-selected - Step 3: YAML preview before write - SourcesView: shows wizard on first run; Add Source button reuses it NL source addition (#53): - app/services/nl_source.py: keyword shortcut (13 well-known apps) + LLM fallback - POST /api/setup/interpret: keyword → LLM → null (graceful fallback) - NL field in wizard step 2; manual form shown when interpretation fails - Added sources appear in grouped list immediately	2026-05-29 14:14:28 -07:00
pyr0ball	054ebfa0e3	feat(diagnose): tech-level post-processor, offline mode, API auth, context harvest - synthesizer: 3 system prompts (sysadmin/homelab/executive) selected by tech_level pref - settings: tech_level selector (UI + backend) persisted in preferences.json - QuickCapture: shows active level label in diagnosis card header - TURNSTONE_OFFLINE_MODE=1: sets HF_HUB_OFFLINE + TRANSFORMERS_OFFLINE before lib load - TURNSTONE_API_KEY: bearer token auth on all /api/ routes (hmac.compare_digest) - /health always open; unset key = no auth (backward compatible) - docs/air-gapped-deployment.md: full offline deployment guide - scripts/harvest_docs.py: generalized context doc bulk-uploader with manifest support - scripts/manifests/: heimdall-devops.yaml (10 docs ingested) + example.yaml template - fix: _ingest_upload -> _glean_upload in context doc upload endpoint (was 500) Closes: #56 Closes: #45 Closes: #47 Closes: #49 Closes: #21	2026-05-28 08:51:05 -07:00
pyr0ball	7f49961ec4	fix(db): add timeout=30s to all sqlite3.connect() calls across app Watcher, REST endpoints, services (search, incidents, blocklist), MCP server, context retriever, embedder, glean_scheduler, and doc_upload all used the default 5-second SQLite busy timeout. During collect glean write phases, watcher flush threads were hitting 'database is locked' errors when the glean held the write lock longer than 5 seconds. All connections now use timeout=30.0, matching the pipeline fix from commit `5a9281a`. No logic changes.	2026-05-26 23:12:48 -07:00
pyr0ball	3cfd587d16	fix: separate context KB into own SQLite file to eliminate write-lock contention context_facts, context_documents, and context_chunks now live in turnstone-context.db (sibling of turnstone.db). The glean scheduler held write locks on the main DB long enough to cause 5-second timeout failures on context fact inserts; separate files have independent WAL write locks so they never contend. Changes: - pipeline.py: extract _CONTEXT_SCHEMA + ensure_context_schema() - rest.py: CONTEXT_DB_PATH (TURNSTONE_CONTEXT_DB env var, defaults to sibling file); init via ensure_context_schema(); all context routes pass CONTEXT_DB_PATH; diagnose_stream receives context_db_path kwarg - diagnose/__init__.py: diagnose_stream() accepts context_db_path (falls back to db_path for backward compat); retrieve_context uses it - store.py: sqlite3.connect() timeout=30.0 — Python driver retry loop is independent of PRAGMA busy_timeout; needed for any remaining contention during test or single-file deployments Closes: #42	2026-05-25 21:19:32 -07:00
pyr0ball	6fec294a53	feat: fingerprint-based incremental glean — skip unchanged files (#30) - Add glean_fingerprints table to schema (sha256 + mtime + size) - _fingerprint(), _fp_unchanged(), _save_fingerprint() helpers in pipeline.py - _glean_files() now checks fingerprint; skips file if hash unchanged - force=True param threads through glean_dir → glean_file → glean_sources - POST /api/tasks/glean and POST /api/sources/{id}/glean accept force=true - 14 unit tests in tests/test_glean_fingerprint.py, all passing Closes: #30	2026-05-25 11:01:18 -07:00
pyr0ball	41fc89c474	feat: SSH remote glean — transport layer, pipeline integration, REST + UI (#22) Closes turnstone#22. ## Transport layer (app/glean/ssh.py) - SSHTransport context manager: key-only auth, paramiko backend - SSHConnectionError / SSHCommandError exception hierarchy - exec_stream() generator: yields stdout lines, raises SSHCommandError on non-zero exit (isinstance(int) guard for test-mock safety) - Command builders: _build_journald_command, _build_syslog_command, _build_plaintext_command, _build_docker_command - 18 unit tests in tests/test_glean_ssh.py ## Pipeline integration (app/glean/pipeline.py) - _stream_and_write(): per-item error isolation — SSHCommandError skips one glean item without aborting the rest of the host connection - _glean_ssh_source(): one SSHTransport per host, dispatches all glean items (journald/syslog/plaintext/docker); SSHConnectionError aborts host - glean_sources(): splits local vs SSH sources; local → _glean_files(); SSH → _glean_ssh_source(); shared compiled patterns and DB connection - glean_ssh_source(): public wrapper for REST use — manages DB connection, pattern compilation, FTS rebuild lifecycle - 15 integration tests in tests/test_glean_pipeline_ssh.py - All 285 tests passing ## REST layer (app/rest.py) - GET /api/sources/configured: reads sources.yaml and enriches with DB stats; SSH sources appear before first glean (entry_count=0); sub-source IDs (rack01/journald, rack01/docker/myapp) aggregated per host entry - POST /api/sources/{id}/glean: detects transport:ssh and dispatches to glean_ssh_source() wrapper; local sources unchanged - Import: glean_ssh_source as _glean_ssh_source ## Frontend (web/src/views/SourcesView.vue) - Fetches /api/sources/configured (primary) + /api/sources (DB-only) in parallel; merges into unified SourceRow list - SSH sources show: ssh badge (with user@host tooltip), glean-type pills (journald/syslog/docker/etc.), host subtitle - SSH sub-source IDs (rack01/journald) suppressed from the DB-only list since they are covered by the parent SSH row - DB-only sources (uploads) appear below configured sources with 'uploaded' badge; reglean button disabled (not in sources.yaml) - Delete zeroes out configured-source stats in-place rather than removing the row (so the source remains visible for re-gleaning)	2026-05-21 12:37:30 -07:00
pyr0ball	828b69768a	refactor: rename ingest → glean throughout codebase Renames the app/ingest/ package to app/glean/ and updates all references across Python modules, shell scripts, Vue components, tests, and documentation. Intentionally preserved: - SQLite column name ingest_time (avoids schema migration) - RetrievedEntry.ingest_time field (maps to the column above) - Any public-facing JSON keys that reference ingest_time Changes by category: - app/ingest/ → app/glean/ (full package move, all parsers) - app/tasks/ingest_scheduler.py → app/tasks/glean_scheduler.py - scripts/ingest_corpus.py → scripts/glean_corpus.py - tests/test_ingest_*.py → tests/test_glean_*.py - Docstrings, log messages, comments: ingest → glean - Env var: TURNSTONE_INGEST_INTERVAL → TURNSTONE_GLEAN_INTERVAL - Shell scripts: glean.log, glean_corpus.py references - README.md: multi-source ingest → multi-source glean - .env.example: updated env var name - patterns/: new diagnostic patterns from 2026-05-20 SSH incident (service_crash_loop, pkg_daemon_restart, ssh_forward_conflict) - SourcesView.vue: pipeline label updated - All test import paths updated to app.glean.* 285 tests passing.	2026-05-20 23:02:55 -07:00
pyr0ball	63c742a708	feat: periodic ingest scheduler + Orchard submission pipeline Adds asyncio-native background scheduler (TURNSTONE_INGEST_INTERVAL, default 900s) that runs batch ingest then pushes pattern-matched entries to a remote CF harvest endpoint (TURNSTONE_SUBMIT_ENDPOINT). - app/tasks/ingest_scheduler.py: IngestState, scheduler_loop, run_once, submit_matched, _query_matched_since — asyncio.Lock prevents concurrent runs - app/rest.py: POST /api/ingest/batch (pre-parsed entry receiver), GET /api/tasks/ingest/status, POST /api/tasks/ingest (manual trigger), TURNSTONE_INGEST_INTERVAL + TURNSTONE_SUBMIT_ENDPOINT env wiring in lifespan - docker-compose.submissions.yml: segregated contrib1 (8536) + contrib2 (8537) receiving instances on Heimdall, isolated DBs under /devl/docker/turnstone-submissions/<node>/ - podman-standalone.sh: pass-through for TURNSTONE_SUBMIT_ENDPOINT + TURNSTONE_SOURCE_HOST - app/ingest/mqtt_subscriber.py: MQTT log source adapter - app/ingest/wazuh.py: Wazuh alert JSON adapter - tests/test_ingest_wazuh.py: Wazuh adapter test suite	2026-05-20 08:57:25 -07:00
pyr0ball	ed0a4bb469	feat: Alpha milestone — corpus management, upload ingest, harvester agent Closes #1 (incident tagging — already implemented), #2, #3, #5. - feat(api): DELETE /api/sources/{id} — purge entries + FTS rows for a source - feat(api): POST /api/sources/{id}/ingest — re-ingest from sources.yaml - feat(api): POST /api/ingest/upload — multipart log file upload with auto-detect - feat(ui): SourcesView reingest + delete buttons and upload file input (#2) - feat(harvester): harvester.py push + incident subcommands (#5) - feat(harvester): Dockerfile, docker-compose.yml, harvester.sh (containerless) - feat(config): GPU_SERVER_URL → CF_ORCH_URL resolution + write-back (#20) - docs: .env.example, README Configuration table, version bump to 0.5.0	2026-05-19 07:45:58 -07:00
pyr0ball	5263a67fb3	fix(blocklist): get_candidate for O(1) push/unblock, 400 on malformed device_names JSON	2026-05-15 21:19:02 -07:00
pyr0ball	1e186591d7	feat(blocklist): 6 REST endpoints + Pi-hole settings fields Add blocklist candidate listing, scan trigger, status update, push/unblock to Pi-hole, and connection test endpoints. Add pihole_url/version/api_key and router_source_ids/device_names fields to SettingsBody and prefs handling in patch_settings. Add PiholeClient.__post_init__ validation so 503 fires naturally when url/api_key are unconfigured (mock-safe: bypassed in tests).	2026-05-15 21:15:09 -07:00
pyr0ball	842d83b68e	chore: remove stale load_patterns import from rest.py	2026-05-13 21:52:03 -07:00
pyr0ball	279b01902f	fix: tautulli — hmac token compare, public pattern loader, startup cache, endpoint tests	2026-05-13 19:08:49 -07:00
pyr0ball	581e0314b4	fix: tautulli — entry_id collision on missing ts, token settings, test coverage	2026-05-13 19:04:07 -07:00
pyr0ball	4fbac2554e	feat: Tautulli webhook ingest endpoint — plex events -> log_entries POST /turnstone/api/ingest/tautulli accepts Tautulli notification agent payloads and stores them as log_entries under source 'tautulli'. Severity maps error->CRITICAL, buffer->WARN, all others->None. Optional bearer token auth via X-Tautulli-Token header + tautulli_token pref. FTS index rebuilt as a background task after each write. 28 new tests, all passing.	2026-05-13 18:41:03 -07:00
pyr0ball	d8c3eba0f8	feat: context REST API — docs, facts, wizard, and debug endpoints Wires the context/RAG layer into FastAPI via a dedicated _ctx router (/turnstone/api/context/*): document upload (POST/GET/DELETE /docs), fact CRUD (POST/GET/DELETE /facts), wizard state machine (/wizard/schema, /wizard/step, /wizard/apply), and a debug search endpoint (/debug/search). All blocking DB calls are dispatched via asyncio.to_thread to keep the event loop free.	2026-05-13 16:31:07 -07:00
pyr0ball	784a4072b4	feat: SSE streaming diagnose, severity filter pills, per-source-cap search - diagnose_stream() async generator: status/summary/entries/reasoning/done events - POST /api/diagnose/stream SSE endpoint wired in rest.py - entries_in_window() gains per_source_cap to prevent high-volume sources crowding results - QuickCapture: severity filter pills, filtered entries view, pipeline status spinner - llm.py: remove overly broad HTTPStatusError re-raise	2026-05-13 15:45:35 -07:00
pyr0ball	caa85b3d30	feat: source-scoped diagnose; multi-node Docker log collection - Diagnose: add source_filter param threaded through entries_in_window, search, _diagnose, and DiagnoseRequest — clicking diagnose on a dashboard source now scopes both keyword and window hits to that source - QuickCapture: read route.query.source; show scope badge with clear ✕; auto-run when source param is present without a query - DashboardView: pass source= (not q=) when navigating to diagnose - collect_cluster_logs.sh: auto-discover Docker containers on all nodes (Heimdall non-watched, Navi, Strahl via SSH); collect Cass Plex logs via SSH; write to per-node dirs for directory-mode ingest - turnstone-cluster.service: add --reload for hot-reload during dev	2026-05-13 08:10:42 -07:00
pyr0ball	7d46314e86	feat: switch LLM backend to OpenAI-compat; add cf-orch remote inference support Turnstone now calls /v1/chat/completions instead of Ollama's /api/generate. This format works with both local Ollama (>=0.1.24) and a remote cf-orch coordinator, enabling GPU-less nodes like Contributor2's to route diagnoses through the cluster without any local model. - llm.py: OpenAI-compat messages format, optional Bearer auth header - diagnose.py: thread llm_api_key through the call chain - rest.py: llm_api_key pref (default empty), SettingsBody field, passed to diagnose - SettingsView.vue: API Key field, label updated from "Ollama URL" to "LLM Endpoint URL" - tests: updated mocks for new response shape; added bearer token assertion test	2026-05-12 12:58:38 -07:00
pyr0ball	3fd81e5ab1	feat: live watch mode — tail journald/docker/podman sources continuously (#4) Adds background watcher that tails active log sources and ingests entries in near-real-time, keeping the DB fresh without manual ingest runs. - app/watch/watcher.py: Watcher + WatchSource using subprocess + select loop; flushes every 10s or 100 lines; syncs FTS index every 3 flushes - patterns/watch.yaml: declarative source config (journald/docker/podman) - app/rest.py: lifespan context manager starts/stops watcher on app startup/shutdown; GET /api/watch/status + POST /api/watch/reload - web/src/views/DashboardView.vue: live/manual indicator chip + stale banner copy adapts to whether live watching is active - tests/test_watch_watcher.py: 16 tests covering config load, command building, docker timestamp stripping, orchestrator lifecycle Closes #4	2026-05-11 15:34:13 -07:00
pyr0ball	c12cc6d68a	feat: severity overrides + last_ingested timestamp on dashboard	2026-05-11 13:00:11 -07:00
pyr0ball	0882083755	feat: LLM reasoning layer — Ollama summarization on diagnose results	2026-05-11 11:35:07 -07:00
pyr0ball	05ad314ed5	feat: add POST /api/diagnose and GET/PATCH /api/settings endpoints	2026-05-11 09:10:58 -07:00
pyr0ball	457b4fd7ae	feat: incident labeling, bundle export, and push/receive flow Turnstone incidents now carry an issue_type tag (free-text with datalist suggestions) used to categorize patterns for signature building. Backend: - Incident model gains issue_type; additive ALTER TABLE migration keeps existing DBs working without a full schema rebuild - New received_bundles table stores incoming JSON bundles with indexes on bundled_at and issue_type - build_bundle() assembles incident + related log entries into a versioned bundle dict; store_bundle()/list_bundles()/get_bundle() for the receiver - POST /api/incidents/{id}/send — pushes bundle to TURNSTONE_BUNDLE_ENDPOINT - GET /api/incidents/{id}/bundle — export without sending - POST /api/bundles — receive and store an incoming bundle - GET /api/bundles — list all received bundles - TURNSTONE_SOURCE_HOST and TURNSTONE_BUNDLE_ENDPOINT env vars; auto-set source host from hostname in podman-standalone.sh Frontend: - Incidents form: issue_type field with datalist suggestions; Type column in the table; Send Bundle button + status feedback in the detail drawer - New BundlesView: collapsible bundle rows, inline JSON parse (no extra round-trip), Export JSON download button - Router and nav updated with /bundles route	2026-05-11 05:23:55 -07:00
pyr0ball	f5893c6003	feat: dashboard view, stats API, and composite index for query perf - Add GET /api/stats endpoint with 24h windowed aggregation (criticals, errors, per-source health, recent criticals list) - Fix timestamp format bug: strftime('%Y-%m-%dT%H:%M:%S', ...) to match stored ISO-8601 T-separated timestamps (datetime('now') uses space) - Add composite index idx_ts_repeat(timestamp_iso, repeat_count) — drops stats query from 3.5 s to <1 ms by resolving both WHERE conditions from the index without table row fetches - New DashboardView: 3 stat cards, source health table with health dots, diagnose-per-source button, recent criticals panel, zero-state card - Router default / → /dashboard; Dashboard first in nav - DiagnoseView: reads ?q= query param on mount and auto-runs; shows formatted LLM summary block - LogEntryRow: expand/collapse for long entries (>200 chars or multiline)	2026-05-11 03:41:55 -07:00
pyr0ball	19d3827e2d	fix: bypass FTS ranking for named-source error retrieval When diagnose() auto-detects a source name, FTS keyword scoring can bury real errors whose text doesn't match the symptom query. Add recent_source_errors() — a plain-SQL scan ordered by timestamp — so the most recent errors from a known service always surface regardless of keyword overlap.	2026-05-10 08:14:23 -07:00
pyr0ball	d63dc2a714	feat: incident tagging — DB schema, CRUD service, REST API (#1) - Add `incidents` table to SQLite schema (id, label, started_at, ended_at, notes, created_at, severity) - Extract `ensure_schema()` from ingest pipeline so tables are always created at startup, not only during ingest - New `app/services/incidents.py`: create/list/get/delete + time-window entry association (FTS keyword search + raw window fallback) - New `entries_in_window()` in search.py: plain SQL scan for incident detail when keyword FTS returns nothing - REST endpoints: POST/GET /api/incidents, GET/DELETE /api/incidents/{id} - Incident detail returns up to 100 associated log entries sorted by timestamp, prioritising FTS keyword hits then ERROR/CRITICAL then all	2026-05-11 15:37:14 -07:00
pyr0ball	a3e8ebb0d9	fix: mount all routes at /turnstone prefix for direct LAN access Vite builds with base='/turnstone/' so asset paths in index.html are /turnstone/assets/*. Serving FastAPI at root / meant direct hits to port 8534 got index.html for asset requests (blank page). - All routes now under /turnstone (APIRouter prefix + StaticFiles mount at /turnstone/assets + SPA catch-all at /turnstone/{path}) - Root / redirects to /turnstone/ - Caddy block reverted to no-strip: both direct LAN and Caddy access hit the same paths, no per-host routing differences	2026-05-08 17:45:34 -07:00
pyr0ball	9e46cd4c7f	fix: serve Vue SPA from FastAPI, drop separate port 8535 Python http.server can't do SPA routing and Caddy was forwarding /turnstone/* paths that the static server couldn't resolve. - app/rest.py: mount web/dist/assets as StaticFiles; add SPA catch-all route that serves index.html for any unmatched path - manage.sh: start/stop/status simplified to single process on :8534; remove UI_PORT / UI_PID_FILE; drop http.server invocation - Caddyfile: replace split API/:8534 + SPA/:8535 block with a single strip_prefix + reverse_proxy to :8534	2026-05-08 17:27:46 -07:00
pyr0ball	eef84d55be	feat: Vue 3 frontend and FastAPI REST layer - app/rest.py: FastAPI app wrapping search/diagnose/sources with CORS - web/: Vue 3 + Vite + UnoCSS + Pinia frontend at port 8535 - LogSearchView: sidebar filters (source, severity, limit) + FTS search - DiagnoseView: layered symptom investigation matching MCP diagnose tool - SourcesView: corpus table with entry count, error count, time range - LogEntryRow: severity badge, pattern chips, repeat count, timestamp - StatusDot: live API health indicator in nav - scripts/start_dev.sh: launch FastAPI (:8534) + Vite dev server (:8535) - .gitignore: add web/node_modules/ and web/dist/ - Caddy: /turnstone* route added to menagerie.circuitforge.tech block (API → :8534 with /turnstone strip, SPA fallback → :8535)	2026-05-08 16:27:59 -07:00

48 commits
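
Note on f5893c6003 (timestamp format fix): the following is a minimal sketch of why a space-separated cutoff miscounts rows stored with a T separator. It is not the project's actual stats query. The two-column `log_entries` table, the `count_last_hour` helper, and the fixed `NOW` value are assumptions made for illustration; only the `timestamp_iso` column name and the two SQLite expressions come from the commit message.

```python
import sqlite3

# Fixed "current time" so the result is reproducible (assumption for the demo).
NOW = "2026-05-11 04:00:00"

conn = sqlite3.connect(":memory:")
# Simplified, hypothetical schema: the real table has more columns.
conn.execute("CREATE TABLE log_entries (timestamp_iso TEXT, message TEXT)")
conn.executemany(
    "INSERT INTO log_entries VALUES (?, ?)",
    [
        ("2026-05-11T02:00:00", "two hours old"),
        ("2026-05-11T03:30:00", "thirty minutes old"),
    ],
)


def count_last_hour(cutoff_sql: str) -> int:
    """Count rows at or after the cutoff produced by the given SQL expression."""
    sql = f"SELECT COUNT(*) FROM log_entries WHERE timestamp_iso >= {cutoff_sql}"
    return conn.execute(sql, (NOW,)).fetchone()[0]


# datetime() yields 'YYYY-MM-DD HH:MM:SS'. Timestamps are compared as text,
# and 'T' sorts after ' ', so every same-day row passes the filter.
space_count = count_last_hour("datetime(?, '-1 hours')")

# strftime() with a literal T yields the same shape as the stored values,
# so the text comparison agrees with chronological order.
t_count = count_last_hour("strftime('%Y-%m-%dT%H:%M:%S', ?, '-1 hours')")

print(space_count, t_count)
```

With the space-separated cutoff, the two-hour-old row is counted even though it falls outside the one-hour window; the T-separated cutoff counts only the recent row.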