turnstone/scripts/collect_cluster_logs.sh
pyr0ball 07e151b01f refactor: use live watcher + systemd timer instead of cron for cluster ingest
Local Heimdall sources (journal, Docker containers, network syslog) are now
tailed continuously by the built-in watcher via watch.yaml — no periodic
collection needed for those.

SSH collection of remote node journals is now handled by a systemd timer
(turnstone-cluster-collect.service/.timer) instead of cron.
collect_cluster_logs.sh simplified to only SSH-collect remote nodes and
trigger ingest directly.

docker-cluster.sh updated to mount:
  - /var/run/docker.sock (so watcher can run docker logs -f)
  - /run/systemd/journal (so watcher can run journalctl -f)
  - /devl/turnstone-cluster/patterns/ (cluster-specific watch.yaml)
2026-05-13 04:55:25 -07:00

50 lines
2 KiB
Bash

#!/usr/bin/env bash
# Collect recent journal logs from remote CircuitForge cluster nodes
# into /devl/turnstone-cluster/data/ for Turnstone to ingest.
#
# Local Heimdall sources (journal, Docker containers, network syslog) are
# handled by the Turnstone live watcher (watch.yaml) — no collection needed.
#
# Triggered by systemd timer: turnstone-cluster-collect.timer (every 15 min).
# Install: sudo cp <scripts>/turnstone-cluster-collect.* /etc/systemd/system/
# sudo systemctl daemon-reload && sudo systemctl enable --now turnstone-cluster-collect.timer
#
# Manual run:
# bash /Library/Development/CircuitForge/turnstone/scripts/collect_cluster_logs.sh
set -euo pipefail
DATA_DIR=/devl/turnstone-cluster/data
WINDOW="20 minutes ago"
SSH_OPTS="-o ConnectTimeout=5 -o BatchMode=yes -o StrictHostKeyChecking=no"
mkdir -p "${DATA_DIR}"
# ── Remote cluster nodes ──────────────────────────────────────────────────────
declare -A NODES=(
[navi]="${DATA_DIR}/navi-journal.jsonl"
[sif]="${DATA_DIR}/sif-journal.jsonl"
[cass]="${DATA_DIR}/cass-journal.jsonl"
[strahl]="${DATA_DIR}/strahl-journal.jsonl"
)
for node in "${!NODES[@]}"; do
outfile="${NODES[$node]}"
echo "${node}: collecting journal..."
if ssh ${SSH_OPTS} "${node}" true 2>/dev/null; then
ssh ${SSH_OPTS} "${node}" \
"journalctl --output=json --priority=0..5 --since '${WINDOW}' --no-pager 2>/dev/null || true" \
> "${outfile}" 2>/dev/null || { echo "${node}: ssh failed, skipping"; : > "${outfile}"; }
echo "${node}: $(wc -l < "${outfile}") entries"
else
echo "${node}: unreachable, skipping"
: > "${outfile}"
fi
done
# Trigger ingest of remote node journals into the running container.
docker exec turnstone-cluster python scripts/ingest_corpus.py \
--sources /patterns/sources-cluster.yaml --db /data/turnstone.db \
>> /var/log/turnstone-cluster-ingest.log 2>&1
echo "collect_cluster_logs: done"