DiscordChatExporter/docs/recurring-scrape-operator-checklist.md
Boden 9c22a3efee docs(scrape): track GUI zip bridge doc in source repo
Add docs/gui-zip-recurring-scrape-bridge.md and cross-links so GUI-only
users have a versioned quick-start beside the linux-x64 zip folder.
2026-05-29 16:08:36 -05:00

1.9 KiB

Recurring scrape operator checklist

Use this after cloning or opening the source repo (DiscordChatExporter, not the GUI zip alone).

One-time setup

  1. ./scripts/operator-handoff.sh — recommended: disk summary, verify-operator-ready, and documents dry-run in one step. Or ./scripts/verify-operator-ready.sh alone for prerequisites only.
  2. cp scrape.env.example scrape.env and set DISCORD_TOKEN, or ./scripts/sync-token-from-gui.sh --force (reads GUI Settings.dat).
  3. ./scripts/bootstrap-recurring-scrape.sh --dry-run — confirm every enabled target has seeded JSON under output_dir.
  4. ./scripts/bootstrap-recurring-scrape.sh — verify archives, build image, preflight Discord.
  5. ./scripts/run-documents-scrape.sh — first incremental append-only scrape.
  6. ./scripts/prove-incremental-append.sh --target <name> — optional grow-only proof.
  7. ./scripts/audit-archive-json.sh — optional; lists invalid JSON before cron runs.

Monthly automation

./scripts/setup-cron.sh --dry-run
./scripts/setup-cron.sh --skip-preflight   # after bootstrap preflight already succeeded

Defaults: first day of month at 04:00. Override with --interval weekly, --at HH:MM, or --cron '0 4 1 * *'.

Installed jobs are marked # BEGIN discord-scrape in crontab -l. Logs append to logs/discord-scrape.log.

Narrow a run

./scripts/run-documents-scrape.sh --target KotOR_discord_msgs
./scripts/setup-cron.sh --target KotOR_discord_msgs --channel CHANNEL_ID

GUI zip only

See gui-zip-recurring-scrape-bridge.md or run ../DiscordChatExporter.linux-x64/bootstrap-recurring-scrape.sh.

Validate scripts after changes:

./scripts/run-all-smokes.sh

Merge / review summary: recurring-scrape-merge-readiness.md

Full detail: .docs/Recurring-Scrape-Setup.md