Mirror of https://github.com/Tyrrrz/DiscordChatExporter.git (synced 2026-06-09 15:52:37 -06:00)
One command validates compose, auth, config, and seeded archives before bootstrap or cron. Includes offline smoke test (14 smokes total).
Recurring scrape operator checklist
Use this after cloning or opening the source repo (DiscordChatExporter, not the GUI zip alone).
One-time setup
1. ./scripts/verify-operator-ready.sh: jq, compose, auth, and archive folders.
2. cp scrape.env.example scrape.env and set DISCORD_TOKEN, or ./scripts/sync-token-from-gui.sh --force (reads GUI Settings.dat).
3. ./scripts/bootstrap-recurring-scrape.sh --dry-run: confirm every enabled target has seeded JSON under output_dir.
4. ./scripts/bootstrap-recurring-scrape.sh: verify archives, build image, preflight Discord.
5. ./scripts/run-documents-scrape.sh: first incremental append-only scrape.
6. ./scripts/prove-incremental-append.sh --target <name>: optional grow-only proof.
7. ./scripts/audit-archive-json.sh: optional; lists invalid JSON before cron runs.
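Before bootstrapping, it can help to confirm that step 2 actually left a token in place. The sketch below is not one of the repository's scripts: env_has_token is a hypothetical helper, and it assumes scrape.env uses plain KEY=value lines (optionally prefixed with "export"). The real readiness check may validate auth differently.

```shell
#!/bin/sh
# Hypothetical helper (not part of the repo): succeed only when the given
# env file exists and defines a non-empty DISCORD_TOKEN.
# Assumption: plain KEY=value lines, optionally prefixed with "export ".
env_has_token() {
    env_file=$1
    [ -f "$env_file" ] || return 1
    # After "DISCORD_TOKEN=" allow one optional quote, then require a
    # character that is neither a quote nor whitespace, so that empty
    # values such as DISCORD_TOKEN= or DISCORD_TOKEN="" are rejected.
    grep -Eq "^(export[[:space:]]+)?DISCORD_TOKEN=[\"']?[^\"'[:space:]]" "$env_file"
}
```

One possible use: env_has_token scrape.env || echo "set DISCORD_TOKEN in scrape.env" >&2. The helper only checks that a value is present; it does not contact Discord or verify that the token is valid.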
Monthly automation
./scripts/setup-cron.sh --dry-run
./scripts/setup-cron.sh --skip-preflight # after bootstrap preflight already succeeded
Defaults: first day of month at 04:00. Override with --interval weekly, --at HH:MM, or --cron '0 4 1 * *'.
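To make the relationship between the interval, the time, and the resulting cron expression concrete, here is a minimal sketch. cron_expr is a hypothetical function, not the logic inside setup-cron.sh, and mapping "weekly" to Sunday (day-of-week 0) is an assumption; the real script may choose a different day.

```shell
#!/bin/sh
# Hypothetical helper: build a five-field cron expression from an interval
# name and an HH:MM time. Defaults mirror the documented ones
# (first day of the month at 04:00).
# Assumption: "weekly" runs on Sunday (day-of-week 0).
cron_expr() {
    interval=${1:-monthly}
    at=${2:-04:00}
    # Accept only two-digit hours and minutes separated by a colon.
    case $at in
        [0-2][0-9]:[0-5][0-9]) ;;
        *) echo "invalid time: $at" >&2; return 1 ;;
    esac
    hh=${at%:*}
    mm=${at#*:}
    # The pattern above still lets 24:00 through 29:59 pass, so bound the hour.
    [ "$hh" -le 23 ] || { echo "invalid hour: $hh" >&2; return 1; }
    # Drop one leading zero so "04" becomes "4" and "00" becomes "0".
    hh=${hh#0}
    mm=${mm#0}
    case $interval in
        monthly) echo "$mm $hh 1 * *" ;;
        weekly)  echo "$mm $hh * * 0" ;;
        *) echo "unknown interval: $interval" >&2; return 1 ;;
    esac
}
```

Called with no arguments, cron_expr prints 0 4 1 * *, the same expression shown in the --cron example above.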
Installed jobs are marked with a # BEGIN discord-scrape comment, which shows up in crontab -l. Logs append to logs/discord-scrape.log.
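The marker makes it straightforward to isolate the managed entries from the rest of a crontab. The sketch below assumes the block is closed by a matching "# END discord-scrape" line; only the BEGIN marker is documented here, so treat the END marker and the managed_cron_block name as assumptions.

```shell
#!/bin/sh
# Hypothetical helper: print only the managed block from crontab text
# supplied on stdin.
# Assumption: the block ends with a "# END discord-scrape" line.
managed_cron_block() {
    # sed -n suppresses default output; p prints the inclusive range
    # from the BEGIN marker through the END marker.
    sed -n '/^# BEGIN discord-scrape/,/^# END discord-scrape/p'
}
```

Typical use would be crontab -l | managed_cron_block, followed by tail -n 50 logs/discord-scrape.log to check the most recent run. If the END marker is absent, sed prints from the BEGIN line to the end of the input.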
Narrow a run
./scripts/run-documents-scrape.sh --target KotOR_discord_msgs
./scripts/setup-cron.sh --target KotOR_discord_msgs --channel CHANNEL_ID
GUI zip only
See ../DiscordChatExporter.linux-x64/RECURRING-SCRAPE.md or run ../DiscordChatExporter.linux-x64/bootstrap-recurring-scrape.sh.
Validate scripts after changes:
./scripts/run-all-smokes.sh
Merge / review summary: recurring-scrape-merge-readiness.md
Full detail: .docs/Recurring-Scrape-Setup.md