mirror of
https://github.com/Tyrrrz/DiscordChatExporter.git
synced 2026-06-09 15:52:37 -06:00
Add audit-archive-json and salvage-truncated-export helpers, skip invalid JSON during prove snapshots, and wire an audit smoke test into CI.
1.5 KiB
1.5 KiB
Recurring scrape operator checklist
Use this after cloning or opening the source repo (DiscordChatExporter, not the GUI zip alone).
One-time setup
cp scrape.env.example scrape.envand setDISCORD_TOKEN, or./scripts/sync-token-from-gui.sh --force(reads GUISettings.dat)../scripts/bootstrap-recurring-scrape.sh --dry-run— confirm every enabled target has seeded JSON underoutput_dir../scripts/bootstrap-recurring-scrape.sh— verify archives, build image, preflight Discord../scripts/run-documents-scrape.sh— first incremental append-only scrape../scripts/prove-incremental-append.sh --target <name>— optional grow-only proof../scripts/audit-archive-json.sh— optional; lists invalid JSON before cron runs.
Monthly automation
./scripts/setup-cron.sh --dry-run
./scripts/setup-cron.sh --skip-preflight # after bootstrap preflight already succeeded
Defaults: first day of month at 04:00. Override with --interval weekly, --at HH:MM, or --cron '0 4 1 * *'.
Installed jobs are marked # BEGIN discord-scrape in crontab -l. Logs append to logs/discord-scrape.log.
Narrow a run
./scripts/run-documents-scrape.sh --target KotOR_discord_msgs
./scripts/setup-cron.sh --target KotOR_discord_msgs --channel CHANNEL_ID
GUI zip only
See ../DiscordChatExporter.linux-x64/RECURRING-SCRAPE.md or run ../DiscordChatExporter.linux-x64/bootstrap-recurring-scrape.sh.
Full detail: .docs/Recurring-Scrape-Setup.md