DiscordChatExporter/scripts
Copilot c13c4167be fix(scrape): salvage stale temp exports before re-downloading
When a previous export crashes (OOM, abort, kill), the partially-
downloaded temp export under .dce-temp/ was orphaned. Subsequent
runs started the incremental from the archive's last message ID,
re-downloading everything the failed run had already fetched.

Now scrape_target() checks for orphaned temp exports before each
channel export, salvages truncated JSON (same marker-based repair
as salvage-truncated-export.sh), merges recovered messages into
the archive, and cleans up stale temp dirs. The incremental then
starts from the truly latest message.

Adds salvage-stale smoke test with truncated fixture.
2026-06-03 01:11:28 -05:00
..
lib feat(scrape): run plan, channel ledger, and all-target proof 2026-05-29 20:34:22 -05:00
tests fix(scrape): salvage stale temp exports before re-downloading 2026-06-03 01:11:28 -05:00
tools/ReadDceGuiToken fix(scrape): auth discovery, skip forbidden channels, mount host script 2026-05-28 14:57:16 -05:00
audit-archive-json.sh fix(scrape): audit and salvage corrupt archive JSON 2026-05-29 14:09:46 -05:00
bootstrap-recurring-scrape.sh feat(scrape): operator proof script and podman-compose smoke fix 2026-05-29 16:20:25 -05:00
discover-discord-token.sh fix(scrape): auth discovery, skip forbidden channels, mount host script 2026-05-28 14:57:16 -05:00
gh-approve-pr-runs.sh feat: add gh PR run approval helper for fork CI unblock 2026-05-28 00:30:49 -05:00
operator-handoff.sh feat(scrape): run plan, channel ledger, and all-target proof 2026-05-29 20:34:22 -05:00
prove-incremental-append.sh feat(scrape): offline prove smoke and snapshot-only mode 2026-05-29 14:12:18 -05:00
read-dce-gui-token.sh fix(scrape): auth discovery, skip forbidden channels, mount host script 2026-05-28 14:57:16 -05:00
run-all-smokes.sh fix(scrape): disk preflight before compose and skippable disk errors 2026-05-29 15:27:39 -05:00
run-discord-scrape-host.sh feat(scrape): run plan, channel ledger, and all-target proof 2026-05-29 20:34:22 -05:00
run-discord-scrape.sh fix(scrape): salvage stale temp exports before re-downloading 2026-06-03 01:11:28 -05:00
run-documents-scrape.sh feat(scrape): run plan, channel ledger, and all-target proof 2026-05-29 20:34:22 -05:00
run-operator-proof.sh feat(scrape): run plan, channel ledger, and all-target proof 2026-05-29 20:34:22 -05:00
run-operator-validation.sh feat(scrape): complete validation resume (8/9 targets) 2026-05-29 23:35:35 -05:00
salvage-truncated-export.sh fix(scrape): audit and salvage corrupt archive JSON 2026-05-29 14:09:46 -05:00
scrape-here.sh feat(scrape): add bootstrap-recurring-scrape one-shot operator flow 2026-05-29 13:59:04 -05:00
setup-cron.sh feat(scrape): harden preflight and cron config for Documents archives 2026-05-29 13:49:09 -05:00
setup-scrape-auth.sh feat(scrape): add Documents verify and auth bootstrap helpers 2026-05-28 01:18:05 -05:00
sync-gui-bridge-doc.sh feat(scrape): sync GUI bridge doc into linux-x64 folder 2026-05-29 16:10:17 -05:00
sync-token-from-gui.sh feat(scrape): sync token from GUI Settings.dat for live exports 2026-05-29 14:05:45 -05:00
verify-documents-archives.sh feat(scrape): add Documents verify and auth bootstrap helpers 2026-05-28 01:18:05 -05:00
verify-operator-ready.sh feat(scrape): operator proof script and podman-compose smoke fix 2026-05-29 16:20:25 -05:00