DiscordChatExporter/docs/plans/2026-05-29-026-feat-documents-scrape-disk-preflight-plan.md
Boden 32b7f47d45 feat(scrape): disk-only verify flag and documents scrape preflight
run-documents-scrape.sh checks archive free space before live Discord
calls; verify-operator-ready.sh adds --disk-only for lightweight checks.
2026-05-29 15:54:08 -05:00

---
title: "feat: Disk preflight on documents scrape entrypoint"
type: feat
status: complete
date: 2026-05-29
origin: /lfg — plan 025 added disk checks to verify-operator-ready but run-documents-scrape bypassed them
---
# feat: Disk preflight on documents scrape entrypoint
## Summary
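Operators often run `./scripts/run-documents-scrape.sh` directly, and the monthly cron job uses the host runner. Plan 025 added the archive disk check to `verify-operator-ready.sh`, but `run-documents-scrape.sh` bypassed it. Call the same check before any live Discord scrape so a full disk fails fast with a clear message.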
## Requirements
| ID | Requirement |
|----|-------------|
| R1 | `verify-operator-ready.sh --disk-only` runs config parse + `require_archive_disk_space` only |
| R2 | `run-documents-scrape.sh` invokes disk check before preflight/scrape (not on `--dry-run`) |
| R3 | `documents-scrape-smoke.sh` covers `--disk-only` success path with `DCE_MIN_FREE_MB=0` |
| R4 | `run-all-smokes.sh` still passes |
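
The check behind R1 and R2 could look roughly like the sketch below. This is not the repository's implementation: only the names `require_archive_disk_space`, `DCE_MIN_FREE_MB`, and `--dry-run` come from this plan. The helper `free_mb_for`, the wrapper `disk_check_unless_dry_run`, the 1024 MB default, and passing the archive directory as the first argument are all assumptions made for illustration.

```shell
#!/usr/bin/env bash
# Illustrative sketch; the repository's require_archive_disk_space may differ.

# Hypothetical helper: print free space, in whole MB, on the filesystem
# that holds the given path. Prints nothing if df cannot read the path.
free_mb_for() {
  # -P forces one POSIX-format record per filesystem; column 4 is available KB.
  df -Pk "$1" 2>/dev/null | awk 'NR == 2 { print int($4 / 1024) }'
}

# Fail with a clear message when the archive filesystem has less than
# DCE_MIN_FREE_MB megabytes free. The 1024 default is an assumption.
require_archive_disk_space() {
  local archive_dir="$1"
  local min_free_mb="${DCE_MIN_FREE_MB:-1024}"
  local free_mb

  # Reject non-numeric thresholds so a typo cannot silently pass the check.
  case "$min_free_mb" in
    ''|*[!0-9]*)
      echo "ERROR: DCE_MIN_FREE_MB must be a non-negative integer, got '$min_free_mb'" >&2
      return 2
      ;;
  esac

  free_mb="$(free_mb_for "$archive_dir")"
  if [ -z "$free_mb" ]; then
    echo "ERROR: could not read free space for $archive_dir" >&2
    return 1
  fi

  if [ "$free_mb" -lt "$min_free_mb" ]; then
    echo "ERROR: archive disk has ${free_mb} MB free, need at least ${min_free_mb} MB: $archive_dir" >&2
    return 1
  fi
}

# Hypothetical wrapper for R2: skip the check on --dry-run, otherwise run it
# before any preflight or scrape step.
disk_check_unless_dry_run() {
  local archive_dir="$1"
  shift
  local arg
  for arg in "$@"; do
    if [ "$arg" = "--dry-run" ]; then
      return 0
    fi
  done
  require_archive_disk_space "$archive_dir"
}
```

Setting `DCE_MIN_FREE_MB=0` makes the comparison pass whenever free space can be read at all, which is why R3 and the verification commands use it to exercise the success path without depending on the host's actual free space.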
## Verification
- `./scripts/tests/documents-scrape-smoke.sh`
- `DCE_MIN_FREE_MB=0 ./scripts/run-all-smokes.sh`