Files
hk-ipo/data/snapshots/sync_runs.csv
T
geometrybase 9aab267f80 Run full HKEX document backfill
Request:
- Remove small-batch guidance and execute the HKEX document archiver across all open T0/T1 sync tasks in one run.

Changes:
- Make archive_hkex_documents.py process every open T0/T1 ticker by default when --limit is omitted.
- Add per-ticker progress output and keep full refreshes moving if one ticker fails.
- Suppress noisy pypdf warnings during large official document extraction.
- Update archivist and README instructions to show the full-run command without batch notes.
- Archive official HKEXnews prospectus and allotment-results PDFs under repo-relative data/raw paths.
- Refresh hk_ipo.sqlite and CSV snapshots for parsed T0/T1 fields, source_refs, sync_runs, sync_tasks, and ticker_sync_state.

Execution:
- Ran .venv/bin/python scripts/archive_hkex_documents.py --as-of 2026-06-15T09:00:00Z.
- Selected 284 open T0/T1 tickers, processed 210 tickers, and archived 398 source files.
- Left 74 tickers as missing target docs because title search did not return target prospectus/allotment documents for this pass.

Verification:
- Parsed archivist scripts with Python ast.
- Confirmed README, archivist skill, and archiver script no longer contain batch guidance.
- Ran git diff --check.
- Checked SQLite integrity and DB/snapshot row counts.
- Verified 706 source_refs use relative local paths, all files exist, and SHA256 hashes match.

Next useful context:
- Current source_refs count is 706 and ipo_demand count is 134.
- Sync ledger now reports 414 complete, 1595 pending_due, and 42 pending_not_due states.
2026-06-15 07:57:33 +00:00

1.6 KiB

1sync_run_idmodeas_ofstarted_atfinished_atstatusnotes
2sync_state_20260615T073000Zrecent_ipo_list_refresh2026-06-15T07:30:00Z2026-06-15T07:30:00Z2026-06-15T07:30:00ZcompleteDerived ticker sync state refreshed.
3sync_state_20260615T081500Zhkex_document_archive2026-06-15T08:15:00Z2026-06-15T08:15:00Z2026-06-15T08:15:00ZcompleteDerived ticker sync state refreshed.
4sync_state_20260615T082000Zhkex_document_archive2026-06-15T08:20:00Z2026-06-15T08:20:00Z2026-06-15T08:20:00ZcompleteDerived ticker sync state refreshed.
5sync_state_20260615T082500Zhkex_document_archive2026-06-15T08:25:00Z2026-06-15T08:25:00Z2026-06-15T08:25:00ZcompleteDerived ticker sync state refreshed.
6sync_state_20260615T083000Zhkex_document_archive2026-06-15T08:30:00Z2026-06-15T08:30:00Z2026-06-15T08:30:00ZcompleteDerived ticker sync state refreshed.
7sync_state_20260615T083500Zhkex_document_archive2026-06-15T08:35:00Z2026-06-15T08:35:00Z2026-06-15T08:35:00ZcompleteDerived ticker sync state refreshed.
8sync_state_20260615T084500Zhkex_document_archive2026-06-15T08:45:00Z2026-06-15T08:45:00Z2026-06-15T08:45:00ZcompleteDerived ticker sync state refreshed.
9sync_state_20260615T085000Zhkex_document_archive2026-06-15T08:50:00Z2026-06-15T08:50:00Z2026-06-15T08:50:00ZcompleteDerived ticker sync state refreshed.
10sync_state_20260615T090000Zhkex_document_archive2026-06-15T09:00:00Z2026-06-15T09:00:00Z2026-06-15T09:00:00ZcompleteDerived ticker sync state refreshed.
11sync_state_seed_2026_06_15bootstrap_state_refresh2026-06-15T06:30:00Z2026-06-15T06:30:00Z2026-06-15T06:30:00ZcompleteDerived ticker sync state refreshed.