c65b20a1c4
Request: Use the project archivist workflow to update IPO target coverage for the most recent three-year window. Changes: - Add scripts/update_recent_ipo_list.py to discover HKEXnews annual new listing reports, archive XLSX sources, parse subscription-relevant IPO rows, and update SQLite plus snapshots. - Add new_listing_report_entries to preserve annual report row-level evidence. - Archive 2023-2026 Main Board new listing reports and 2024-2026 GEM new listing reports. - Seed 290 report-backed IPO targets for 2023-06-15 through 2026-06-15, skipping 10 non-IPO rows without numeric offer prices. - Refresh ipo_master, missing offering_terms fields, source_refs, ticker_sync_state, and sync_tasks. - Add openpyxl as the XLSX parser dependency and document the archivist refresh flow. - Limit sync summary output while keeping the full queue in SQLite and CSV snapshots. Verification: - Ran update_recent_ipo_list.py for 2023-06-15 to 2026-06-15 with as-of 2026-06-15T07:30:00Z. - Parsed project Python scripts with ast.parse. - Checked SQLite integrity and DB-to-snapshot row counts. - Verified source_refs paths are repo-relative, files exist, and SHA-256 hashes match. - Ran git diff --check and git diff --cached --check. - Checked for Python cache and SQLite transient files. Next useful context: - ipo_master now has 293 tickers; new_listing_report_entries has 290 report-backed targets. - Current sync queue has 2005 open tasks and 42 waiting_until_due tasks for deeper per-ticker archival stages.
380 B
380 B
| 1 | sync_run_id | mode | as_of | started_at | finished_at | status | notes |
|---|---|---|---|---|---|---|---|
| 2 | sync_state_20260615T073000Z | recent_ipo_list_refresh | 2026-06-15T07:30:00Z | 2026-06-15T07:30:00Z | 2026-06-15T07:30:00Z | complete | Derived ticker sync state refreshed. |
| 3 | sync_state_seed_2026_06_15 | bootstrap_state_refresh | 2026-06-15T06:30:00Z | 2026-06-15T06:30:00Z | 2026-06-15T06:30:00Z | complete | Derived ticker sync state refreshed. |