hk-ipo

Author	SHA1	Message	Date
geometrybase	943eab27cb	Add external IPO history to heat model Request: - Add historical data around T0.5 margin heat and rebuild the model. Changes: - Add external_ipo_history to store third-party historical IPO records separately from true T0.5 market-heat snapshots. - Add scripts/archive_ipohk_history.py to archive ipohk structured listed IPO history. - Archive 807 ipohk rows, including final oversubscription, one-lot win rate, grey-market return, and first-day return where available. - Extend the v0 analysis dataset with true T0.5 market-heat columns and separate external final-heat columns. - Rebuild reports/2026-06-15_analysis_model_v0.md with T0.5 coverage and external final-heat calibration. - Add a Chinese report explaining why historical final oversubscription cannot be treated as T0.5 margin snapshots. - Update analyst and archivist skills to keep T0.5 and external final history separate. Verification: - .venv/bin/python -m py_compile scripts/build_analysis_dataset.py scripts/archive_ipohk_history.py scripts/archive_t0_5_market_heat.py - .venv/bin/python scripts/build_analysis_dataset.py --as-of 2026-06-15T19:20:00Z - Python sqlite3 PRAGMA integrity_check returned ok and foreign_key_check returned zero rows. - Confirmed 807 external_ipo_history rows, 792 rows with external final oversubscription, 5 true T0.5 market-heat rows, and 297 analysis dataset rows. - git diff --cached --check Next useful context: - True T0.5 historical backtesting still requires ongoing frozen margin-heat snapshots during each IPO subscription window.	2026-06-15 16:06:56 +00:00
geometrybase	222f55c140	Add T0.5 market heat IPO analysis Request: - Test whether subscription-period T0.5 market heat data can be captured and incorporated into IPO analysis. Changes: - Add an ipo_market_heat table for non-official market-heat snapshots. - Add a VBKR/Jieli archive script for expected margin subscription multiples. - Archive the 2026-06-15T18:40:00Z heat snapshot for 01392, 02335, 06067, 06106, and 06132. - Add an experimental T0.5 overlay rule file and a Chinese cross-IPO trial report. - Update archivist and analyst skills so T0.5 remains separate from official T1 allotment demand. Verification: - .venv/bin/python -m py_compile scripts/archive_t0_5_market_heat.py scripts/build_analysis_dataset.py scripts/update_sync_state.py - Python sqlite3 PRAGMA integrity_check returned ok and foreign_key_check returned zero rows. - Confirmed 5 ipo_market_heat rows and 5 t0_5_market_heat source_refs for the frozen snapshot. - git diff --cached --check Next useful context: - T0.5 data is non-official and should be resampled during the subscription window, then compared against T1 official allotment results.	2026-06-15 15:44:32 +00:00
geometrybase	a2ec016769	Add selected T0 horizontal IPO report Request: - Combine the currently selected T0 IPO reports into one cross-sectional analysis report. Changes: - Add a Chinese horizontal T0 report comparing 01392, 02335, 06067, 06106, and 06132. - Rank the selected IPOs by the current T0 model and short-exit discipline focused on T2/D1 selling. - Backfill 02335's Chinese company name from its Chinese HKEX prospectus and archive the source PDF plus extracted text. - Refresh the v0 analysis dataset and sync-state snapshots at 2026-06-15T18:20:00Z. Verification: - .venv/bin/python -m py_compile scripts/build_analysis_dataset.py scripts/generate_ipo_report.py scripts/extract_pdf_text.py scripts/update_sync_state.py - Python sqlite3 PRAGMA integrity_check returned ok and foreign_key_check returned zero rows. - Confirmed 02335 Chinese source_ref, extracted text manifest row, and selected horizontal report content. - git diff --cached --check Next useful context: - Untracked PDF exports of individual reports and the horizontal report were left out of this focused commit.	2026-06-15 15:17:06 +00:00
geometrybase	797bbde201	Prefer Chinese company names in IPO reports Request: - Update the selected analyst reports so stock/company names include Chinese names and use Chinese names first. Changes: - Updated the selected T0 reports for 01392, 06067, 06106, and 06132 to show Chinese company names in the title and summary, with English names in parentheses. - Added company_name_zh to the analyst dataset so report generation has access to Chinese names. - Updated the report generator to prefer Chinese company names and fall back to English names only when Chinese names are unavailable. - Filled Chinese company names for the selected tickers in ipo_master and refreshed snapshots. Verification: - Compiled build_analysis_dataset.py and generate_ipo_report.py. - Ran generator dry-runs for 06132 and 01392 to confirm Chinese-first output. - Ran SQLite integrity_check and foreign_key_check. - Ran git diff --check. Next useful context: - Future generated analyst reports now use company_name_zh first when available.	2026-06-15 15:11:15 +00:00
geometrybase	fcb795b583	Add 02335 T0 analyst report Request: - Generate an analyst report for HK IPO ticker 02335. Changes: - Archived the official HKEXnews 02335 prospectus PDF and extracted text under project-relative data paths. - Seeded 02335 T0 prospectus facts, source references, sync state, and analysis snapshots. - Generated reports/2026-06-15_02335_T0_prospectus_analysis.md in Simplified Chinese with concrete T0/T1/T2/D1 dates and short-exit T2/D1 discipline. - Made PDF text extraction tolerant of invalid Unicode surrogate characters emitted by pypdf. Verification: - Compiled archive_hkex_documents.py, generate_ipo_report.py, build_analysis_dataset.py, extract_pdf_text.py, and update_sync_state.py. - Ran SQLite integrity_check and foreign_key_check. - Verified the archived 02335 PDF hash, extracted-text manifest row, and analysis dataset row. - Ran git diff --check. Next useful context: - 02335 is currently T0_prospectus; T1_allotment is pending for 2026-06-23.	2026-06-15 15:07:44 +00:00
geometrybase	42c18131e8	Add 06067 T0 analyst report Request: - Generate an analyst report for HK IPO ticker 06067. Changes: - Archived the official HKEXnews 06067 prospectus PDF and extracted text under project-relative data paths. - Seeded 06067 T0 prospectus facts, source references, sync state, and analysis snapshots. - Generated reports/2026-06-15_06067_T0_prospectus_analysis.md in Simplified Chinese with concrete T0/T1/T2/D1 dates and short-exit T2/D1 discipline. - Updated the HKEX document archiver so over-allotment shares are only recorded when the prospectus supports them, with explicit no-option cases stored as zero. Verification: - Compiled archive_hkex_documents.py, generate_ipo_report.py, build_analysis_dataset.py, extract_pdf_text.py, and update_sync_state.py. - Ran SQLite integrity_check and foreign_key_check. - Verified the archived 06067 PDF hash, extracted-text manifest row, and analysis dataset row. - Ran git diff --check. Next useful context: - 06067 is currently T0_prospectus; T1_allotment is pending for 2026-06-22.	2026-06-15 15:03:07 +00:00
geometrybase	77b405e4f3	Add T0 analyst reports for active IPOs Request: - Analyze HK IPO ticker 01392 with the analyst skill. - Preserve the in-flight 06132 archive/report work already created for the prior request. Changes: - Archived official HKEX prospectus PDFs and extracted text for 01392 and 06132. - Seeded structured T0 facts into the SQLite archive and refreshed CSV snapshots and sync state. - Rebuilt the v0 analysis dataset and model calibration report. - Generated Simplified Chinese T0 prospectus-stage analyst reports for 01392 and 06132. - Adjusted report stage calendars so T2 uses the previous business day before D1 when listing is separated from allocation by a weekend. Verification: - Compiled modified Python scripts with in-memory syntax checks. - Ran SQLite quick_check and foreign_key_check. - Confirmed DB row counts match CSV snapshots for key tables. - Verified 01392/06132 source paths are repo-relative, raw files exist, hashes match, and PDF text manifest rows are ok. - Ran git diff --cached --check. Next useful context: - 01392 T1 is due on 2026-06-18; rerun analyst after allotment results are archived. - 06132 T1 is due on 2026-06-22; rerun analyst after allotment results are archived.	2026-06-15 14:51:44 +00:00
geometrybase	907e30d9da	Use Chinese for analyst reports Request: - Make analyst reports Chinese by default and record the rule in the analyst skill. Changes: - Add a Simplified Chinese default-language rule to the analyst skill. - Update the single-IPO report generator to emit Chinese Markdown sections, labels, actions, risks, triggers, and exit plans. - Preserve ticker symbols, stage codes, rule ids, score buckets, and source paths as machine-readable identifiers. - Regenerate the 06106 T0 report in Chinese. - Document the Chinese report default in README and the rule change log. Verification: - Ran py_compile for scripts/generate_ipo_report.py. - Generated a 06106 dry-run report and checked Chinese section headings. - Regenerated reports/2026-06-15_06106_T0_prospectus_analysis.md. - Ran git diff --check. Next useful context: - Future analyst prediction and review reports should be written in Simplified Chinese unless the user explicitly requests another language.	2026-06-15 14:37:46 +00:00
geometrybase	07d7a0064a	Add concrete IPO stage dates to reports Request: - Include the concrete T0, T1, T2, and D1 dates in every analyst report. Changes: - Add a Stage Calendar section to the single-IPO report generator. - Require analyst reports to include ticker-specific T0 subscription window, T1 allotment-result date, T2 grey-market date/window, and D1 listing date. - Update the 06106 T0 report with its concrete stage dates. - Document the requirement in the analyst skill, README, and rule change log. Verification: - Ran py_compile for scripts/generate_ipo_report.py. - Generated a 06106 dry-run report and checked the stage calendar. - Ran git diff --check. Next useful context: - For 06106, T0 is 2026-06-15 to 2026-06-18, T1/T2 is 2026-06-23, and D1 is 2026-06-24.	2026-06-15 14:24:06 +00:00
geometrybase	29ed22e476	Clarify IPO short-exit strategy horizon Request: - Emphasize that the analyst model is for selling allocated IPO shares in T2 grey market or on D1, not for long-term holding. Changes: - Add explicit T2/D1 sell discipline to the analyst skill. - Update ipo_score_v0 targets and holding policy so D1 sell return is primary and T2 is the intended extension when reliable grey-market data exists. - Clarify that D5/D20/D60 are review labels only, not planned holding targets. - Update the model report, single-ticker report generator, README, and the 06106 report language to reflect the short-exit horizon. Verification: - Rebuilt the model report with the same dataset timestamp and confirmed the analysis dataset did not change. - Ran py_compile for build_analysis_dataset.py and generate_ipo_report.py. - Generated a 06106 dry-run report showing T2/D1 exit discipline. - Ran git diff --check. Next useful context: - T2 is still disabled in v0 until archivist approves a reliable grey-market data source; D1 remains the live modeled sell label.	2026-06-15 14:20:56 +00:00
geometrybase	bd5a06465d	Add 06106 T0 analyst report Request: - Use the analyst skill to analyze upcoming IPO 06106 and generate a Markdown report. Changes: - Add a stage-safe T0_prospectus analyst report for 06106. - Record the v0 score, calibrated historical probability, key T0 positives, risks, triggers, and source path. Verification: - Confirmed 06106 has no structured T1 demand yet, so the report is T0_prospectus. - Reviewed the report for stage safety and repo-relative source paths. - Ran git diff --check. Next useful context: - Re-run analyst after 06106 allotment results are archived around 2026-06-23 to generate a T1_allotment report.	2026-06-15 14:13:27 +00:00
geometrybase	58ad869f84	Refresh IPO analysis model calibration Request: - Re-analyze the IPO model using the updated historical archive after T1 demand backfill. Changes: - Regenerate the v0 analysis dataset from the current SQLite archive. - Refresh the v0 calibration report with expanded T1 coverage and new empirical bucket rates. - Update the report template to show pending T1 rows and field-level blanks. - Clarify v0 limitations and record why the score formula stays unchanged for this refresh. Verification: - Ran scripts/build_analysis_dataset.py against data/hk_ipo.sqlite. - Ran py_compile for scripts/build_analysis_dataset.py. - Checked dataset row count, T1 demand coverage, source-only T1 gaps, and repo-relative paths. - Ran git diff --check. Next useful context: - T1 structured coverage is now 291 rows, with 06106 and 06675 still pending_not_due. - The high-conviction T1 bucket remains differentiated, but middle and low buckets are still not monotonic enough for a v1 rule change.	2026-06-15 14:05:34 +00:00
geometrybase	6d05056609	Backfill structured T1 demand from archived text Request: - Use archivist to close the 137 T1 ipo_demand source-only gaps using extracted PDF text. Changes: - Add an incremental T1 demand text backfill script. - Parse existing allotment-result extracted text into ipo_demand. - Archive linked Summary PDFs from old HKEX HTML allotment-result pages. - Correct allotment-result selection to prefer primary result announcements over clarification or supplemental notices. - Add robust line-aware allotment parsing and document the workflow in archivist and README. - Record the backfill result in a report. Execution: - Selected 137 source-only T1 demand gaps. - Wrote 137 ipo_demand rows, increasing ipo_demand from 154 to 291 rows. - Archived 38 new HKEX allotment-result PDFs and extracted their text. - Confirmed an incremental rerun selects 0 gaps and writes 0 rows. Verification: - Ran git diff --cached --check. - Ran py_compile for archive_hkex_documents.py and backfill_t1_demand_from_text.py. - Checked SQLite integrity and foreign keys. - Confirmed DB row counts match CSV snapshots. - Verified no T1 complete row is missing ipo_demand. - Verified source_refs paths/files/hashes and PDF extracted-text manifest hashes. Next useful context: - T1 demand structure is complete for listed rows; 06106 and 06675 remain pending_not_due. - T2 grey-market and due price-performance gaps remain separate archivist priorities. - Analyst output should be regenerated before using the new T1 demand facts for scoring.	2026-06-15 13:59:06 +00:00
geometrybase	33d0bc056e	Tighten historical data audit coverage Request: - Use the audit skill to check historical data completeness and self-correct the audit criteria after the missed PDF extracted-text gap. Changes: - Add a mandatory derived-evidence checklist to the audit skill. - Require broad historical audits to reconcile PDF source_refs, extracted text files, manifest rows, and hashes. - Add a historical data completeness audit report for the current archive. Findings: - Source integrity and PDF extracted-text completeness now pass. - Full historical completeness still fails due to incomplete structured T1 demand, unresolved T2 grey-market data, open due price-performance tasks, and missing context fields. Verification: - Ran SQLite integrity, foreign-key, source hash, snapshot, PDF manifest, extracted-text hash, stage coverage, and analysis-dataset checks. - Ran scripts/extract_pdf_text.py and confirmed 557 PDF sources were skipped unchanged with 557 manifest rows. - Ran git diff --check.	2026-06-15 13:43:22 +00:00
geometrybase	48b89552fe	Add IPO analysis model baseline Request: - Use the analyst skill to digest downloaded IPO archive data and start building an analysis model. Changes: - Add ipo_score_v0 as the first transparent stage-safe scoring rule set. - Add build_analysis_dataset.py to derive model features, scores, decision bands, and empirical D1 calibration from SQLite. - Generate analysis_model_v0_dataset.csv with 293 scored IPO rows and archived source paths. - Add a model calibration report documenting coverage, T0/T1 bucket performance, usage, and known gaps. - Record the initial model entry in the rule change log and document the command in README. Verification: - Ran py_compile for scripts/build_analysis_dataset.py. - Regenerated the analysis dataset and report with as-of 2026-06-15T13:00:00Z. - Checked CSV row count, source path coverage, and repo-relative path hygiene. - Ran git diff --cached --check. Next useful context: - v0 should be treated as a transparent baseline, with T1 high-score calibration strongest and middle buckets still non-monotonic. - T2 is excluded until a reliable grey-market source is approved.	2026-06-15 12:49:48 +00:00

15 Commits
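
Several commits above list "Python sqlite3 PRAGMA integrity_check returned ok and foreign_key_check returned zero rows" under Verification. A minimal sketch of what such a check might look like follows. The helper name `check_sqlite_archive` is hypothetical and not taken from the repository; only the `data/hk_ipo.sqlite` path in the usage note appears in the log.

```python
import sqlite3


def check_sqlite_archive(conn: sqlite3.Connection) -> tuple[bool, list[tuple]]:
    """Run the two PRAGMA checks named in the commit messages.

    Hypothetical helper (not from the repository). Returns a pair:
    whether integrity_check reported only "ok", and the list of rows
    returned by foreign_key_check (empty when there are no violations).
    """
    # integrity_check yields a single row containing "ok" when the file is sound;
    # otherwise it yields one row per problem found.
    integrity_rows = [row[0] for row in conn.execute("PRAGMA integrity_check")]
    # foreign_key_check yields one row per violating child row.
    fk_violations = conn.execute("PRAGMA foreign_key_check").fetchall()
    return integrity_rows == ["ok"], fk_violations


if __name__ == "__main__":
    # In-memory demo database, used here only so the sketch runs standalone.
    demo = sqlite3.connect(":memory:")
    integrity_ok, violations = check_sqlite_archive(demo)
    print("integrity ok:", integrity_ok)
    print("foreign key violations:", len(violations))
    demo.close()
```

Against the real archive, the same helper could be called with `sqlite3.connect("data/hk_ipo.sqlite")`, assuming the command is run from the repository root.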