hk-ipo/data/snapshots/ipo_master.csv
geometrybase eae427d85b Add PDF text extraction workflow
Request:
- Provide a way to install or develop a PDF extraction tool for archived HK IPO documents.

Changes:
- Add requirements.txt with pypdf as the lightweight PDF text extraction dependency.
- Add scripts/extract_pdf_text.py to extract text from PDF source_refs into repo-relative data/extracted_text files.
- Add extracted text outputs and an extracted_text_manifest snapshot for the six archived HKEXnews PDFs.
- Document the extraction workflow in README.md.
- Ignore .venv and keep generated SQLite/Python transient files out of git.
- Use extracted text to verify the 06106 full prospectus, update source_refs, remove the related data gap, and fill 06106 offering terms.
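A minimal sketch of what the extraction step could look like with pypdf. This is not the contents of scripts/extract_pdf_text.py; the function names and the `.txt` naming scheme are assumptions made for illustration. Only `PdfReader`, `reader.pages`, and `page.extract_text()` are pypdf's own API.

```python
from pathlib import Path


def extract_pdf_text(pdf_path: Path) -> str:
    """Return the text of every page, joined by form feeds."""
    # Imported lazily so the path helper below works without pypdf installed.
    from pypdf import PdfReader

    reader = PdfReader(str(pdf_path))
    # extract_text() can yield an empty string for image-only pages.
    return "\f".join((page.extract_text() or "") for page in reader.pages)


def output_path_for(pdf_path: Path, out_dir: Path) -> Path:
    """Map a PDF path to a .txt path under out_dir (hypothetical naming)."""
    return out_dir / (pdf_path.stem + ".txt")


def extract_to_file(pdf_path: Path, out_dir: Path) -> Path:
    """Extract one PDF and write the text next to the other outputs."""
    out_path = output_path_for(pdf_path, out_dir)
    out_path.parent.mkdir(parents=True, exist_ok=True)
    out_path.write_text(extract_pdf_text(pdf_path), encoding="utf-8")
    return out_path
```

Passing a relative `out_dir` such as `Path("data/extracted_text")` keeps the returned paths repo-relative, which is what the change list asks for.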

Verification:
- Installed python3.14-venv system support, created a local .venv, and installed requirements.txt.
- Re-ran scripts/bootstrap_historical_data.py and scripts/extract_pdf_text.py.
- Verified extracted text paths and hashes against data/snapshots/extracted_text_manifest.csv.
- Verified SQLite integrity and snapshot row counts.
- Ran git diff --cached --check and searched durable files for machine-specific absolute paths.
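The path and hash check against the manifest might be sketched as follows. The column names `text_path` and `sha256` are assumptions, since the manifest schema is not shown here, and the choice of SHA-256 is likewise illustrative.

```python
import csv
import hashlib
from pathlib import Path


def sha256_of(path: Path) -> str:
    """Hex SHA-256 digest of a file's bytes."""
    return hashlib.sha256(path.read_bytes()).hexdigest()


def verify_manifest(manifest_path: Path, repo_root: Path) -> list[str]:
    """Return a list of problems; an empty list means every row matched."""
    problems = []
    with manifest_path.open(newline="", encoding="utf-8") as fh:
        for row in csv.DictReader(fh):
            rel = row["text_path"]  # assumed column name
            target = repo_root / rel
            if Path(rel).is_absolute():
                # Machine-specific absolute paths should not be committed.
                problems.append(f"absolute path: {rel}")
            elif not target.is_file():
                problems.append(f"missing: {rel}")
            elif sha256_of(target) != row["sha256"]:  # assumed column name
                problems.append(f"hash mismatch: {rel}")
    return problems
```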
2026-06-15 06:21:16 +00:00

1.1 KiB

ticker,company_name_en,company_name_zh,stock_short_name,exchange,board,status,listing_date,application_start_date,application_end_date,allotment_results_expected_date,industry_label,data_as_of,notes
06106,"Shanghai Seer Intelligent Technology Co., Ltd.",上海仙工智能科技股份有限公司,,HKEX,Main Board,open_for_subscription,2026-06-24,2026-06-15,2026-06-18,2026-06-23,Industrial intelligent robots / robot controllers,2026-06-15T06:15:00Z,Seeded from HKEXnews global offering announcement; full prospectus source classification needs follow-up.
06658,"Liuliumei Co., Ltd.",溜溜梅股份有限公司,LIULIUMEI,HKEX,Main Board,listed,2026-06-15,2026-06-05,2026-06-10,2026-06-12,Snack food / preserved fruit,2026-06-15T06:15:00Z,Seeded from HKEXnews prospectus and allotment results.
06675,"SENASIC Electronics Technology Co., Ltd.",琻捷電子科技(江蘇)股份有限公司,,HKEX,Main Board,pending_listing,2026-06-17,2026-06-09,2026-06-12,2026-06-16,Automotive wireless sensing SoC / semiconductors,2026-06-15T06:15:00Z,Seeded from HKEXnews prospectus and global offering announcement; allotment results not yet archived.
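
When reading this snapshot, two details are easy to get wrong: tickers such as 06658 carry a leading zero and should stay strings, and company names contain commas, so a plain split on "," is not safe. A small sketch using the standard csv module, with a sample abbreviated to four of the columns for illustration (`load_rows` is a hypothetical helper, not part of the repo):

```python
import csv
import io
from datetime import date

# Abbreviated sample: four of the snapshot's columns, one real row.
SAMPLE = (
    "ticker,company_name_en,status,listing_date\n"
    '06658,"Liuliumei Co., Ltd.",listed,2026-06-15\n'
)


def load_rows(text: str) -> list[dict]:
    """Parse snapshot CSV text, keeping tickers as strings."""
    rows = []
    for row in csv.DictReader(io.StringIO(text)):
        # DictReader leaves every field as str, so "06658" keeps its zero.
        row["listing_date"] = date.fromisoformat(row["listing_date"])
        rows.append(row)
    return rows


rows = load_rows(SAMPLE)
print(rows[0]["ticker"], rows[0]["company_name_en"], rows[0]["listing_date"])
```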