IR PDF Downloader

Use this skill to find and download investor relations (IR) PDF documents, not to extract financial metrics.

Scope

Find likely PDF URLs from an IR domain
Search the Wayback Machine for archived PDF links
Use SEC EDGAR only as a PDF discovery source
Download PDFs from known static URLs with appropriate HTTP request headers
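
The Wayback Machine item above can be illustrated with the public CDX API. This is a minimal sketch, not the logic of find_ir_pdf.py, which this document does not show. The function names build_cdx_query and parse_cdx_rows are hypothetical, and the choice of filters is an assumption about what a useful query looks like.

```python
from urllib.parse import urlencode

CDX_ENDPOINT = "https://web.archive.org/cdx/search/cdx"


def build_cdx_query(domain, year=None, limit=50):
    """Build a CDX query URL listing archived PDF captures for one host."""
    params = [
        ("url", domain),
        ("matchType", "host"),                    # every path on this host
        ("filter", "mimetype:application/pdf"),   # PDF captures only
        ("filter", "statuscode:200"),             # skip redirects and errors
        ("collapse", "urlkey"),                   # one row per distinct URL
        ("fl", "timestamp,original"),
        ("output", "json"),
        ("limit", str(limit)),
    ]
    if year is not None:
        params += [("from", str(year)), ("to", str(year))]
    return CDX_ENDPOINT + "?" + urlencode(params)


def parse_cdx_rows(rows):
    """Turn CDX JSON rows (the first row is the header) into replay URLs."""
    if not rows:
        return []
    header, *captures = rows
    ts_idx = header.index("timestamp")
    url_idx = header.index("original")
    # The id_ modifier asks the Wayback Machine for the original bytes,
    # without the replay toolbar or rewritten links.
    return [
        f"https://web.archive.org/web/{row[ts_idx]}id_/{row[url_idx]}"
        for row in captures
    ]
```

Fetching the query URL and decoding the response as JSON yields the rows that parse_cdx_rows expects. Each returned replay URL can then be treated like any other known PDF URL.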

Do not use this skill for

Revenue / net income / EPS analysis
SEC XBRL/companyfacts queries
Filing data normalization

Use sec-finance for those.

Core commands

# Download a known static PDF URL
python3 scripts/download_ir_pdf.py "https://ir.jd.com/static-files/..."

# Discover likely PDFs from an IR domain
python3 scripts/find_ir_pdf.py --domain ir.baidu.com --year 2024

# Search by company alias (uses shared issuer map when available)
python3 scripts/find_ir_pdf.py --company Alibaba

# Search only the Wayback Machine
python3 scripts/find_ir_pdf.py --domain ir.alibabagroup.com --sources wayback
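
For orientation, a direct download like the first command can be sketched with the standard library alone. This is an illustration under stated assumptions, not the contents of download_ir_pdf.py. The header values, the names build_request, is_pdf, and fetch_pdf, and the magic-byte check are all hypothetical choices.

```python
import urllib.request

# Assumed header set. The headers download_ir_pdf.py actually sends
# are not documented here.
ASSUMED_HEADERS = {
    "User-Agent": "Mozilla/5.0 (compatible; ir-pdf-sketch/0.1)",
    "Accept": "application/pdf,*/*;q=0.8",
}


def build_request(url):
    """Attach the assumed headers to a GET request for the given URL."""
    return urllib.request.Request(url, headers=ASSUMED_HEADERS)


def is_pdf(data):
    """A well-formed PDF begins with the bytes %PDF-."""
    return data[:5] == b"%PDF-"


def fetch_pdf(url, dest, timeout=30):
    """Download url to dest, rejecting responses that are not PDFs."""
    with urllib.request.urlopen(build_request(url), timeout=timeout) as resp:
        data = resp.read()
    if not is_pdf(data):
        # Some servers answer with an HTML error or consent page instead.
        raise ValueError(f"{url} did not return a PDF")
    with open(dest, "wb") as fh:
        fh.write(data)
    return dest
```

Checking the leading bytes rather than trusting the Content-Type header guards against saving an HTML error page under a .pdf name.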

Workflow

If the PDF URL is already known, use download_ir_pdf.py directly.
If only the company or IR domain is known, use find_ir_pdf.py to discover likely URLs.
If discovery succeeds, pass the resulting URL to download_ir_pdf.py.
If the user actually wants structured financial numbers rather than the document itself, stop and switch to sec-finance.
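
The decision steps above can be sketched as a small dispatcher that returns the next command to run. The function name next_command and its parameters are hypothetical. The script paths and flags are the ones shown under Core commands. Because this document does not describe how find_ir_pdf.py reports its results, the sketch stops at building the command and does not parse any output.

```python
def next_command(pdf_url=None, company=None, domain=None, year=None,
                 wants_metrics=False):
    """Return the argv of the next command, or None to hand off to sec-finance."""
    if wants_metrics:
        # Structured financial numbers are out of scope for this skill.
        return None
    if pdf_url:
        # URL already known: download directly.
        return ["python3", "scripts/download_ir_pdf.py", pdf_url]
    cmd = ["python3", "scripts/find_ir_pdf.py"]
    if domain:
        cmd += ["--domain", domain]
        if year is not None:
            # --year is only shown alongside --domain in this document.
            cmd += ["--year", str(year)]
    elif company:
        cmd += ["--company", company]
    else:
        raise ValueError("need a PDF URL, an IR domain, or a company name")
    return cmd
```

The returned list can be passed to subprocess.run(cmd, check=True). Once discovery yields a URL, calling next_command(pdf_url=...) produces the download step.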

Notes

Issuer hints for aliases, IR domains, and validated CIKs live in references/issuers.json.
Keep this skill focused on PDF discovery and download.
Do not duplicate structured-finance guidance here.
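
As an illustration of how an issuer map could resolve a company alias, here is a minimal lookup sketch. The schema of references/issuers.json is not given in this document, so the layout assumed below (a list of objects with aliases, ir_domain, and cik keys) is purely hypothetical, as are the names load_issuers and resolve_issuer.

```python
import json
from pathlib import Path


def load_issuers(path="references/issuers.json"):
    """Load the issuer map. The list-of-objects layout is an assumption."""
    return json.loads(Path(path).read_text(encoding="utf-8"))


def resolve_issuer(name, issuers):
    """Case-insensitive alias match. Returns the issuer entry or None."""
    needle = name.strip().lower()
    for entry in issuers:
        aliases = [alias.lower() for alias in entry.get("aliases", [])]
        if needle in aliases:
            return entry
    return None


# Hypothetical sample entry, not taken from the real file.
sample = [
    {"aliases": ["Example Corp", "EXMPL"],
     "ir_domain": "ir.example.com",
     "cik": "0000000000"},
]
match = resolve_issuer("  example corp ", sample)
```

If the real file uses a different layout, only the key names and the loop in resolve_issuer would need to change.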