Firecrawl Web Scraping

Converts web pages into clean, LLM-ready markdown or structured data. Handles JavaScript rendering, anti-bot measures, and complex sites.

When to Use

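Use Firecrawl when you need to scrape a single page, search the web for relevant pages, map the URLs of a site, extract structured JSON from a page, or crawl multiple pages of a site.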

scripts/firecrawl.sh scrape "<url>" [format]

Fetches a single page and returns it in the requested format. Formats: markdown (default), html, links, or screenshot.

Example:

scripts/firecrawl.sh scrape "https://docs.firecrawl.dev/introduction"
scripts/firecrawl.sh scrape "https://example.com" "html"
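
Because the format is a free-form positional argument, a small wrapper can reject typos before any request is made. The sketch below is illustrative only: the scrape_checked function and the FIRECRAWL override variable are hypothetical names, not part of the script, and the accepted values are simply the four formats listed above.

```shell
# Path to the script; FIRECRAWL is a hypothetical override, handy for dry runs.
FIRECRAWL="${FIRECRAWL:-scripts/firecrawl.sh}"

# scrape_checked <url> [format]
# Rejects formats outside the documented list, then calls the scrape command.
scrape_checked() {
  local url="$1"
  local format="${2:-markdown}"   # markdown is the documented default

  case "$format" in
    markdown|html|links|screenshot) ;;
    *)
      echo "unsupported format: $format" >&2
      return 2
      ;;
  esac

  "$FIRECRAWL" scrape "$url" "$format"
}
```

Calling scrape_checked "https://example.com" html behaves like the second example above, while an unknown format such as pdf fails locally with exit status 2.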

scripts/firecrawl.sh search "<query>" [limit]

Searches the web for the query and returns up to [limit] results.

Example:

scripts/firecrawl.sh search "firecrawl web scraping API" 5
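
When recency matters, one option is to append the current year to the query automatically. This is a minimal sketch under stated assumptions: search_recent and the FIRECRAWL override variable are hypothetical names, and the year is taken from the system clock via date rather than hard-coded.

```shell
# Path to the script; FIRECRAWL is a hypothetical override, handy for dry runs.
FIRECRAWL="${FIRECRAWL:-scripts/firecrawl.sh}"

# search_recent <query> [limit]
# Appends the current year to the query, then calls the search command.
search_recent() {
  local query="$1"
  local year
  year="$(date +%Y)"

  # Pass the limit through only if the caller supplied one.
  "$FIRECRAWL" search "$query $year" ${2:+"$2"}
}
```

For example, search_recent "firecrawl web scraping API" 5 issues the same search as above with the year appended to the query. For timeless topics, call the search command directly instead.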

scripts/firecrawl.sh map "<url>" [limit] [search]

Discovers URLs on a site, up to [limit], optionally filtered by a [search] term.

Example:

scripts/firecrawl.sh map "https://firecrawl.dev" 50
scripts/firecrawl.sh map "https://docs.firecrawl.dev" 100 "api reference"

scripts/firecrawl.sh extract "<url>" "<prompt>"

Uses Firecrawl's LLM extraction to return structured JSON from a single page.

Example:

scripts/firecrawl.sh extract "https://firecrawl.dev" "Extract company name, mission, and pricing tiers"

scripts/firecrawl.sh crawl "<url>" [limit] [depth]

Crawls a site starting from the given URL, bounded by a page [limit] and a link [depth].

Example:

scripts/firecrawl.sh crawl "https://docs.firecrawl.dev" 20 2

Best Practices

Scrape for single pages - Use scrape when you have specific URLs
Map before crawl - Use map to discover URLs, then scrape specific ones
Search for discovery - Use search to find relevant pages when you don't know URLs
Extract for structure - Use extract when you need JSON, not markdown
Respect rate limits - The script automatically retries on HTTP 429 (Too Many Requests) responses, rotating keys between attempts
Current year is 2026 - Include the year in search queries when recency matters; omit it for timeless topics, or use an earlier year when historically relevant
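
The "map before crawl" tip can be sketched as a small pipeline: map the site, keep only the URLs of interest, and scrape each one. This is an illustration, not the script's own behavior. It assumes that map prints one URL per line, which this document does not confirm; if the output is JSON, the filtering step needs to change. The map_then_scrape function and the FIRECRAWL override variable are hypothetical names.

```shell
# Path to the script; FIRECRAWL is a hypothetical override, handy for dry runs.
FIRECRAWL="${FIRECRAWL:-scripts/firecrawl.sh}"

# map_then_scrape <site> <pattern> [limit]
# ASSUMPTION: `map` prints one URL per line; adapt the parsing if it emits JSON.
map_then_scrape() {
  local site="$1"
  local pattern="$2"        # extended regex used to select URLs
  local limit="${3:-50}"    # illustrative default, not the script's

  "$FIRECRAWL" map "$site" "$limit" |
    grep -E -- "$pattern" |
    while IFS= read -r url; do
      # Detach stdin so the scrape call cannot consume the URL list.
      "$FIRECRAWL" scrape "$url" < /dev/null
    done
}
```

For example, map_then_scrape "https://docs.firecrawl.dev" "/api-reference/" 100 would scrape only the mapped URLs whose path contains /api-reference/, which keeps the number of requests lower than a full crawl.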

See reference/troubleshooting.md for error handling, configuration, and common issues.