返回 Skill 列表
extension
分类: 开发与工程无需 API Key

context-scrapers

网络爬虫(Scrapy)、解析器和源管理。

person作者: jakexiaohubgithub

Scrapers Context

Overview

Web scraping infrastructure using Scrapy. Handles data acquisition from city councils (Legistar) and municipal codes (Municode).

Active Files

Spiders

  • backend/affordabot_scraper/affordabot_scraper/spiders/sanjose_meetings.py - Legistar meeting scraper
  • backend/affordabot_scraper/affordabot_scraper/spiders/sanjose_municode.py - Municode scraper

Configuration

  • backend/affordabot_scraper/affordabot_scraper/pipelines.py - Item pipelines (DB storage)
  • backend/affordabot_scraper/affordabot_scraper/settings.py - Scrapy settings

Verification Scripts

  • scripts/verify_raw_scrapes.py - Check raw_scrapes table
  • scripts/verify_municode_discovery.py - Verify discovery logic

Usage

Use this skill when modifying scrapers, adding new jurisdictions, or debugging ingestion.