Category: Development and Engineering (no API key required)

visit-webpage

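Visits a web page and extracts its content as markdown, or fetches an image. Use it to read articles, documentation, or any other web content. Handles HTML pages (via Jina Reader) and image URLs (downloaded and saved locally).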

Author: jakexiaohub (GitHub)

Visit Webpage

Fetch and extract readable content from web pages as markdown, or download images. Handles JavaScript-rendered content via the Jina Reader service.

Setup

Optionally get a Jina API key for higher rate limits:

  1. Create an account at https://jina.ai/
  2. Get your API key from the dashboard
  3. Add to your shell profile (~/.profile or ~/.zprofile for zsh):
    export JINA_API_KEY="your-api-key-here"
    

Without an API key, the service still works, but with stricter rate limits.
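
The source does not show the internals of visit.py, so the following is only a minimal sketch of how an optional JINA_API_KEY might be picked up and attached to a Jina Reader request. The function name build_reader_request and the env parameter are hypothetical. The https://r.jina.ai/ prefix and the Bearer authorization header follow Jina Reader's public usage.

```python
import os
import urllib.request

# Jina Reader is called by prefixing the target URL with this endpoint.
JINA_READER_PREFIX = "https://r.jina.ai/"


def build_reader_request(url, env=None):
    """Build a request for Jina Reader, adding the API key only if one is set.

    `env` is a hypothetical parameter (defaults to os.environ) that makes the
    function easy to exercise without touching the real environment.
    """
    env = os.environ if env is None else env
    headers = {}
    api_key = env.get("JINA_API_KEY")
    if api_key:
        # With a key, requests are authenticated and get higher rate limits.
        headers["Authorization"] = f"Bearer {api_key}"
    return urllib.request.Request(JINA_READER_PREFIX + url, headers=headers)
```

With no key in the environment, the same request is sent without an Authorization header, which matches the anonymous, rate-limited mode described above.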

Usage

{baseDir}/visit.py <url>

Examples

# Read an article (returns markdown)
{baseDir}/visit.py https://example.com/article

# Fetch documentation
{baseDir}/visit.py https://docs.python.org/3/library/asyncio.html

# Download an image (auto-detected by content-type)
{baseDir}/visit.py https://example.com/image.png
# Then use the read tool to view it: read /tmp/visit-image-xxx.png

Output

For HTML pages: Returns markdown content to stdout.

For images: Downloads the image to a temp file and prints the path. Use the read tool to view it. Supports PNG, JPEG, GIF, and WebP formats.
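
The split between markdown output and image download could be driven by the response's Content-Type, roughly as sketched here. This is an illustration under assumptions, not the script's actual code: the helper names image_extension and save_image and the MIME-to-extension table are hypothetical, and the visit-image- prefix is borrowed from the example path shown earlier.

```python
import os
import tempfile

# The four image formats the skill says it supports, keyed by MIME type.
IMAGE_EXTENSIONS = {
    "image/png": ".png",
    "image/jpeg": ".jpg",
    "image/gif": ".gif",
    "image/webp": ".webp",
}


def image_extension(content_type):
    """Return the file extension for a supported image type, else None."""
    # Drop parameters such as "; charset=utf-8" before the lookup.
    mime = content_type.split(";", 1)[0].strip().lower()
    return IMAGE_EXTENSIONS.get(mime)


def save_image(data, content_type):
    """Write image bytes to a temp file and return its path."""
    ext = image_extension(content_type)
    if ext is None:
        raise ValueError(f"unsupported image type: {content_type}")
    fd, path = tempfile.mkstemp(prefix="visit-image-", suffix=ext)
    with os.fdopen(fd, "wb") as handle:
        handle.write(data)
    return path
```

A caller would print the returned path for image responses and fall back to the markdown path whenever image_extension returns None.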

Features

  • Extracts main content from HTML pages
  • Converts HTML to clean markdown
  • Handles JavaScript-rendered pages via Jina Reader
  • Auto-detects and downloads images to temp files
  • Retries on rate limiting (HTTP 451)
  • 5MB max image size limit
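
The retry and size-limit behaviour listed above might look something like the sketch below. The status code 451 and the 5MB cap are taken from the list. Everything else is an assumption made for illustration: the function names, the attempt count, the exponential backoff delays, and reading 5MB as 5 * 1024 * 1024 bytes.

```python
import time

RETRY_STATUS = 451                  # status the feature list associates with rate limiting
MAX_IMAGE_BYTES = 5 * 1024 * 1024   # 5MB cap, interpreted here as 5 MiB (assumption)


def fetch_with_retry(fetch, url, max_attempts=3, base_delay=1.0, sleep=time.sleep):
    """Call fetch(url) -> (status, body), retrying while rate limited.

    `fetch` is a hypothetical callable supplied by the caller. The delay
    doubles after each rate-limited attempt (1s, 2s, ...).
    """
    for attempt in range(max_attempts):
        status, body = fetch(url)
        if status != RETRY_STATUS:
            return status, body
        if attempt < max_attempts - 1:
            sleep(base_delay * 2 ** attempt)
    # Still rate limited after the last attempt: hand the response back.
    return status, body


def check_image_size(data):
    """Reject image payloads larger than the configured cap."""
    if len(data) > MAX_IMAGE_BYTES:
        raise ValueError(f"image exceeds {MAX_IMAGE_BYTES} bytes")
    return data
```

Passing sleep as a parameter keeps the backoff testable without real waiting. A production version would probably also honour any retry hint the server sends.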

When to Use

  • Reading articles, blog posts, or documentation
  • Extracting content from search results
  • Downloading images from URLs (then use read to view)
  • Following links found during web search