Visit Webpage
Fetch and extract readable content from web pages as markdown, or download images. Handles JavaScript-rendered content via the Jina Reader service.
Setup
Optionally get a Jina API key for higher rate limits:
- Create an account at https://jina.ai/
- Get your API key from the dashboard
- Add it to your shell profile (~/.profile, or ~/.zprofile for zsh):
  export JINA_API_KEY="your-api-key-here"
Without an API key, the service still works but is subject to lower rate limits.
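As a rough illustration of how the optional key could be picked up, the sketch below builds a request for the public Jina Reader endpoint (https://r.jina.ai/) and attaches the key as a bearer token only when JINA_API_KEY is set. The function name build_reader_request is hypothetical, and this is not necessarily how visit.py does it internally.

```python
import os
import urllib.request

# Public Jina Reader endpoint: the target URL is appended to this prefix.
JINA_READER_PREFIX = "https://r.jina.ai/"


def build_reader_request(url: str) -> urllib.request.Request:
    """Build a Jina Reader request (hypothetical helper, not taken from visit.py)."""
    headers = {}
    api_key = os.environ.get("JINA_API_KEY")
    if api_key:
        # With a key, requests are authenticated and get higher rate limits.
        headers["Authorization"] = f"Bearer {api_key}"
    return urllib.request.Request(JINA_READER_PREFIX + url, headers=headers)
```

Leaving the header out when no key is set keeps the anonymous, rate-limited path working unchanged.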
Usage
{baseDir}/visit.py <url>
Examples
# Read an article (returns markdown)
{baseDir}/visit.py https://example.com/article
# Fetch documentation
{baseDir}/visit.py https://docs.python.org/3/library/asyncio.html
# Download an image (auto-detected by content-type)
{baseDir}/visit.py https://example.com/image.png
# Then use the read tool to view it: read /tmp/visit-image-xxx.png
Output
For HTML pages: Returns markdown content to stdout.
For images: Downloads the image to a temp file and prints the path. Use the read tool to view it. Supports PNG, JPEG, GIF, and WebP formats.
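The image branch described above can be sketched as follows, assuming detection is based on the response's Content-Type header and that temp files use the visit-image- prefix seen in the example path. The helper names image_extension and save_image are hypothetical, and the real script may differ.

```python
import tempfile
from typing import Optional

# Supported image content types mapped to file extensions.
IMAGE_EXTENSIONS = {
    "image/png": ".png",
    "image/jpeg": ".jpg",
    "image/gif": ".gif",
    "image/webp": ".webp",
}


def image_extension(content_type: str) -> Optional[str]:
    """Return a file extension for a supported image type, else None."""
    # Drop parameters such as "; charset=..." before the lookup.
    mime = content_type.split(";", 1)[0].strip().lower()
    return IMAGE_EXTENSIONS.get(mime)


def save_image(data: bytes, extension: str) -> str:
    """Write image bytes to a temp file and return its path."""
    with tempfile.NamedTemporaryFile(
        prefix="visit-image-", suffix=extension, delete=False
    ) as handle:
        handle.write(data)
        return handle.name
```

A caller would print the returned path for image responses and fall back to the markdown path when image_extension returns None.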
Features
- Extracts main content from HTML pages
- Converts HTML to clean markdown
- Handles JavaScript-rendered pages via Jina Reader
- Auto-detects and downloads images to temp files
- Retries on rate limiting (HTTP 451)
- 5 MB maximum image size
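The retry and size-limit behaviour in the list above might look roughly like this. It is a minimal sketch under stated assumptions: the attempt count, the exponential backoff delays, and the reading of 5 MB as 5 * 1024 * 1024 bytes are all guesses, and fetch_with_retry and check_image_size are hypothetical names. The retried status code is the one the feature list gives.

```python
import time

MAX_IMAGE_BYTES = 5 * 1024 * 1024  # assumed interpretation of the 5 MB limit
RETRY_STATUS = 451  # status code the feature list names for rate limiting


def fetch_with_retry(fetch, max_attempts=3, base_delay=1.0, sleep=time.sleep):
    """Call fetch() -> (status, body), backing off while RETRY_STATUS is returned."""
    for attempt in range(max_attempts):
        status, body = fetch()
        if status != RETRY_STATUS:
            return status, body
        if attempt < max_attempts - 1:
            # Exponential backoff: base_delay, then 2x, then 4x, ...
            sleep(base_delay * 2 ** attempt)
    # Still rate limited after the final attempt: return the last response.
    return status, body


def check_image_size(data: bytes) -> None:
    """Raise ValueError if the image is larger than the assumed limit."""
    if len(data) > MAX_IMAGE_BYTES:
        raise ValueError(f"image is {len(data)} bytes, limit is {MAX_IMAGE_BYTES}")
```

Passing sleep as a parameter keeps the retry loop easy to exercise without real delays.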
When to Use
- Reading articles, blog posts, or documentation
- Extracting content from search results
- Downloading images from URLs (then use read to view)
- Following links found during web search