browser-read
Extract readable text from an already-open browser page and return markdown, suitable for pages where web_fetch is blocked or missing auth context.
When to use
- `web_fetch` returned an error or empty content.
- The page requires authentication, cookies, or session state available only in the browser.
- You need text extraction from Twitter/X or LinkedIn timelines/articles where screenshot/OCR was previously used.
When NOT to use
- `web_fetch` already returns good markdown/text (faster and cheaper).
- The page is purely static and a normal fetch is sufficient.
Steps
- Navigate to the URL with `browser navigate`.
- Read the extraction script from `~/clawd/skills/browser-read/extract.js`.
- Run `browser act` with `kind=evaluate` and pass the script contents as `fn`.
- The script returns `{title, content, excerpt, byline, siteName, length}`, where `content` is markdown.
- If extraction fails or returns empty content, the script falls back to `document.body.innerText`.
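The fallback in the last step can be sketched roughly as follows. This is a minimal illustration, not the actual contents of `extract.js`: the function name `extractWithFallback` and the `extractMarkdown` callback are hypothetical, `doc` stands in for the page's `document`, and only a subset of the returned fields is shown.

```javascript
// Hypothetical sketch of the fallback step; the real extract.js may differ.
// `doc` stands in for the page's `document`; `extractMarkdown` is an assumed
// callback that returns a markdown string or throws.
function extractWithFallback(doc, extractMarkdown) {
  let content = "";
  try {
    content = extractMarkdown(doc) || "";
  } catch (err) {
    content = ""; // treat an extraction error the same as empty output
  }
  if (content.trim() === "") {
    // Fall back to the page's visible text.
    content = (doc.body && doc.body.innerText) || "";
  }
  // Subset of the documented fields, for brevity.
  return { title: doc.title || "", content, length: content.length };
}
```

Treating a thrown error and an empty result the same way keeps the caller's handling simple: it always receives an object with a `content` string.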
Example (tool calls)
{
  "action": "navigate",
  "targetId": "...",
  "url": "https://example.com"
}

{
  "action": "act",
  "targetId": "...",
  "kind": "evaluate",
  "fn": "(() => { ... return {title, content, excerpt, byline, siteName, length}; })()"
}
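For illustration, the two payloads shown above could be assembled programmatically along these lines. The helper names `loadExtractScript` and `buildBrowserReadCalls` are hypothetical, and the sketch assumes a Node.js environment with access to the skill directory; how the payloads are actually submitted depends on the browser tool.

```javascript
const fs = require("fs");
const os = require("os");
const path = require("path");

// Hypothetical helper: reads the extraction script as a string.
// Expands a leading "~" to the current user's home directory.
function loadExtractScript(scriptPath = "~/clawd/skills/browser-read/extract.js") {
  const resolved = scriptPath.startsWith("~")
    ? path.join(os.homedir(), scriptPath.slice(1))
    : scriptPath;
  return fs.readFileSync(resolved, "utf8");
}

// Hypothetical helper: builds the navigate and evaluate payloads in order.
function buildBrowserReadCalls(targetId, url, scriptSource) {
  return [
    { action: "navigate", targetId, url },
    { action: "act", targetId, kind: "evaluate", fn: scriptSource },
  ];
}
```

A caller would then pass `loadExtractScript()` as the third argument of `buildBrowserReadCalls`, so that the whole script is sent as the `fn` value.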
Notes
- `extract.js` is a self-contained IIFE, so it can be passed directly as the `fn` value to `browser act`.
- This is a lightweight extractor; it intentionally strips script/style/nav/header/footer/aside/cookie/ad elements before conversion.
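The stripping step mentioned in the notes might look something like the sketch below. The selector list is an assumption for illustration, especially the cookie and ad matches, and the name `stripNoise` is hypothetical; the selectors actually used by `extract.js` may differ.

```javascript
// Selectors are assumptions for illustration; the list in extract.js may differ.
const NOISE_SELECTORS = [
  "script", "style", "nav", "header", "footer", "aside",
  '[class*="cookie"]', // hypothetical cookie-banner match
  '[class*="advert"]', // hypothetical ad-container match
];

// Removes matching elements from `root` (for example a clone of document.body)
// and returns how many elements were removed.
function stripNoise(root) {
  const nodes = Array.from(root.querySelectorAll(NOISE_SELECTORS.join(",")));
  nodes.forEach((node) => node.remove());
  return nodes.length;
}
```

In a page context, running this on `document.body.cloneNode(true)` rather than on the live body would leave the visible page unchanged.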