china-doc-ocr
Intelligent document OCR and structured extraction. Use when the user has a complex document, PDF, scanned image, photo, invoice, receipt, ID card, table, or chart that needs to be recognized a...
Browse curated skills with source links, package snapshots, README assets and install signals in one calm, searchable catalog.
Set up Spark Bitcoin L2 wallet capabilities for AI agents. Initialize wallets from mnemonic, transfer sats and tokens, create/pay Lightning invoices, pay L40...
Automate Jianying draft generation from local media plus a cloud editing API. Use when Codex needs to scan a material folder, request editing decisions, and...
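Urban travel service for Chinese cities. This skill must be used whenever the user expresses any transportation need, including hailing a taxi or ride-hailing car, checking prices, route planning (public transit/driving/walking/cycling), nearby search, and checking orders, tracking driver location, or canceling orders. Keywords: "打车" (hail a taxi), "叫车" (call a car), "去[地点]" (go to [place]), "回家" (go home), "上班" (go to work), "下班" (leave work), "查价格" (check the price), "多少钱" (how much), "路线" (route), "怎么走" (how to get there), "步行到" (walk to), "附近" (nearby), ...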
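Use when the user provides a Bilibili video link, BV ID, or b23.tv short link and wants to transcribe the video, extract its subtitles, or summarize or analyze its content. First check the Node.js environment and SILICONFLOW_API_KEY, then try the official subtitles; if there are none, obtain the anonymous audio URL, download the audio as .m4s, and rename it directly to .mp3 with no transcoding; when an API key is available, call SiliconFlow...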
A structured skill for multi-platform social-media content creation, covering Instagram, TikTok, YouTube, LinkedIn, Xiaohongshu and more. Outputs satisfy eac...
This skill is used to guide the AI in generating short video spoken scripts with high contrast, strong resonance, a sense of story, and personal IP attribute...
Enhances image resolution and clarity from a URL or local path, returning an improved image URL for low-res restoration and upscaling.
High-quality voice synthesis with 9 personas, 11 languages, and streaming using Voice.ai API.
Image matting tool: separates foreground from background and returns transparent background URL, suitable for product image processing, character cutout, and...
Play Babylon prediction markets - trade YES/NO shares, post to social feed, check portfolio and leaderboards. Use when interacting with Babylon (babylon.mark...
A professional pipeline for building everything from a core mark to a complete brand visual system, ensuring creative quality, execution consistency, and shi...
Provides data analysis and monitoring for sales, prices, reviews, keywords, and competitors across major Chinese ecommerce platforms.
A complete workflow skill for marketing brochure design, covering everything from requirements gathering, layout design, to mock-up delivery. It uses a 'layo...
Detects and splits videos into scene clips using ffmpeg, providing URLs for each clip without analyzing video content.
Audio/video merging tool for merging multiple video and audio tracks based on clips and timeline config, suitable for final cuts, scoring, and post-production.
Codex Pet generator on RunComfy. Build a Codex-compatible Codex Pet spritesheet.webp + pet.json from a single reference image, drop it into `${CODEX_HOME:-$H...
Generate images, videos, and audio/music via Lovart AI. Also manages Lovart projects, threads (conversation history), and user settings. Trigger on: (1) any...
1. Get the storyboard info
A professional storyboard skill for film, advertising, short video, and educational narrative scenarios, built around a strict 'plan first, render later' flow.
A professional product image generation skill purpose-built for the Amazon e-commerce platform. Outputs comply with Amazon's image guidelines while optimizin...
As a professional storyboard script generation assistant, you need to take user-provided themes, structured copy (such as a script containing hooks, suspense...
MinerU AI document parser — intelligent document extraction powered by AI. Parse PDFs, scanned documents, images, Word files, PowerPoint slides, and web page...
wechat-ai-publisher