ocr-service
High-precision Optical Character Recognition (OCR) service. It supports text detection and extraction from multi-language, multi-format images, and provides the coordinates of text areas and confidenc…
Browse curated skills with source links, package snapshots, README assets and install signals in one calm, searchable catalog.
High-precision Optical Character Recognition (OCR) service. It supports text detection and extraction from multi-language, multi-format images, and provides the coordinates of text areas and confidenc…
Store and recall sequential memory patterns and state transitions.
利用轻现AI提供的接口一句话就可以免费生成讲解类视频,自动生成脚本,自动和成语音,自动渲染视频
Text-to-speech conversion using `uvx edge-tts` for generating audio from text. Use when (1) User requests audio/voice output with the "tts" trigger or keyword. (2) Content needs to be spoken rather th…
[For In-depth Research Only] This skill must be used when systematic research, multi-product comparison, in-depth analysis, technology selection, and trend analysis are required. Trigger keywords: res…
A multi-agent collaborative product video creation pipeline, supporting the entire process from product information to video creation: copywriting, story planning, scriptwriting, storyboard design, im…
Download and transcribe videos from YouTube, Bilibili, TikTok and 1000+ platforms. Use when user requests video download, transcription (转录/字幕提取), or converting video to text/markdown. Supports qualit…
Knowledge comic creator supporting multiple styles (Logicomix/Ligne Claire, Ohmsha manga guide). Creates original educational comics with detailed panel layouts and sequential image generation. Use wh…
Reproduce research papers into working code. Use when user wants to implement ML/AI papers, reproduce experiments, extract algorithms from PDFs, or convert research into executable code. Handles multi…
AI video generation prompting guide for Sora 2 and Higgsfield.ai
Expert guidance for data analysis, data science, and machine learning projects. Covers Python data tools, SQL/databases, visualization, statistics, ML/AI, data engineering, and MLOps. Use when working…
Manage knowledge graph for autonomous coding. Use when storing relationships, querying connected knowledge, building project understanding, or maintaining semantic memory.
Process large document corpora (1000+ docs, millions of tokens) through knowledge graph construction and stateful multi-hop reasoning. Use when (1) User provides a large corpus exceeding context limit…
Based on the images and creative requirements provided by the user, generate a professional Jimo Seedance 2.0 video script prompt. This integrates film theory (shot size, camera movement, composition,…
FFmpeg automation for cutting, trimming, concatenating videos. Audio mixing, timeline editing, transitions, effects. Export optimization for YouTube, social media. Subtitle handling, color grading, ba…
Web content extraction via Jina AI Reader API. Three modes: read (URL to markdown), search (web search + full content), ground (fact-checking). Extracts clean content without exposing server IP.
Comprehensive scientific research toolkit with 139 specialized skills for biology, chemistry, medicine, data science, and computational research. Transforms Claude into an AI research assistant with a…
Assemble final video from generated clips, audio, and assets using FFmpeg or Remotion. Handles concatenation, audio mixing, transitions, titles, and export. Use when combining multiple production outp…
Execute autonomous multi-step research using Google Gemini Deep Research Agent. Use for: market analysis, competitive landscaping, literature reviews, technical research, due diligence. Takes 2-10 ...
Structure prediction using Boltz-1/Boltz-2, an open biomolecular structure predictor. Use this skill when: (1) Predicting protein complex structures, (2) Validating designed binders, (3) Need open-sou…
Generate draw.io editable diagrams (.drawio, .drawio.svg) from text, images, or Excel. Orchestrates 3-agent workflow (Analysis → Manifest → SVG generation) with quality gates. Use when creating archit…
Write effective prompts for Jimeng Seedance 2.0 multimodal AI video generation. Use when users want to create video prompts using text, images, videos, and audio inputs with the @ reference system. Co…
To be used when the user explicitly requests to 'conduct a systematic review/literature review/related work/relevant work/literature survey'. AI autonomously determines search terms, performs multi-so…
Interactive deep learning paper Introduction writing assistant. Helps users discover scientific narrative, innovation points, and contributions through multi-turn dialogue. Supports writing from scrat…