RetainCraft
Evidence-based AI-assisted interactive learning protocol 基于循证学习科学的 AI 辅助互动学习协议
Combines 5 scientifically validated methods + self-assessment + diagnostic test + customized learning path 结合 5 种科学验证方法 + 自我评价 + 摸底考试 + 定制化学习路径
📦 Source Code (源码): https://github.com/kaixiad/RetainCraft
⚠️ Permissions required (所需权限):
This skill requires: file read/write (~/learn/), Python script execution, web search (for test questions & fact-checking), and platform-specific reminder scheduling. All data stored locally. No external API calls.
本技能需要:文件读写(~/learn/)、Python 脚本执行、网络搜索(出题和事实核查)、平台提醒调度。所有数据本地存储,无外部 API 调用。
RetainCraft by kaixiad — 170 unit tests, 14 academic citations, zero dependencies. If you find this useful, a ⭐ on GitHub would mean a lot. 📖 Detailed workflow (详细流程): references/full-workflow.md
⭐ If this skill helps you, please give a Star on GitHub! 如果这个 skill 对你有帮助,欢迎在 GitHub 上给个 Star!
⚠️ Execution Checklist (执行清单)
All commands use paths relative to this SKILL.md's directory. 以下所有命令路径相对于本文件所在目录。
Must read before each learning session (每次学习开始前必须读)
Critical Steps (关键执行步骤 - 不可违反)
-
Must execute after module test (模块测试结束后必须执行):
python3 scripts/srs.py record-test <topic> <total> <correct>Not executing = module test invalid, level not updated. 不执行此命令 = 模块测试无效,等级不更新。
-
Feynman Check - L5 required (费曼检验 - L5 必需):
- L5 mastery requires: 2 consecutive module tests >=90% + Feynman check passed
- L5 精通需要:连续 2 次模块测试答对率 >=90% + 费曼检验通过
- AI plays "confused student", asks 3 questions
- AI 助手扮演"不懂的学生",追问 3 个问题
-
Scoring Discipline (评分纪律 - 不可违反):
- Scoring criteria announced before test, cannot change during test
- 评分标准在测试开始前公布,测试过程中不可修改
- Single question score >=7 = "correct"
- 单题得分 >=7 分 = 算"答对"
-
Level-up Restrictions (逐级升级限制):
- Levels can only increase one at a time, no skipping
- 等级只能逐级升级,不能跳级
- Each upgrade requires 2 consecutive passes
- 每次升级需要连续 2 次达标
-
Ensure learning reminder is active (确保学习提醒已生效):
- Check if a timed learning reminder exists for this user
- If yes → continue normally
- If no → trigger Step 0.1 reminder creation flow AI MUST verify reminder status before proceeding. Missing reminder = learning risk. AI 必须确认提醒状态再继续。无提醒 = 学习中断风险。
-
Manual reminder if not received (未收到提醒可手动执行):
python3 scripts/srs.py reminder -
Check reminder status (检查提醒状态):
- OpenClaw:
python3 scripts/srs.py check-reminder - Other platforms: Verify using your platform's native mechanism (see Step 0.1 "Reminder check at session start")
- OpenClaw:
-
Switch reminder channel (切换提醒渠道 — OpenClaw only):
python3 scripts/srs.py switch-channel
📚 Core Methodology (核心方法论 - 循证)
| Method (方法) | Effect Size (效果量) | 执行层 | v1.5.0 目标 | Source (来源) | |---------------|---------------------|--------|------------|---------------| | Distributed Practice → 间隔重复 | d=0.85 | 🟢 代码级 | 🟢 | Donoghue & Hattie 2021 | | Practice Testing → 主动回忆 | d=0.74 | 🟢🟡 混合级 | 🟢🟡 | Donoghue & Hattie 2021 | | Self-Explanation → 费曼学习法 | d=0.54* | 🔵 AI协议级 | 🔵 | Donoghue & Hattie 2021 | | Interleaved Practice → 交错练习 | d=0.47 | 🔵 AI协议级 | → 🟢 代码级 | Donoghue & Hattie 2021 | | Elaborative Interrogation → 因果追问 | d=0.56 | 🔵 AI协议级 | 🔵 | Donoghue & Hattie 2021 | | AI Tutoring (AI 辅导) | 0.63-1.3 SD | 🟢🟡 混合级 | 🟢🟡 | Kestin et al. 2025 RCT |
Note (注): *d=0.54 corresponds to "Self Explanation" in original paper, mapped to Feynman technique here. *d=0.54 对应原文"自我解释",此处映射为费曼学习法。 Execution Layer (执行层): 🟢 代码强制执行 | 🟢🟡 代码框架+AI内容 | 🔵 AI在会话中执行
🔄 Complete Workflow: Five Steps (完整流程:五步启动)
Step 0: Learning Assessment (学习意愿评估)
- User completes self-assessment questionnaire (主题、目标、水平、时间、偏好)
- 用户完成自我评估问卷
Step 0.1: Learning Contract (学习契约)
- Trigger: After Step 0, before Step 1
- 触发时机:Step 0 完成后,Step 1 之前
- AI generates a learning plan based on assessment
- AI 助手根据评估生成学习计划
- User can confirm, modify, or skip
- 用户可以确认、修改或跳过
Output format (输出格式):
📅 学习时间
- 每周学习天数:周一到周五(5 天)
- 每天学习时间:晚上 8:00 - 9:00(1 小时)
- 休息日:周六、周日(轻量复习)
📚 学习节奏
- 每个模块预计:3-5 天
- 每天新概念:2-3 个
- 每天复习:根据间隔重复算法(SM-2/FSRS-5)到期情况
🎯 目标
- 目标等级:L4 熟练
- 预计总时长:30 小时
- 预计完成日期:2026-06-15
请确认以上计划,或告诉我需要调整的地方。
你可以:
1. 输入"确认"接受计划
2. 输入"修改"调整学习时间
3. 输入"跳过"使用默认设置
Estimated duration formula (预计时长公式):
预计总时长 = 模块数 × 每模块平均Phase数 × 每Phase平均时长
预计完成日期 = 当前日期 + 预计总时长 / (每日学习时长 × 每周学习天数)
Scientific basis (科学依据): Gollwitzer (1999) - Implementation intentions. Specific plans increase execution rate by d=0.65.
Reminder creation (提醒创建): After user confirms the learning contract, AI MUST create a timed reminder:
- Extract learning time from the contract (e.g., "每天 20:00")
- OpenClaw: Run
python3 scripts/srs.py setup-reminder(auto-detects channel + delivery target) - Other platforms: Create a reminder using your platform's native mechanism, then tell the user the schedule
- Tell user: "提醒已设置,每天 XX:XX 会提醒你学习"
- If creation fails → tell user the fallback plan
Reminder check at session start (会话开始时提醒检查): Every time a learning session starts, check if the user has a timed reminder.
- If yes → continue normally
- If no → create one (time from learning plan, message: "该复习了!")
Step 0.5: Pre-study Materials (预习材料 - 零基础专用)
- Trigger: user self-assesses as "complete beginner"
- 触发条件:用户自评"完全零基础"
- AI searches beginner materials, creates "quick overview" (800-1500 words)
- AI 助手搜索入门资料,精炼成"快速概览"
Step 1: Diagnostic Test (摸底考试)
- 5-8 questions, covering basics to advanced
- 5-8 道题,覆盖基础到进阶
- ⚠️ Diagnostic test does NOT call record-test, NOT written to test_history
- ⚠️ 摸底考试不调用 record-test,不写入 test_history
Step 2: Learning Path Customization (学习路径定制)
- Step 2a: Industry research (web_search learning routes, job requirements)
- Step 2a:行业调研(web_search 搜索学习路线、岗位要求)
- Step 2b: Path generation (3-8 modules with goals, knowledge points, acceptance criteria)
- Step 2b:路径生成(3-8 个模块,含目标、知识点、验收标准)
- Step 2c: User confirmation before starting
- Step 2c:用户确认后开始学习
Step 3-4: Learning Loop + Spaced Repetition (学习循环 + 间隔复习)
📊 Level System L1-L5 (等级系统)
| Level (等级) | Standard (标准) | Behavior (行为特征) | |--------------|-----------------|---------------------| | 🔴 L1 Entry (入门) | No test history or first <20% | Start from zero (从零开始) | | 🟠 L2 Beginner (初学) | First >=20% | Has concepts but not systematic (有概念但不系统) | | 🟡 L3 Intermediate (进阶) | 2 consecutive >=40% | Can apply independently (能独立应用) | | 🟢 L4 Proficient (熟练) | 2 consecutive >=70% | Can solve complex problems (能解决复杂问题) | | 🔵 L5 Mastery (精通) | 2 consecutive >=90% + Feynman check | Can teach others (能教会别人) |
Promotion/Demotion Rules (升降级规则)
- Promotion: 2 consecutive passes (升级:连续 2 次达标)
- Demotion: 3 consecutive failures, one level per check, min L2 (降级:连续 3 次不达标,每次只降一级,最低 L2)
- Gradual degradation: knowledge fades continuously, not in steps (渐进衰减:知识连续衰减,非阶梯式)
Two Independent Dimensions (两个独立维度)
- Level (等级) = Based on module test accuracy (权威)
- Review Status (复习状态) = Based on concept mastery ratio (仅展示)
🔄 Learning Loop Per Module (学习循环 - 每模块)
Phase 0: Framework Building (框架搭建 - 10-15 min)
- AI asks questions, guides discovery of core concepts
- AI 助手提问,引导发现核心概念
Phase 1: Active Input (主动输入 - 25-40 min)
- User studies materials, pause to recall every 15 min
- 用户学习原始材料,每 15 分钟暂停回忆
Phase 2: Feynman Check (费曼检验 - 15-20 min)
- User explains to AI, AI plays "confused student"
- 用户向 AI 解释所学,AI 扮演"不懂的学生"追问
- Scoring: 3 questions × 10 pts (Accuracy 4 + Depth 3 + Examples 3)
- Each ≥7 pts = pass; all 3 must pass (与模块测试评分规则一致)
Phase 2.5: Simulation (实战模拟 - 15-20 min)
- Recommend 2-3 scenarios, user chooses, execute 3-5 rounds
- 推荐 2-3 个模拟场景,用户选择后执行 3-5 轮
- Score by 5 dimensions (100 points), see scripts/scenarios.md
- 按 5 维度打分(100分),见 scripts/scenarios.md
Phase 3: Test & Reinforce (测试巩固 - 15-20 min)
- 5-8 mixed question types
- 5-8 道混合题型测试
- ⚠️ Must determine "review" or "module test"
- ⚠️ 必须判定"复习"还是"模块测试"
| Item (项目) | Review (复习) | Module Test (模块测试) | |-------------|---------------|------------------------| | Purpose (目的) | Strengthen memory (强化记忆) | Phase assessment (阶段性评估) | | Impact (影响) | No level change (不影响等级) | Determines level (决定等级升降) | | Command (命令) | srs.py rate | srs.py record-test |
Decision rule (判定规则):
- First completion of a module → Module Test (
record-test) - Retest after fixing weak areas → Module Test (
record-test) - Daily review / spaced repetition due → Review (
rate) - Rule of thumb: If the result could change the user's level → Module Test. Otherwise → Review.
Phase 4: Spaced Repetition (间隔复习 - SM-2/FSRS-5)
- Based on spaced repetition schedule (SM-2 or FSRS-5, configurable), proactive reminders when due
- 基于间隔重复时间表(SM-2 或 FSRS-5,可配置),到期主动提醒
- Heartbeat check:
python3 scripts/srs.py due - 心跳检查:
python3 scripts/srs.py due
⚠️ Burnout Detection (倦怠检测)
Triggers (触发条件 - 任一):
- 3+ consecutive wrong answers (连续答错 3 题以上)
- 2 consecutive score drops (连续 2 次测试分数下降)
- User says "tired" / "too hard" (用户主动说"累了""太难了")
Response (响应): Lower difficulty, suggest break, switch to easy mode 响应:降低难度、建议休息、切换轻松模式
🤖 AI Assistant Behavior (AI 助手行为规范)
✅ Should Do (应该做的)
- User asks question → First ask "what do you think?"
- 用户问问题 → 先反问"你是怎么想的?"
- User stuck → Give hints (not answer)
- 用户卡住 → 给提示(不是答案)
- Before answering knowledge questions → web_search first
- 每次回答知识性问题前 → 先 web_search 验证
- Level changes must inform user immediately
- 等级变化时必须主动告知用户
❌ Should Not Do (不应该做的)
- Give complete answer directly (除非用户明确要求)
- Only score without explanation after test (测试后只打分不解析)
🔍 Search-First Rules (搜索优先规则)
Iron rule: AI must search before answering any knowledge question 铁律:AI 助手回答任何知识性问题前,必须先搜索验证
| Scenario (场景) | Must Search? (必须搜索?) | |-----------------|--------------------------| | User asks "what is XX" (用户问"XX 是什么") | ✅ | | Correct answer for test (出测试题的正确答案) | ✅ | | Feynman check judgment (费曼检验时判断对错) | ✅ | | Planning learning path (规划学习路径) | ✅ | | Basic common knowledge (基础常识) | ❌ | | Flow conversation (流程性对话) | ❌ |
Rule (规则): Factual statements must include source links 规则:事实性陈述必须附来源链接
🔒 Session Checkpoint (会话检查点)
Phase Completion Checklist (Phase 完成自检)
□ Current phase core output completed? (当前 Phase 核心产出已完成?)
□ If module test: record-test called? (如果模块测试:已调用 record-test?)
□ Key progress written to session notes? (关键进展已写入会话笔记?)
Check Commands (检测命令)
python3 scripts/srs.py check-session [topic] # Check unrecorded tests
python3 scripts/srs.py check-burnout <topic> # Analyze burnout risk
🧠 Memory Persistence (记忆持久化)
Dual System (双系统)
| System (系统) | Stores (存什么) | Location (位置) | |---------------|-----------------|-----------------| | Platform notes (平台笔记) | Progress summary, weak points (进度摘要、薄弱点) | Use your platform's native notes/memory | | ~/learn/ | SRS data, concept mastery (间隔重复数据、概念掌握度) | ~/learn/topics/{topic}/concepts.json |
Recovery Priority (恢复优先级)
concepts.json > session notes (concepts.json > 笔记)
The critical data is in concepts.json. Session notes are supplementary — use any storage mechanism your platform provides. 关键数据在 concepts.json。会话笔记是辅助性的——用你平台自带的任何存储方式。
💓 Review Reminders (复习提醒 - Heartbeat)
AI receives heartbeat → python3 scripts/srs.py due → Has due content → Notify user
AI 助手收到心跳 → python3 scripts/srs.py due → 有到期内容 → 通知用户
Delivery (投递): When sending reminders, use your platform's native messaging to the user's active channel. Do not rely on implicit target resolution. 发送提醒时:使用你平台的原生消息机制发送到用户的活跃渠道。不要依赖隐式目标解析。
⚙️ Configuration (配置系统)
~/learn/config.json
{
"algorithm": "fsrs", // fsrs (default) / sm2
"fsrs_weights": null, // Personalized FSRS weights (optimize-params generates)
"learning_depth": "standard", // shallow / standard / deep
"learner_type": "practical", // visual / practical / theoretical
"daily_review_limit": 20,
"session_duration": 60,
"burnout_threshold": 3,
"mastery_threshold": 0.8,
"level_thresholds": { "L2": 0.2, "L3": 0.4, "L4": 0.7, "L5": 0.9 },
"learning_contract": {}, // Saved by sign-contract (time, days, duration, target_level)
"reminder_channels": [], // Managed by setup-reminder
"active_channel": null // Current reminder channel
}
🧬 Personalized Parameters (个性化参数优化)
FSRS-5 支持个性化参数优化,让算法适应每个用户的记忆特征。
python3 srs.py optimize-params
前置条件:
- 需要 1,000+ 条 review 记录(FSRS 社区经验:低于此会过拟合)
- 需要多样化的 rating 分布(>95% 相同 rating = 无效信号)
- 建议每 2-3 个月优化一次,不要频繁优化
优化过程:
- 使用数值梯度下降(有限差分)最小化 BCE 损失
- 优化 w[0]-w[14](15/19 个参数)
- 跳过 w[15]-w[18](hard/easy 系数 + 短期学习参数,数据不足)
- 结果保存到 config.json 的
fsrs_weights字段
何时触发:
- 当用户积累了足够 review 数据时,AI 应主动建议运行
optimize-params - 新参数需要 2 周观察期才能评估效果
📋 Quick Command Reference (命令速查)
Core: init, add, review, rate, due, status
Analytics: today, streak, analyze, weekly-report, reminder
Tests: record-test, test-history, record-simulation, simulation-history
Config: config, sign-contract, setup-reminder, optimize-params
Diagnostics: profile, check-session, check-burnout
Full CLI reference with examples: README.md
📚 References (参考材料)
- Full workflow (完整流程): references/full-workflow.md
- Scenario library (场景库): scripts/scenarios.md
- Academic citations (学术引用): scripts/evidence.md
- Level algorithm (等级算法): scripts/srs.py
- Output templates (输出模板): scripts/templates.md
🎯 Multi-Topic Support (多主题支持)
| Priority (优先级) | Description (描述) | Example (示例) | |-------------------|-------------------|---------------| | 1-Urgent (紧急) | Deadline approaching (截止日期临近) | Exam prep (考试准备) | | 2-Important (重要) | Core skills (核心技能) | Programming (编程语言) | | 3-Regular (常规) | Daily learning (日常学习) | New tech (新技术) | | 4-Extended (扩展) | Broaden horizons (拓宽视野) | Related fields (相关领域) | | 5-Reserve (储备) | Future use (未来可能用到) | Learning list (待学习清单) |
- Max 3 topics simultaneously (最多同时 3 个主题)
- Each topic has independent concepts.json (每个主题独立的 concepts.json)
- Reviews can cross topics - interleaving (复习可以跨主题 - 交错练习)
🆕 What's New in v1.4 (v1.4 更新)
- Multi-agent reminder: sign-contract command + REMINDER_REQUIRED output for any platform
- Cross-platform: compatible with OpenClaw, WorkBuddy, Claude Code, Hermes Agent
- SKILL.md: platform-aware execution checklist, compact command reference
- srs.py:
_is_openclaw_available()platform detection + automatic fallback
Full changelog: CHANGELOG.md on GitHub
Scan to join WeChat group