返回 Skill 列表
extension
分类: 内容与媒体无需 API Key

photo-composition-critic

基于研究生水平的视觉美学教育、计算美学研究(AVA、NIMA、LAION-Aesthetics、VisualQuality-R1)以及使用定制工具的专业图像分析的专家级摄影构图批评。可用于图像质量评估、构图分析、美学评分、照片批评。在“照片批评”、“构图分析”、“图像美学”、“NIMA”、“AVA数据集”、“视觉质量”上激活。不适用于照片编辑/润色(请使用原生应用设计师),生成图像(直接使用Stability AI),或基本图像处理(使用clip-aware-embeddings)。

person作者: jakexiaohubgithub

Photo Composition Critic

Expert photography critic with deep grounding in graduate-level visual aesthetics, computational aesthetics research, and professional image analysis.

When to Use This Skill

Use for:

  • Evaluating image composition quality
  • Aesthetic scoring with ML models (NIMA, LAION)
  • Photo critique with actionable feedback
  • Analyzing color harmony and visual balance
  • Comparing multiple crop options
  • Understanding photography theory

Do NOT use for:

  • Generating images → use Stability AI directly
  • Photo editing/retouching → use native-app-designer
  • Simple image similarity → use clip-aware-embeddings
  • Collage creation → use collage-layout-expert

MCP Integrations

| MCP | Purpose | |-----|---------| | Firecrawl | Research latest computational aesthetics papers | | Hugging Face (if configured) | Access NIMA, LAION aesthetic models |

Quick Reference

Compositional Frameworks

| Framework | Key Points | |-----------|------------| | Visual Weight | Size, color warmth, isolation, intrinsic interest, position | | Gestalt | Proximity, similarity, continuity, closure, figure-ground | | Dynamic Symmetry | Root rectangles (√2, √3, φ), baroque/sinister diagonals | | Arabesque | S-curve, spiral, diagonal thrust - eye flow through frame |

Color Harmony Types

| Type | Score | Notes | |------|-------|-------| | Complementary | 0.9 | High visual interest | | Monochromatic | 0.85 | Safe, cohesive | | Triadic | 0.85 | Balanced, vibrant | | Analogous | 0.8 | Natural, harmonious | | Achromatic | 0.7 | B&W or desaturated | | Complex | 0.6 | May be chaotic or intentional |

ML Model Score Interpretation

| Score Range | Meaning | |-------------|---------| | 7.0+ | Exceptional (top ~1%) | | 6.5+ | Great (top ~5%) | | 5.0-5.5 | Mediocre (most images) | | <5.0 | Below average |

Analysis Protocol

1. FIRST IMPRESSION (2 seconds)
   └── Where does the eye go? Emotional hit? Anything "off"?

2. TECHNICAL SCAN
   └── Exposure, focus, noise, color, artifacts

3. COMPOSITIONAL ANALYSIS
   └── Subject clarity, structure, balance, flow, depth, edges

4. AESTHETIC EVALUATION
   └── Light quality, color harmony, decisive moment, story

5. CONTEXTUAL ASSESSMENT
   └── Genre success, photographer intent, audience fit

6. ACTIONABLE RECOMMENDATIONS
   └── Specific improvements, post-processing, alt crops

Anti-Patterns

"Just use rule of thirds"

| What it looks like | Why it's wrong | |--------------------|----------------| | Blindly placing subjects on thirds intersections | Oversimplification ignores visual weight, gestalt, dynamic symmetry | | Instead: Analyze visual weight center, consider multiple frameworks |

"Higher NIMA score = better photo"

| What it looks like | Why it's wrong | |--------------------|----------------| | Using ML score as sole quality metric | Models trained on averages, miss artistic intent, polarizing works | | Instead: Use ML as one input alongside theoretical analysis |

"Color harmony means matching colors"

| What it looks like | Why it's wrong | |--------------------|----------------| | Recommending monochromatic or matchy palettes | Ignores Itten's contrasts, Albers' interaction effects | | Instead: Evaluate harmony type AND contextual appropriateness |

Ignoring genre context

| What it looks like | Why it's wrong | |--------------------|----------------| | Applying portrait criteria to documentary | Different genres have different quality signals | | Instead: Assess against genre-appropriate standards |

Reference Files

Load these for detailed implementations:

| File | Contents | |------|----------| | references/composition-theory.md | Arnheim visual weight, Gestalt, Dynamic Symmetry, Arabesque | | references/color-theory.md | Albers interaction, Itten's 7 contrasts, harmony detection algo | | references/ml-models.md | AVA dataset, NIMA, LAION-Aesthetics, VisualQuality-R1 | | references/analysis-scripts.md | PhotoCritic class, MCP server implementation |

Key Sources

Theory: Arnheim (1974), Hambidge (1926), Itten (1961), Albers (1963), Freeman (2007)

Research: AVA dataset (Murray 2012), NIMA (Talebi 2018), LAION-5B (Schuhmann 2022), Q-Instruct (Wu 2024)