agent-evaluation

Name: agent-evaluation
Author: NeoLabHQ

NeoLabHQ · Meta

Provides a detailed framework for evaluating Claude Code agents and skills, covering evaluation methods, metrics, and practical implementation guidance. Focuses on outcome-based assessment rather than exact execution paths, with specific metrics for different evaluation scenarios.
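To illustrate the distinction between outcome-based assessment and exact execution paths, here is a minimal sketch in Python. It is not taken from the skill itself: the `AgentRun` record, the `path_match` and `outcome_score` functions, and the sample runs are all hypothetical names invented for this example, and the substring check stands in for whatever metrics the skill actually defines.

```python
from dataclasses import dataclass


@dataclass
class AgentRun:
    """Hypothetical record of one agent run: steps taken and final output."""
    steps: list[str]
    final_output: str


def path_match(run: AgentRun, expected_steps: list[str]) -> bool:
    """Path-based check: passes only if the run took exactly the expected steps."""
    return run.steps == expected_steps


def outcome_score(run: AgentRun, required_facts: list[str]) -> float:
    """Outcome-based check: fraction of required facts found in the final output."""
    if not required_facts:
        return 1.0
    text = run.final_output.lower()
    hits = sum(1 for fact in required_facts if fact.lower() in text)
    return hits / len(required_facts)


# Two illustrative runs that reach the same result by different routes.
run_a = AgentRun(
    steps=["grep", "read_file", "edit"],
    final_output="Fixed the null check in parser.py; tests pass.",
)
run_b = AgentRun(
    steps=["read_file", "edit", "run_tests"],
    final_output="Tests pass after fixing the null check in parser.py.",
)

required = ["null check", "parser.py", "tests pass"]
same_path = path_match(run_b, run_a.steps)
score_a = outcome_score(run_a, required)
score_b = outcome_score(run_b, required)
```

In this sketch the two runs fail a strict path comparison but receive the same outcome score, which is the kind of case an outcome-based evaluation is meant to treat as equivalent.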

npx skills add https://github.com/NeoLabHQ/context-engineering-kit --skill agent-evaluation

Stars 1100 · Installs 550
