agent-evaluation
NeoLabHQ · Meta
Provides a detailed framework for evaluating Claude Code agents and skills, covering evaluation methods, metrics, and practical implementation guidance. Focuses on outcome-based assessment rather than exact execution paths, with specific metrics for different evaluation scenarios.
npx skills add https://github.com/NeoLabHQ/context-engineering-kit --skill agent-evaluation
Stars 1100 · Installs 550