evaluation

Name: evaluation
Author: muratcankoylan

muratcankoylan · Development

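A comprehensive, clearly structured guide to evaluating AI agent performance, offering practical methodology, multi-dimensional evaluation criteria, and evidence-based insights for systematic quality assessment.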

This skill should be used when the user asks to "evaluate agent performance", "build test framework", "measure agent quality", "create evaluation rubrics", or mentions LLM-as-judge, multi-dimensional evaluation, agent testing, or quality gates for agent pipelines.

npx skills add https://github.com/muratcankoylan/Agent-Skills-for-Context-Engineering --skill evaluation

Stars 16509 · Installs 5
