evaluation
muratcankoylan · Development
一份全面且结构清晰的AI智能体性能评估指南,提供了实用的方法论、多维度评估标准和基于证据的见解,适用于系统性质量评估。
This skill should be used when the user asks to "evaluate agent performance", "build test framework", "measure agent quality", "create evaluation rubrics", or mentions LLM-as-judge, multi-dimensional evaluation, agent testing, or quality gates for agent pipelines.
npx skills add https://github.com/muratcankoylan/Agent-Skills-for-Context-Engineering --skill evaluation
星标 16509 · 安装量 5