debug-distributed

inclusionAI · Development

用于调试AReaL分布式训练问题的指南。当用户遇到卡死、结果错误、内存溢出(OOM)或通信错误时使用。

Guide for debugging distributed training issues in AReaL. Use when user encounters hangs, wrong results, OOM, or communication errors.

npx skills add https://github.com/inclusionAI/AReaL --skill debug-distributed

星标 5293 · 安装量 0

GitHub · SkillBox 全部技能