debug-distributed
inclusionAI · Development
用于调试AReaL分布式训练问题的指南。当用户遇到卡死、结果错误、内存溢出(OOM)或通信错误时使用。
Guide for debugging distributed training issues in AReaL. Use when user encounters hangs, wrong results, OOM, or communication errors.
npx skills add https://github.com/inclusionAI/AReaL --skill debug-distributed
星标 5293 · 安装量 0