train-with-environments
PrimeIntellect-ai · Development
使用托管 RL 或 prime-rl 在验证环境中训练模型。当被要求配置 RL 运行、调优关键超参数、诊断不稳定性、设置难度过滤与过采样,或为新环境创建实用的训练与评估循环时使用。
Train models with verifiers environments using hosted RL or prime-rl. Use when asked to configure RL runs, tune key hyperparameters, diagnose instability, set up difficulty filtering and oversampling, or create practical train and eval loops for new environments.
npx skills add https://github.com/PrimeIntellect-ai/verifiers --skill train-with-environments
星标 4187 · 安装量 0