train-with-environments

Name: train-with-environments
Author: PrimeIntellect-ai

PrimeIntellect-ai · Development

使用托管 RL 或 prime-rl 在验证环境中训练模型。当被要求配置 RL 运行、调优关键超参数、诊断不稳定性、设置难度过滤与过采样，或为新环境创建实用的训练与评估循环时使用。

Train models with verifiers environments using hosted RL or prime-rl. Use when asked to configure RL runs, tune key hyperparameters, diagnose instability, set up difficulty filtering and oversampling, or create practical train and eval loops for new environments.

npx skills add https://github.com/PrimeIntellect-ai/verifiers --skill train-with-environments

星标 4187 · 安装量 0

GitHub · SkillBox 全部技能