huggingface-llm-trainer

huggingface · Other

使用TRL(Transformer强化学习)或Unsloth与Hugging Face Jobs基础设施训练或微调语言和视觉模型。涵盖SFT、DPO、

Train or fine-tune language and vision models using TRL (Transformer Reinforcement Learning) or Unsloth with Hugging Face Jobs infrastructure. Covers SFT, DPO,

npx skills add https://github.com/huggingface/skills --skill huggingface-llm-trainer

星标 10656 · 安装量 934

GitHub · SkillBox 全部技能