whisper

Orchestra-Research · Development

OpenAI通用语音识别模型,支持99种语言,具备转录、翻译成英文、语言识别功能,提供从小型(39M参数)到大型(1550M参数)六种尺寸。适用于语音转文字、播客转录或多语言音频处理,适合鲁棒、多语言的语音识别任务。

OpenAI's general-purpose speech recognition model. Supports 99 languages, transcription, translation to English, and language identification. Six model sizes from tiny (39M params) to large (1550M params). Use for speech-to-text, podcast transcription, or multilingual audio processing. Best for robust, multilingual ASR.

npx skills add https://github.com/Orchestra-Research/AI-Research-SKILLs --skill whisper

星标 9626 · 安装量 0

GitHub · SkillBox 全部技能