Omnilingual ASR: Meta's 1,600+ Language Open-Source Speech Recognition Model (메타의 1600개 언어 지원 오픈소스 음성인식 모델)

Overview

[AI Summary]: Meta AI has released Omnilingual ASR, an open-source automatic speech recognition system supporting over 1,600 languages, including 500+ low-resource languages previously unsupported by existing ASR systems. The technology achieves high accuracy using large-scale models (up to 7B parameters) and features in-context learning capabilities inspired by LLMs, enabling zero-shot learning for new languages with just a few samples. The project includes both the models and the Omnilingual ASR Corpus dataset, all released under Apache 2.0 license to improve digital accessibility for global language communities.