🔍 Conformer-2 is an upgraded speech recognition model that outperforms Conformer-1 in terms of speed, recognition of alphanumerics and proper nouns, and noise robustness.
🔎 Conformer-2 is already the default model on Assembly AI's API.
📈 The size of Conformer-2 has increased to 450 million parameters and it has been trained on 1.1 million hours of data, resulting in significant performance improvements across various domains and benchmarks.
🔍 Conformer-2 uses noisy student teacher training to improve the training data quality and quantity.
⏱️ Optimizations on the system side reduce latency and allow scaling beyond 1 million data mark.
🧩 Conformer-2 utilizes an ensemble of teacher models and data filtering techniques to ensure high quality pseudo labels and avoid overfitting.
💡 Conformer-2 is a state-of-the-art speech recognition model.
🔎 Conformer-2 focuses on alphanumerics recognition and proper noun recognition.
⚙️ The correct recognition of alphanumerics is crucial for the effectiveness of a speech recognition model.
✨ Conformer-2 has improved alphanumerics and proper noun recognition compared to Conformer-1.
💰 The new parameter called Speech thresholds allows users to control the cost of transcriptions on Assembly AI.
⏱️ Conformer-2 can process files based on a specified minimum number of minutes, reducing costs for sleep podcasts, music, and empty audio files.
Top 12 Signs You Are An ENTP
The Date Tier List - 43 Unique Date Ideas (Never Run Out Of Things To Do)
宁夏银川小镇烧烤,炭烤牛羊肉串,辣卤酱香羊蹄,阿星逛西部影城Small town street barbecue in Yinchuan, Ningxia
Television Production: Crash Course Film Production with Lily Gladstone #15
My Training with Roger Federer
Lo que no vemos en el debate sobre la inmigración | Sani Ladan | TEDxTarragona