The LIUM has a complete automatic speech recognition (ASR) system. The core of the system is based on Sphinx, distributed by Carnegie Mellon University (CMU) and KALDI. The LIUM system was initially developed for transcription of French newscasts. It was then adapted for the transcription of debates in English, Spanish, Italien, Arabic and German, as well as for the telephone transcription of dialog in French an English.
The LIUM has been carrying out research in automatic translation since 2007. The system of the LIUM is based on the software Moses which is a statistical translation system using the concept of phrase tables. We regularly add new functionalities to improve performance, as attested by our results in international evaluation campaigns such as WMT, IWSLT, or those organized by the NIST.
The statistical approach for translation is generic and independent of the treated language pair. However, we concentrated our efforts on the following languages: English, French, Arabic, Mandarin.
Our research activities in translation are also characterized by a privileged co-operation with the company SYSTRAN, the world leader on the market of translation software. We work together on the convergence of the statistical approaches and the formal methods.
The speaker recognition activity began at LIUM at the end of 2004. Since then, we work on speaker diarization (single and cross-show), speaker identification and verification as well as language identification. The developed systems obtained very good performance in speaker diarization and identification.
L’équipe LST participe régulièrement à des campagnes d’évaluation internationales et nationales dans le domaine de la reconnaissance de la parole, de la traduction automatique, de la traduction automatique de la parole ou encore de la reconnaissance du locuteur. Ces campagnes ont pour but d’évaluer les performances des technologies à l’état de l’art, et permettent aux participants de comparer leurs systèmes avec ceux des meilleurs laboratoires du domaine. Le tableau suivant synthétise les participations de l’équipe.