Publications (all)

HAL publications of the structure parole; multispeech

2025

Journal articles

titre: Adapting general disentanglement-based speaker anonymization for enhanced emotion preservation
auteur: Xiaoxiao Miao, Yuxiang Zhang, Xin Wang, Natalia Tomashenko, Donny Cheng Lock Soh, Ian Mcloughlin
article: Computer Speech and Language, 2025, 94, pp.101810. ⟨10.1016/j.csl.2025.101810⟩

titre: Objective and subjective evaluation of speech enhancement methods in the UDASE task of the 7th CHiME challenge
auteur: Simon Leglaive, Matthieu Fraticelli, Hend ElGhazaly, Léonie Borne, Mostafa Sadeghi, Scott Wisdom, Manuel Pariente, John R. Hershey, Daniel Pressnitzer, Jon P. Barker
article: Computer Speech and Language, 2025, 89, ⟨10.1016/j.csl.2024.101685⟩

titre: Posterior Transition Modeling for Unsupervised Diffusion-Based Speech Enhancement
auteur: Mostafa Sadeghi, Jean-Eudes Ayilo, Romain Serizel, Xavier Alameda-Pineda
article: IEEE Signal Processing Letters, In press, pp.1-5. ⟨10.1109/LSP.2025.3583967⟩

Conference papers

titre: Towards Low-Latency Tracking of Multiple Speakers With Short-Context Speaker Embeddings
auteur: Taous Iatariene, Alexandre Guérin, Romain Serizel
article: 2025 IEEE 27th International Workshop on Multimedia Signal Processing (MMSP), Sep 2025, Beijing, China

titre: Towards Skeletal and Signer Noise Reduction in Sign Language Production via Quaternion-Based Pose Encoding and Contrastive Learning
auteur: Guilhem Fauré, Mostafa Sadeghi, Sam Bigeard, Slim Ouni
article: SLTAT 2025: 9th Workshop on Sign Language Translation and Avatar Technologies, Sep 2025, Berlin, Germany. ⟨10.1145/3742886.3756728⟩

titre: Speaker Embeddings to Improve Tracking of Intermittent and Moving Speakers
auteur: Taous Iatariene, Can Cui, Alexandre Guérin, Romain Serizel
article: 33rd European Signal Processing Conference (EUSIPCO 2025), Sep 2025, Palermo, Italy

titre: End-to-end Joint Punctuated and Normalized ASR with a Limited Amount of Punctuated Training Data
auteur: Can Cui, Imran Ahamad Sheikh, Mostafa Sadeghi, Emmanuel Vincent
article: European Signal Processing Conference (EUSIPCO 2025), Sep 2025, Palermo, Italy

titre: Joint Beamforming and Speaker-Attributed ASR for Real Distant-Microphone Meeting Transcription
auteur: Can Cui, Imran Ahamad Sheikh, Mostafa Sadeghi, Emmanuel Vincent
article: European Signal Processing Conference (EUSIPCO 2025), Sep 2025, Palermo, Italy

titre: Phoneme-Level Speech Intelligibility Reduction
auteur: Aine Drelingyte, Romain Serizel, Mathieu Lagrange
article: EUSIPCO 2025 – 33rd European Signal Processing Conference, Sep 2025, Palermo, Italy

titre: Exploiting Context-dependent Duration Features for Voice Anonymization Attack Systems
auteur: Natalia Tomashenko, Emmanuel Vincent, Marc Tommasi
article: Interspeech 2025, Aug 2025, Rotterdam, Netherlands

titre: Mixture of LoRA Experts for Low-Resourced Multi-Accent Automatic Speech Recognition
auteur: Raphaël Bagat, Irina Illina, Emmanuel Vincent
article: 26th Interspeech Conference (Interspeech 2025), Aug 2025, Rotterdam, Netherlands

titre: Exploring Gesture Formalization: Encoding Features and Automation Strategies
auteur: Domitille Caillat, Mickaëlla Grondin-Verdon, Slim Ouni
article: 10th Conference of the International Society for Gesture Studies, Jul 2025, Nijmegen, Netherlands

titre: Tracking of Intermittent and Moving Speakers: Dataset and Metrics
auteur: Taous Iatariene, Alexandre Guérin, Romain Serizel
article: Proceedings of the 11th Convention of the European Acoustics Association Forum Acusticum 2025, Jun 2025, Malaga, Spain

titre: Diffusion-based Unsupervised Audio-visual Speech Enhancement
auteur: Jean-Eudes Ayilo, Mostafa Sadeghi, Romain Serizel, Xavier Alameda-Pineda
article: ICASSP 2025 – International Conference on Acoustics Speech and Signal Processing, IEEE, Apr 2025, Hyderabad, India. pp.1-5

titre: Analysis of Speech Temporal Dynamics in the Context of Speaker Verification and Voice Anonymization
auteur: Natalia Tomashenko, Emmanuel Vincent, Marc Tommasi
article: 2025 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2025), Apr 2025, Hyderabad, India. ⟨10.1109/ICASSP49660.2025.10887896⟩

titre: The First VoicePrivacy Attacker Challenge
auteur: Natalia Tomashenko, Xiaoxiao Miao, Emmanuel Vincent, Junichi Yamagishi
article: ICASSP 2025 – 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Apr 2025, Hyderabad, India. pp.1-2, ⟨10.1109/ICASSP49660.2025.10888513⟩

titre: Iconic Co-verbal Gestures: Study of Some Facets of Iconicity
auteur: Domitille Caillat, Mickaëlla Grondin-Verdon, Slim Ouni
article: Workshop on dimensions of iconicity in the visual modality, Feb 2025, Göttingen, Germany

Theses

titre: Co-speech gesture synthesis: Towards a controllable and interpretable model using a graph deterministic approach
auteur: Louis Abel
article: Computer Science [cs]. Université de Lorraine, 2025. English. ⟨NNT : 2025LORR0020⟩

2024

Journal articles

titre: A Phoneme-Scale Assessment of Multichannel Speech Enhancement Algorithms
auteur: Nasser-Eddine Monir, Paul Magron, Romain Serizel
article: Trends in Hearing, 2024, 28, ⟨10.1177/23312165241292205⟩

titre: Evaluating and predicting the audibility of acoustic alarms in the workplace using experimental methods and deep learning
auteur: François Effa, Jean-Pierre Arz, Romain Serizel, Nicolas Grimault
article: Applied Acoustics, 2024, 219, pp.109955. ⟨10.1016/j.apacoust.2024.109955⟩

titre: Training RNN Language Models on Uncertain ASR Hypotheses in Limited Data Scenarios
auteur: Imran Ahamad Sheikh, Emmanuel Vincent, Irina Illina
article: Computer Speech and Language, 2024, 83, pp.101555. ⟨10.1016/j.csl.2023.101555⟩

titre: Automatic segmentation of vocal tract articulators in real-time magnetic resonance imaging
auteur: Vinicius Ribeiro, Karyna Isaieva, Justine Leclere, Jacques Felblinger, Pierre-André Vuissoz, Yves Laprie
article: Computer Methods and Programs in Biomedicine, 2024, 243 (2), pp.107907. ⟨10.1016/j.cmpb.2023.107907⟩

titre: Unsupervised Performance Analysis of 3D Face Alignment with a Statistically Robust Confidence Test
auteur: Mostafa Sadeghi, Xavier Alameda-Pineda, Radu Horaud
article: Neurocomputing, 2024, 564, pp.1-16. ⟨10.1016/j.neucom.2023.126941⟩

titre: The VoicePrivacy 2022 Challenge: Progress and perspectives in voice anonymisation
auteur: Michele Panariello, Natalia Tomashenko, Xin Wang, Xiaoxiao Miao, Pierre Champion, Hubert Nourtel, Massimiliano Todisco, Nicholas Evans, Emmanuel Vincent, Junichi Yamagishi
article: IEEE/ACM Transactions on Audio, Speech and Language Processing, In press, ⟨10.1109/TASLP.2024.3430530⟩

titre: AVI-Corse: methodology and challenges of a participatory project. Digital avatars, new tools for language and communication needs.
auteur: Agnès Piquard-Kipffer, Karen Martinelli, Léa Dussere, Anne Sancier, Jérémy Zytnicki, Caroline Barbot-Bouzit, Slim Ouni
article: La Nouvelle revue – Éducation et société inclusives, 2024, 98-99 (1), pp.341-353. ⟨10.3917/nresi.098.0341⟩

Conference papers

titre: Retour d’expérience : Whisper pour les langues régionales
auteur: Sam Bigeard, Panagiotis Tsolakis, Emmanuel Vincent, Vincent Colotte, Pascale Erhart, Slim Ouni
article: LIFT 2: Journées scientifiques du GdR Linguistique Informatique, Formelle et de Terrain, GdR Linguistique Informatique, Formelle et de Terrain, Nov 2024, Orléans, France

titre: MMAR: Multilingual and multimodal anaphora resolution in instructional videos
auteur: Cennet Oguz, Pascal Denis, Simon Ostermann, Natalia Skachkova, Emmanuel Vincent, Josef van Genabith
article: Findings of the 2024 Conference on Empirical Methods in Natural Language Processing, Nov 2024, Miami, United States

titre: Towards interpretable co-speech gestures synthesis using STARGATE
auteur: Louis Abel, Vincent Colotte, Slim Ouni
article: International Conference on Multimodal Interaction (ICMI Companion ’24: GENEA Workshop), Nov 2024, San José, Costa Rica. ⟨10.1145/3686215.3688819⟩

titre: Qualitative study of gesture annotation corpus: Challenges and perspectives
auteur: Mickaëlla Grondin-Verdon, Domitille Caillat, Slim Ouni
article: ICMI Companion ’24: Companion Proceedings of the 26th International Conference on Multimodal Interaction, Nov 2024, San José, Costa Rica. pp.147-155, ⟨10.1145/3686215.3688820⟩

titre: From Computation to Consumption: Exploring the Compute-Energy Link for Training and Testing Neural Networks for SED Systems
auteur: Constance Douwes, Romain Serizel
article: Detection and Classification of Acoustic Scenes and Events 2024, Oct 2024, Tokyo, Japan

titre: Normalizing Energy Consumption for Hardware-Independent Evaluation
auteur: Constance Douwes, Romain Serizel
article: 2024 IEEE International Workshop on Machine Learning for Signal Processing, Sep 2024, London, United Kingdom

titre: Assessment of avatar lip-reading technology (AVI-Corse project). Perspectives of young people with and without hearing loss
auteur: Agnès Piquard-Kipffer, Ana Krilanovic, Jérémy Zytnicki, Karen Martinelli, Léa Dussere, Anne Sancier, Slim Ouni
article: 36th WCA 2024, World Congress of Audiology, the French Society of Audiology with the support of the French ENT Society, Sep 2024, Paris (CNIT – La Défense), France

titre: Towards realtime co-speech gestures synthesis using STARGATE
auteur: Louis Abel, Vincent Colotte, Slim Ouni
article: 25th Interspeech Conference (INTERSPEECH 2024), Sep 2024, Kos Island, Greece

titre: 1000 African Voices: Advancing inclusive multi-speaker multi-accent speech synthesis
auteur: Sewade Ogun, Abraham T. Owodunni, Tobi Olatunji, Eniola Alese, Babatunde Oladimeji, Tejumade Afonja, Kayode Olaleye, Naome A. Etori, Tosin Adewumi
article: Interspeech 2024, Sep 2024, Kos Island, Greece

titre: Multi-channel extension of pre-trained models for speaker verification
auteur: Ladislav Mošner, Romain Serizel, Lukáš Burget, Oldřich Plchot, Emmanuel Vincent, Junyi Peng, Jan Černocký
article: Interspeech, Sep 2024, Kos, Greece

titre: Mixture of Mixups for Multi-label Classification of Rare Anuran Sounds
auteur: Ilyass Moummad, Nicolas Farrugia, Romain Serizel, Jeremy Froidevaux, Vincent Lostanlen
article: EUSIPCO 2024: 32nd European Signal Processing Conference, Aug 2024, Lyon, France. pp.1282-1286, ⟨10.23919/EUSIPCO63174.2024.10715140⟩

titre: Proactive Detection of Voice Cloning with Localized Watermarking
auteur: Robin San Roman, Pierre Fernandez, Hady Elsahar, Alexandre Défossez, Teddy Furon, Tuan Tran
article: ICML 2024 – 41st International Conference on Machine Learning, PMLR, Jul 2024, Vienna, Austria. pp.1-17

titre: Synthèse de gestes communicatifs via STARGATE
auteur: Louis Abel, Vincent Colotte, Slim Ouni
article: 35èmes Journées d’Études sur la Parole (JEP 2024), 31ème Conférence sur le Traitement Automatique des Langues Naturelles (TALN 2024), 26ème Rencontre des Étudiants Chercheurs en Informatique pour le Traitement Automatique des Langues (RECITAL 2024), Jul 2024, Toulouse, France. pp.181-190

titre: Textual analysis of Mayan and Egyptian manuscripts: contributions of n-gram coding and graded multidimensional representations
auteur: Bruno Delprat, Martine Cadot, Alain Lelu
article: JADT 2024 – 17es Journées internationales d’Analyse statistique des Données Textuelles, SeSLa (Séminaire des Sciences du Langage de l’UCLouvain – Site Saint-Louis), en collaboration avec le LASLA (Laboratoire d’Analyse statistique des Langues anciennes de l’Université de Liège), Jun 2024, Brussels, Belgium

titre: Improving Speaker Assignment in Speaker-Attributed ASR for Real Meeting Applications
auteur: Can Cui, Imran Ahamad Sheikh, Mostafa Sadeghi, Emmanuel Vincent
article: The Speaker and Language Recognition Workshop Odyssey 2024, Jun 2024, Quebec, Canada

titre: Signs and Synonymity: Continuing Development of the Multilingual Sign Language Wordnet
auteur: Marc Schulder, Sam Bigeard, Maria Kopf, Thomas Hanke, Anna Kuder, Joanna Wójcicka, Johanna Mesch, Thomas Björkstrand, Anna Vacalopoulou, Kyriaki Vasilaki, Theodore Goulas, Stavroula-Evita Fotinea, Ele Efthimiou
article: LREC-COLING 2024 – 11th Workshop on the Representation and Processing of Sign Languages: Evaluation of Sign Language Resources, May 2024, Torino, Italy

titre: RoboVox: A Single/Multi-channel Far-field Speaker Recognition Benchmark for a Mobile Robot
auteur: Mohammad Mohammadamini, Driss Matrouf, Mickael Rouvier, Jean-François Bonastre, Romain Serizel, Théophile Gonos
article: LREC-COLING, ELRA, May 2024, Torino, Italy

titre: Are glottalic mechanisms in Human Beatboxing really glottalic?
auteur: Alexis Dehais-Underdown, Lise Crevier-Buchman, Didier Demolin, Pierre-André Vuissoz, Marc Fauvel, Jacques Felblinger, Yves Laprie
article: 13th International Seminar of Speech Production, May 2024, Autrans, France

titre: What does supraglottic articulatory global speed tell us about disfluencies?
auteur: Fabrice Hirsch, Ivana Didirková, Michael Blomgren, Sofiane Azzouz, Fanny Guitard-Ivent, Slim Ouni
article: International Seminar on Speech Production, May 2024, Autrans, France

titre: A weighted-variance variational autoencoder model for speech enhancement
auteur: Ali Golmakani, Mostafa Sadeghi, Xavier Alameda-Pineda, Romain Serizel
article: ICASSP 2024 – International Conference on Acoustics Speech and Signal Processing, IEEE, Apr 2024, Seoul, South Korea. pp.1-5, ⟨10.1109/ICASSP48485.2024.10446294⟩

titre: Unsupervised speech enhancement with diffusion-based generative models
auteur: Berné Nortier, Mostafa Sadeghi, Romain Serizel
article: International Conference on Acoustics Speech and Signal Processing (ICASSP), IEEE, Apr 2024, Seoul, South Korea. ⟨10.48550/arXiv.2309.10450⟩

titre: Posterior sampling algorithms for unsupervised speech enhancement with recurrent variational autoencoder
auteur: Mostafa Sadeghi, Romain Serizel
article: International Conference on Acoustics Speech and Signal Processing (ICASSP), IEEE, Apr 2024, Seoul, South Korea. ⟨10.48550/arXiv.2309.10439⟩

titre: Self-Supervised Learning for Few-Shot Bird Sound Classification
auteur: Ilyass Moummad, Nicolas Farrugia, Romain Serizel
article: ICASSPW 2024: IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops, Apr 2024, Seoul, South Korea. ⟨10.1109/ICASSPW62465.2024.10627576⟩

titre: Performance and Energy Balance: A Comprehensive Study of State-of-the-Art Sound Event Detection Systems
auteur: Francesca Ronchini, Romain Serizel
article: ICASSP 2024 – 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Apr 2024, Seoul, South Korea. pp.1096-1100, ⟨10.1109/ICASSP48485.2024.10445834⟩

titre: Regularized Contrastive Pre-Training for Few-Shot Bioacoustic Sound Detection
auteur: Ilyass Moummad, Nicolas Farrugia, Romain Serizel
article: ICASSP 2024: IEEE International Conference on Acoustics, Speech and Signal Processing, Apr 2024, Seoul, South Korea. ⟨10.1109/ICASSP48485.2024.10446409⟩

titre: Diffusion-based speech enhancement with a weighted generative-supervised learning loss
auteur: Jean-Eudes Ayilo, Mostafa Sadeghi, Romain Serizel
article: International Conference on Acoustics Speech and Signal Processing (ICASSP), IEEE, Apr 2024, Seoul, South Korea. ⟨10.48550/arXiv.2309.10457⟩

Book sections

titre: STATE OF THE ART
auteur: Guillaume Coiffier, Sewade Ogun, Leo Valque, Priyansh Trivedi
article: THINK BEFORE LOADING, 2024, 978-2-9591975-0-5

Reports

titre: The Voice Privacy 2024 Challenge Evaluation Plan
auteur: Natalia Tomashenko, Xiaoxiao Miao, Pierre Champion, Sarina Meyer, Xin Wang, Emmanuel Vincent, Michele Panariello, Nicholas Evans, Junichi Yamagishi, Massimiliano Todisco
article: Inria; Eurecom; NII. 2024

Theses

titre: Generating diverse synthetic data for ASR training data augmentation
auteur: Sewade Ogun
article: Computer Science [cs]. Université de Lorraine, 2024. English. ⟨NNT : 2024LORR0116⟩

titre: Joint speech separation, diarization, and recognition for automatic meeting transcription
auteur: Can Cui
article: Computation and Language [cs.CL]. Université de Lorraine, 2024. English. ⟨NNT : 2024LORR0103⟩

Preprints, Working Papers, …

titre: The First VoicePrivacy Attacker Challenge Evaluation Plan
auteur: Natalia Tomashenko, Xiaoxiao Miao, Emmanuel Vincent, Junichi Yamagishi
article: 2024

titre: Domain-Invariant Representation Learning of Bird Sounds
auteur: Ilyass Moummad, Romain Serizel, Emmanouil Benetos, Nicolas Farrugia
article: 2024

titre: Energy Consumption Trends in Sound Event Detection Systems
auteur: Constance Douwes, Romain Serizel
article: 2024

titre: Latent Watermarking of Audio Generative Models
auteur: Robin San Roman, Pierre Fernandez, Antoine Deleforge, Yossi Adi, Romain Serizel
article: 2024

2023

Journal articles

titre: Super-Resolved Dynamic 3D Reconstruction of the Vocal Tract during Natural Speech
auteur: Karyna Isaieva, Freddy Odille, Yves Laprie, Guillaume Drouot, Jacques Felblinger, Pierre-André Vuissoz
article: Journal of Imaging, 2023, 9 (10), pp.233. ⟨10.3390/jimaging9100233⟩

titre: Stuttering Detection Using Speaker Representations and Self-supervised Contextual Embeddings
auteur: Shakeel Sheikh, Md Sahidullah, Fabrice Hirsch, Slim Ouni
article: International Journal of Speech Technology, 2023, ⟨10.1007/s10772-023-10032-1⟩

titre: Expression-preserving face frontalization improves visually assisted speech processing
auteur: Zhiqi Kang, Mostafa Sadeghi, Radu Horaud, Xavier Alameda-Pineda
article: International Journal of Computer Vision, 2023, 131 (5), pp.1122-1140. ⟨10.1007/s11263-022-01742-1⟩

titre: Algorithms for audio inpainting based on probabilistic nonnegative matrix factorization
auteur: Ondřej Mokrý, Paul Magron, Thomas Oberlin, Cédric Févotte
article: Signal Processing, 2023, ⟨10.1016/j.sigpro.2022.108905⟩

titre: Privacy in Speech and Language Technology
auteur: Simone Fischer-Hübner, Dietrich Klakow, Peggy Valcke, Emmanuel Vincent
article: Dagstuhl Reports, 2023, 12 (8), pp.60-102. ⟨10.4230/DagRep.12.8.60⟩

titre: Advancing Stuttering Detection via Data Augmentation, Class-Balanced Loss and Multi-Contextual Deep Learning
auteur: Shakeel Ahmad Sheikh, Md Sahidullah, Fabrice Hirsch, Slim Ouni
article: IEEE Journal of Biomedical and Health Informatics, 2023, ⟨10.1109/JBHI.2023.3248281⟩

titre: Cross-corpora spoken language identification with domain diversification and generalization
auteur: Spandan Dey, Md Sahidullah, Goutam Saha
article: Computer Speech and Language, 2023, 81 (June 2023), pp.101489. ⟨10.1016/j.csl.2023.101489⟩

titre: Guest editorial: Special issue on advances in deep learning based speech processing
auteur: Xiaolei Zhang, Lei Xie, Eric Fosler-Lussier, Emmanuel Vincent
article: Neural Networks, 2023, 158, ⟨10.1016/j.neunet.2022.11.033⟩

titre: Differentially private speaker anonymization
auteur: Ali Shahin Shamsabadi, Brij Mohan Lal Srivastava, Aurélien Bellet, Nathalie Vauquier, Emmanuel Vincent, Mohamed Maouche, Marc Tommasi, Nicolas Papernot
article: Proceedings on Privacy Enhancing Technologies, 2023, 2023 (1), ⟨10.48550/arXiv.2202.11823⟩

titre: Modulation spectral features for speech emotion recognition using deep neural networks
auteur: Premjeet Singh, Md Sahidullah, Goutam Saha
article: Speech Communication, 2023, 146 (January), pp.53-69. ⟨10.1016/j.specom.2022.11.005⟩

Conference papers

titre: End-to-end Multichannel Speaker-Attributed ASR: Speaker Guided Decoder and Input Feature Analysis
auteur: Can Cui, Imran Ahamad Sheikh, Mostafa Sadeghi, Emmanuel Vincent
article: 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU 2023), Dec 2023, Taipei, Taiwan. ⟨10.1109/ASRU57964.2023.10389729⟩

titre: From Discrete Tokens to High-Fidelity Audio Using Multi-Band Diffusion
auteur: Robin San Roman, Yossi Adi, Antoine Deleforge, Romain Serizel, Gabriel Synnaeve, Alexandre Défossez
article: NeurIPS 2023 – Conference on Neural Information Processing Systems, Dec 2023, New Orleans, United States. ⟨10.48550/arXiv.2308.02560⟩

titre: Find-2-Find: Multitask Learning for Anaphora Resolution and Object Localization
auteur: Cennet Oguz, Pascal Denis, Emmanuel Vincent, Simon Ostermann, Josef van Genabith
article: 2023 Conference on Empirical Methods in Natural Language Processing, Dec 2023, Singapore, Singapore

titre: Pretraining Representations for Bioacoustic Few-Shot Detection using Supervised Contrastive Learning
auteur: Ilyass Moummad, Romain Serizel, Nicolas Farrugia
article: Detection and Classification of Acoustic Scenes and Events 2023, Sep 2023, Tampere, Finland

titre: Post-Processing Independent Evaluation of Sound Event Detection Systems
auteur: Janek Ebbers, Reinhold Haeb-Umbach, Romain Serizel
article: DCASE 2023 – 8th Workshop on Detection and Classification of Acoustic Scenes and Events, Sep 2023, Tampere, Finland. ⟨10.48550/arXiv.2306.15440⟩

titre: Monitoring environmental impact of DCASE systems: Why and how?
auteur: Constance Douwes, Francesca Ronchini, Romain Serizel
article: Detection and Classification of Acoustic Scenes and Events (DCASE) Workshop, Sep 2023, Tampere, Finland

titre: Spectrogram Inversion for Audio Source Separation via Consistency, Mixing, and Magnitude Constraints
auteur: Paul Magron, Tuomas Virtanen
article: EUSIPCO 2023, EURASIP, Sep 2023, Helsinki, Finland. ⟨10.48550/arXiv.2303.01864⟩

titre: BinauRec: A dataset to test the influence of the use of room impulse responses on binaural speech enhancement
auteur: Louis Delebecque, Romain Serizel
article: EUSIPCO 2023, EURASIP, Sep 2023, Helsinki, Finland. ⟨10.23919/EUSIPCO58844.2023.10289772⟩

titre: Signal Inpainting from Fourier Magnitudes
auteur: Louis Bahrman, Marina Krémé, Paul Magron, Antoine Deleforge
article: EUSIPCO 2023, Sep 2023, Helsinki, Finland. ⟨10.23919/EUSIPCO58844.2023.10289727⟩

titre: The CHiME-7 UDASE task: Unsupervised domain adaptation for conversational speech enhancement
auteur: Simon Leglaive, Léonie Borne, Efthymios Tzinis, Mostafa Sadeghi, Matthieu Fraticelli, Scott Wisdom, Manuel Pariente, Daniel Pressnitzer, John R. Hershey
article: 7th International Workshop on Speech Processing in Everyday Environments (CHiME), Aug 2023, Dublin, Ireland. ⟨10.21437/CHiME.2023-2⟩

titre: How to (Virtually) Train Your Speaker Localizer
auteur: Prerak Srivastava, Antoine Deleforge, Archontis Politis, Emmanuel Vincent
article: INTERSPEECH 2023, Aug 2023, Dublin, Ireland

titre: Self-supervised learning with diffusion-based multichannel speech enhancement for speaker verification under noisy conditions
auteur: Sandipana Dowerah, Ajinkya Kulkarni, Romain Serizel, Denis Jouvet
article: INTERSPEECH 2023, Aug 2023, Dublin, Ireland. pp.3849-3853, ⟨10.21437/Interspeech.2023-1890⟩

titre: Stochastic Pitch Prediction Improves the Diversity and Naturalness of Speech in Glow-TTS
auteur: Sewade Ogun, Vincent Colotte, Emmanuel Vincent
article: InterSpeech 2023, Aug 2023, Dublin, Ireland

titre: Modeling the temporal evolution of the vocal tract shape with deep learning
auteur: Yves Laprie, Vinicius Ribeiro, Karyna Isaieva, Justine Leclere, Jacques Felblinger, Pierre-André Vuissoz
article: 20th International Congress on Phonetic Sciences, Aug 2023, Prague, Czech Republic

titre: Non-pulmonic initiation in human beatboxing: a real-time MRI study
auteur: Alexis Dehais-Underdown, Paul Vignes, Lise Crevier-Buchman, Didier Demolin, Pierre-André Vuissoz, Karyna Isaieva, Marc Fauvel, Yves Laprie, Jacques Felblinger
article: 20th International Congress of Phonetic Sciences (ICPhS 2023), Aug 2023, Prague, Czech Republic

titre: Performance above all? Energy consumption vs. performance for machine listening, a study on DCASE task 4 baseline
auteur: Romain Serizel, Samuele Cornell, Nicolas Turpault
article: ICASSP 2023 – 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Jun 2023, Rhodes Island, Greece. pp.1-5, ⟨10.1109/ICASSP49357.2023.10095938⟩

titre: Fast and efficient speech enhancement with variational autoencoders
auteur: Mostafa Sadeghi, Romain Serizel
article: International Conference on Acoustics Speech and Signal Processing (ICASSP), IEEE, Jun 2023, Rhodes Island, Greece

titre: Lightweight Annotation and Class Weight Training for Automatic Estimation of Alarm Audibility in Noise
auteur: François Effa, Romain Serizel, Jean-Pierre Arz, Nicolas Grimault
article: ICASSP 2023 – 2023 IEEE International Conference on Acoustics, Speech and Signal Processing, Jun 2023, Rhodes Island, Greece. pp.1-5, ⟨10.1109/ICASSP49357.2023.10094730⟩

titre: SPICE+: Evaluation of automatic audio captioning systems with pre-trained language models
auteur: Félix Gontier, Romain Serizel, Christophe Cerisara
article: IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2023), Jun 2023, Rhodes Island, Greece

titre: Audio-visual speech enhancement with a deep kalman filter generative model
auteur: Ali Golmakani, Mostafa Sadeghi, Romain Serizel
article: International Conference on Acoustics Speech and Signal Processing (ICASSP), IEEE, Jun 2023, Rhodes Island, Greece

titre: Improving Hate Speech Detection with Self-Attention Mechanism and Multi-Task Learning
auteur: Nicolas Zampieri, Irina Illina, Dominique Fohr
article: LTC’23 – 10th Language & Technology Conference: Human Language Technologies as a Challenge for Computer Science and Linguistics, Apr 2023, Poznan, Poland

titre: Semantic Information Investigation for Transformer-based Rescoring of N-best Speech Recognition
auteur: Irina Illina, Dominique Fohr
article: LTC 2023, Apr 2023, Poznan, Poland

titre: Can we use Common Voice to train a Multi-Speaker TTS system?
auteur: Sewade Ogun, Vincent Colotte, Emmanuel Vincent
article: The 2022 IEEE Spoken Language Technology Workshop (SLT 2022), Jan 2023, Doha, Qatar

titre: Joint optimization of diffusion probabilistic-based multichannel speech enhancement with far-field speaker verification
auteur: Sandipana Dowerah, Romain Serizel, Denis Jouvet, M Mohammadamini, Driss Matrouf
article: IEEE SLT 2022, Jan 2023, Doha, Qatar

titre: From Room Impulse Responses to Wall Impulse Responses using Physics-Aware Deep Learning
auteur: Stéphane Dilungana, Antoine Deleforge, Cédric Foy, Sylvain Faisan
article: Proc. Forum Acusticum, 2023, Torino, Italy

Poster communications

titre: End-to-end Multichannel Speaker-Attributed ASR: Speaker Guided Decoder and Input Feature Analysis
auteur: Can Cui, Imran Ahamad Sheikh, Mostafa Sadeghi, Emmanuel Vincent
article: Rencontre des Jeunes Chercheurs en Parole 2023 – 10E Edition, Nov 2023, Grenoble, France

Reports

titre: Robovox: Far-Field Speaker Recognition By A Mobile Robot (Evaluation Plan)
auteur: Mohammad Mohammadamini, Mickael Rouvier, Driss Matrouf, Jean-François Bonastre, Romain Serizel, Denis Jouvet, Théophile Gonos
article: Avignon Université. 2023

titre: Supervised contrastive learning for pre-training bioacoustic few-shot systems
auteur: Ilyass Moummad, Romain Serizel, Nicolas Farrugia
article: IMT Atlantique; LORIA. 2023

Theses

titre: Hate speech detection in social media: contribution of multiword expressions
auteur: Nicolas Zampieri
article: Computer Science [cs]. Université de Lorraine, 2023. French. ⟨NNT : 2023LORR0387⟩

titre: Deep Supervision of the Vocal Tract Shape for Articulatory Synthesis of Speech
auteur: Vinicius Ribeiro
article: Computer Science [cs]. Université de Lorraine, 2023. English. ⟨NNT : 2023LORR0311⟩

titre: Realism in virtually supervised learning for acoustic room characterization and sound source localization
auteur: Prerak Srivastava
article: Machine Learning [cs.LG]. Université de Lorraine, 2023. English. ⟨NNT : 2023LORR0184⟩

titre: Deep Learning-based Speaker Verification In Real Conditions
auteur: Sandipana Dowerah
article: Computer Science [cs]. Université de Lorraine, 2023. English. ⟨NNT : 2023LORR0046⟩

titre: Anonymizing Speech: Evaluating and Designing Speaker Anonymization Techniques
auteur: Pierre Champion
article: Artificial Intelligence [cs.AI]. Université de Lorraine, 2023. English. ⟨NNT : 2023LORR0101⟩

titre: Enriching large language models with semantic lexicons and analogies
auteur: Georgios Zervakis
article: Document and Text Processing. Université de Lorraine, 2023. English. ⟨NNT : 2023LORR0039⟩

titre: Deep learning for stuttering detection
auteur: Shakeel Ahmad Sheikh
article: Computer Science [cs]. Université de Lorraine, 2023. English. ⟨NNT : 2023LORR0005⟩

titre: Transfer learning for abusive language detection
auteur: Tulika Bose
article: Computer Science [cs]. Université de Lorraine, 2023. English. ⟨NNT : 2023LORR0019⟩
Accès au texte intégral et bibtex

2022

Journal articles

titre: Gridless 3D Recovery of Image Sources from Room Impulse Responses
auteur: Tom Sprunck, Antoine Deleforge, Yannick Privat, Cédric Foy
article: IEEE Signal Processing Letters, 2022, 29, pp.2427-2431. ⟨10.1109/LSP.2022.3224682⟩
Accès au texte intégral et bibtex

titre: An Overview of Indian Spoken Language Recognition from Machine Learning Perspective
auteur: Spandan Dey, Md Sahidullah, Goutam Saha
article: ACM Transactions on Asian and Low-Resource Language Information Processing, 2022, 21 (6), pp.1-45. ⟨10.1145/3523179⟩
Accès au texte intégral et bibtex

titre: Machine Learning for Stuttering Identification: Review, Challenges & Future Directions
auteur: Shakeel Sheikh, Md Sahidullah, Fabrice Hirsch, Slim Ouni
article: Neurocomputing, 2022, 514 (2022), pp.17. ⟨10.1016/j.neucom.2022.10.015⟩
Accès au texte intégral et bibtex

titre: Analysis of constant-Q filterbank based representations for speech emotion recognition
auteur: Premjeet Singh, Shefali Waldekar, Md Sahidullah, Goutam Saha
article: Digital Signal Processing, 2022, 130, pp.103712. ⟨10.1016/j.dsp.2022.103712⟩
Accès au texte intégral et bibtex

titre: 3D dynamic spatiotemporal atlas of the vocal tract during consonant-vowel production from 2D real time MRI
auteur: Ioannis K Douros, Yu Xie, Chrysanthi Dourou, Karyna Isaieva, Pierre-André Vuissoz, Jacques Felblinger, Yves Laprie
article: Journal of Imaging, 2022, Special Issue Spatio-Temporal Biomedical Image Analysis, 8 (9), pp.227. ⟨10.3390/jimaging8090227⟩
Accès au texte intégral et bibtex

titre: Robust acoustic domain identification with its application to speaker diarization
auteur: A Kishore Kumar, Shefali Waldekar, Md Sahidullah, Goutam Saha
article: International Journal of Speech Technology, 2022, 25 (December), pp.933-945. ⟨10.1007/s10772-022-09990-9⟩
Accès au texte intégral et bibtex

titre: The VoicePrivacy 2020 Challenge: Results and findings
auteur: Natalia Tomashenko, Xin Wang, Emmanuel Vincent, Jose Patino, Brij Mohan Lal Srivastava, Paul-Gauthier Noé, Andreas Nautsch, Nicholas Evans, Junichi Yamagishi, Benjamin O’Brien, Anaïs Chanclu, Jean-François Bonastre, Massimiliano Todisco, Mohamed Maouche
article: Computer Speech and Language, 2022, 74, pp.101362. ⟨10.1016/j.csl.2022.101362⟩
Accès au texte intégral et bibtex

titre: Privacy and utility of x-vector based speaker anonymization
auteur: Brij Mohan Lal Srivastava, Mohamed Maouche, Md Sahidullah, Emmanuel Vincent, Aurélien Bellet, Marc Tommasi, Natalia Tomashenko, Xin Wang, Junichi Yamagishi
article: IEEE/ACM Transactions on Audio, Speech and Language Processing, 2022, ⟨10.1109/TASLP.2022.3190741⟩
Accès au texte intégral et bibtex

titre: A majorization-minimization algorithm for nonnegative binary matrix factorization
auteur: Paul Magron, Cédric Févotte
article: IEEE Signal Processing Letters, 2022, ⟨10.1109/LSP.2022.3187368⟩
Accès au texte intégral et bibtex

titre: Automatic generation of the complete vocal tract shape from the sequence of phonemes to be articulated
auteur: Vinicius Ribeiro, Karyna Isaieva, Justine Leclere, Pierre-André Vuissoz, Yves Laprie
article: Speech Communication, 2022, 141, pp.1-13. ⟨10.1016/j.specom.2022.04.004⟩
Accès au texte intégral et bibtex

titre: Neural content-aware collaborative filtering for cold-start music recommendation
auteur: Paul Magron, Cédric Févotte
article: Data Mining and Knowledge Discovery, 2022, ⟨10.1007/s10618-022-00859-8⟩
Accès au texte intégral et bibtex

titre: Non-Smooth Regularization: Improvement to Learning Framework through Extrapolation
auteur: Sajjad Amini, Mohammad Soltanian, Mostafa Sadeghi, Shahrokh Ghaemmaghami
article: IEEE Transactions on Signal Processing, 2022, 70, pp.1213-1223. ⟨10.1109/TSP.2022.3154969⟩
Accès au texte intégral et bibtex

titre: Overlapped speech detection and speaker counting using distant microphone arrays
auteur: Samuele Cornell, Maurizio Omologo, Stefano Squartini, Emmanuel Vincent
article: Computer Speech and Language, 2022, 72, pp.101306. ⟨10.1016/j.csl.2021.101306⟩
Accès au texte intégral et bibtex

titre: Learning the Proximity Operator in Unfolded ADMM for Phase Retrieval
auteur: Pierre-Hugo Vial, Paul Magron, Thomas Oberlin, Cédric Févotte
article: IEEE Signal Processing Letters, 2022, 29, pp.1619-1623. ⟨10.1109/LSP.2022.3189275⟩
Accès au texte intégral et bibtex

Conference papers

titre: An analogy based approach for solving target sense verification
auteur: Georgios Zervakis, Emmanuel Vincent, Miguel Couceiro, Marc Schoenauer, Esteban Marquer
article: NLPIR 2022 – 6th International Conference on Natural Language Processing and Information Retrieval, Dec 2022, Bangkok, Thailand
Accès au texte intégral et bibtex

titre: Transferring Knowledge via Neighborhood-Aware Optimal Transport for Low-Resource Hate Speech Detection
auteur: Tulika Bose, Irina Illina, Dominique Fohr
article: Asia-Pacific Chapter of the Association for Computational Linguistics and the 12th International Joint Conference on Natural Language Processing (AACL-IJCNLP), Nov 2022, Online, Taiwan
Accès au texte intégral et bibtex

titre: Chop and change: Anaphora resolution in instructional cooking videos
auteur: Cennet Oguz, Ivana Kruijff-Korbayová, Pascal Denis, Emmanuel Vincent, Josef van Genabith
article: Findings of AACL-IJCNLP 2022 – 2nd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics – 12th International Joint Conference on Natural Language Processing, Nov 2022, Taipei, Taiwan
Accès au texte intégral et bibtex

titre: Integrating isolated examples with weakly-supervised sound event detection: a direct approach
auteur: Mohammad Abdollahi, Romain Serizel, Alain Rakotomamonjy, Gilles Gasso
article: 7th Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE), Nov 2022, Nancy, France
Accès au texte intégral et bibtex

titre: Local time-frequency fading
auteur: Ama Marina Kreme, Bruno Torrésani, Antoine Deleforge
article: ICA 22 – International Congress on Acoustics 2022, Oct 2022, Gyeongju, South Korea
Accès au texte intégral et bibtex

titre: Accelerating the Centerline Processing of Vocal Tract Shapes for Articulatory Synthesis
auteur: Romain Karpinski, Vinicius Ribeiro, Yves Laprie
article: ICA 2022 – 24th International Congress on Acoustics, Oct 2022, Gyeongju, South Korea
Accès au texte intégral et bibtex

titre: How to Leverage DNN-based speech enhancement for multi-channel speaker verification?
auteur: Sandipana Dowerah, Romain Serizel, Denis Jouvet, Mohammad Mohammadamini, Driss Matrouf
article: 4th International Conference on Advances in Signal Processing and Artificial Intelligence (ASPAI’ 2022), Oct 2022, Corfu, Greece
Accès au texte intégral et bibtex

titre: End-to-End and Self-Supervised Learning for ComParE 2022 Stuttering Sub-Challenge
auteur: Shakeel A Sheikh, Md Sahidullah, Fabrice Hirsch, Slim Ouni
article: ACM Multimedia 2022 Computational Paralinguistics Challenge (ComParE), Oct 2022, Lisbon, Portugal
Accès au texte intégral et bibtex

titre: Domain Classification-based Source-specific Term Penalization for Domain Adaptation in Hate-speech Detection
auteur: Tulika Bose, Nikolaos Aletras, Irina Illina, Dominique Fohr
article: COLING 2022 – Proceedings of the 29th International Conference on Computational Linguistics, Oct 2022, Gyeongju, South Korea
Accès au texte intégral et bibtex

titre: Enhancing speech privacy with slicing
auteur: Mohamed Maouche, Brij Mohan Lal Srivastava, Nathalie Vauquier, Aurélien Bellet, Marc Tommasi, Emmanuel Vincent
article: Interspeech 2022 – Human and Humanizing Speech Technology, Sep 2022, Incheon, South Korea
Accès au texte intégral et bibtex

titre: Barlow Twins self-supervised learning for robust speaker recognition
auteur: Mohammad Mohammadamini, Driss Matrouf, Jean-François A Bonastre, Sandipana Dowerah, Romain Serizel, Denis Jouvet
article: Interspeech 2022 – Human and Humanizing Speech Technology, Sep 2022, Incheon, South Korea. ⟨10.21437/Interspeech.2022-11301⟩
Accès au texte intégral et bibtex

titre: A Sparsity-promoting Dictionary Model for Variational Autoencoders
auteur: Mostafa Sadeghi, Paul Magron
article: INTERSPEECH 2022, Sep 2022, Incheon, South Korea
Accès au texte intégral et bibtex

titre: Are disentangled representations all you need to build speaker anonymization systems?
auteur: Pierre Champion, Denis Jouvet, Anthony Larcher
article: INTERSPEECH 2022 – Human and Humanizing Speech Technology, Sep 2022, Incheon, South Korea
Accès au texte intégral et bibtex

titre: Autoencoder-Based Tongue Shape Estimation During Continuous Speech
auteur: Vinicius Ribeiro, Yves Laprie
article: 23rd INTERSPEECH Conference on “Human and Humanizing Speech Technology”, Sep 2022, Incheon, South Korea
Accès au texte intégral et bibtex

titre: Analysis of expressivity transfer in non-autoregressive end-to-end multispeaker TTS systems
auteur: Ajinkya Kulkarni, Vincent Colotte, Denis Jouvet
article: INTERSPEECH 2022, Sep 2022, Incheon, South Korea
Accès au texte intégral et bibtex

titre: Exploration of Multi-Corpus Learning for Hate Speech Classification in Low Resource Scenarios
auteur: Ashwin Geet d’Sa, Irina Illina, Dominique Fohr, Awais Akbar
article: TSD 2022 – 25th International Conference on Text, Speech and Dialogue, Sep 2022, Brno, Czech Republic
Accès au texte intégral et bibtex

titre: Vers un système embarqué de classification d’événements sonores : étude de l’impact de la quantification des descripteurs
auteur: Marie-Anne Lacroix, Nancy Bertin, Romuald Rocher, Pascal Scalart
article: GRETSI 2022 XXVIIIème Colloque Francophone de Traitement du Signal et des Images, Sep 2022, Nancy, France
Accès au texte intégral et bibtex

titre: Realistic sources, receivers and walls improve the generalisability of virtually-supervised blind acoustic parameter estimators
auteur: Prerak Srivastava, Antoine Deleforge, Emmanuel Vincent
article: 17th International Workshop on Acoustic Signal Enhancement (IWAENC), Sep 2022, Bamberg, Germany
Accès au texte intégral et bibtex

titre: Multi-stage attention for fine-grained expressivity transfer in multispeaker text-to-speech system
auteur: Ajinkya Kulkarni, Vincent Colotte, Denis Jouvet
article: EUSIPCO 2022, Aug 2022, Belgrade, Serbia
Accès au texte intégral et bibtex

titre: Geometry-Informed Estimation of Surface Absorption Profiles from Room Impulse Responses
auteur: Stéphane Dilungana, Antoine Deleforge, Cédric Foy, Sylvain Faisan
article: 30th European Signal Processing Conference (EUSIPCO), Aug 2022, Belgrade, Serbia. pp.867-871, ⟨10.23919/EUSIPCO55093.2022.9909667⟩
Accès au texte intégral et bibtex

titre: Robust Stuttering Detection via Multi-task and Adversarial Learning
auteur: Shakeel Sheikh, Md Sahidullah, Fabrice Hirsch, Slim Ouni
article: EUSIPCO 2022 – 30th European Signal Processing Conference, Aug 2022, Belgrade, Serbia
Accès au texte intégral et bibtex

titre: A Comprehensive Exploration of Noise Robustness and Noise Compensation in ResNet and TDNN-based Speaker Recognition Systems
auteur: Mohammad Mohammadamini, Driss Matrouf, Jean-François Bonastre, Sandipana Dowerah, Romain Serizel, Denis Jouvet
article: EUSIPCO 2022 – 30th European Signal Processing Conference, Aug 2022, Belgrade, Serbia
Accès au texte intégral et bibtex

titre: Synchronization of speech and gestures in an interactional context (SyncoGest Project)
auteur: Domitille Caillat, Ludovic Marin, Christelle Dodane, Fabrice Hirsch, Slim Ouni, Pierre Slangen, Patrice Guyot, Vincent Colotte, Aliyah Morgenstern, Louis Abel, Mickaëlla Grondin-Verdon, Juliette Lozano Goupil
article: ISGS 2022 – 9th Conference of the International Society for Gesture Studies, Jul 2022, Chicago, United States
Accès au bibtex

titre: Spoofing-Aware Speaker Verification with Unsupervised Domain Adaptation
auteur: Xuechen Liu, Md Sahidullah, Tomi Kinnunen
article: Odyssey 2022 – The Speaker and Language Recognition Workshop, Jun 2022, Beijing, China. pp.85-91, ⟨10.21437/Odyssey.2022-12⟩
Accès au texte intégral et bibtex

titre: Identification des Expressions Polylexicales dans les Tweets
auteur: Nicolas Zampieri, Carlos Ramisch, Irina Illina, Dominique Fohr
article: RECITAL 2022 – Traitement Automatique des Langues Naturelles (TALN), Jun 2022, Avignon, France
Accès au texte intégral et bibtex

titre: Adapting Language Models When Training on Privacy-Transformed Data
auteur: Mehmet Ali Tugtekin Turan, Dietrich Klakow, Emmanuel Vincent, Denis Jouvet
article: LREC 2022 – 13th Language Resources and Evaluation Conference, Jun 2022, Marseille, France
Accès au texte intégral et bibtex

titre: Identification of Multiword Expressions in Tweets for Hate Speech Detection
auteur: Nicolas Zampieri, Carlos Ramisch, Irina Illina, Dominique Fohr
article: LREC 2022 – 13th Language Resources and Evaluation Conference, Jun 2022, Marseille, France
Accès au texte intégral et bibtex

titre: Transformer versus LSTM Language Models Trained on Uncertain ASR Hypotheses in Limited Data Scenarios
auteur: Imran Ahamad Sheikh, Emmanuel Vincent, Irina Illina
article: LREC 2022 – 13th Language Resources and Evaluation Conference, Jun 2022, Marseille, France
Accès au texte intégral et bibtex

titre: Placing M-Phasis on the Plurality of Hate: A Feature-Based Corpus of Hate Online
auteur: Dana Ruiter, Liane Reiners, Ashwin Geet d’Sa, Thomas Kleinbauer, Dominique Fohr, Irina Illina, Dietrich Klakow, Christian Schemer, Angeliki Monnier
article: 13th International conference Language Resources and Evaluation Conference, European Language Resources Association (ELRA), Jun 2022, Marseille, France. pp.791-804
Accès au texte intégral et bibtex

titre: Privacy-Preserving Speech Representation Learning using Vector Quantization
auteur: Pierre Champion, Denis Jouvet, Anthony Larcher
article: JEP 2022 – Journées d’Études sur la Parole, Jun 2022, Île de Noirmoutier, France
Accès au texte intégral et bibtex

titre: La vélocité des mouvements labiaux et mandibulaires : un indice pour différencier les disfluences typiques du bégaiement et les disfluences normales ? Une étude pilote
auteur: Fabrice Hirsch, Ivana Didirková, Slim Ouni, Shakeel Ahmad Sheikh, Yves Laprie, Marie-Claude Monfrais-Pfauwadel, Eléonor Burkhardt
article: 34emes Journées d’Etudes sur la Parole – JEP2022, Jun 2022, Île de Noirmoutier, France. ⟨10.21437/JEP.2022-18⟩
Accès au bibtex

titre: Evaluation of Speaker Anonymization on Emotional Speech
auteur: Hubert Nourtel, Pierre Champion, Denis Jouvet, Anthony Larcher, Marie Tahon
article: JEP 2022 – Journées d’Études sur la Parole, Jun 2022, Île de Noirmoutier, France
Accès au texte intégral et bibtex

titre: BERT Semantic Context Model for Efficient Speech Recognition
auteur: Irina Illina, Dominique Fohr
article: ICCAS 2022 – International Conference on Cognitive Aircraft Systems, ISAE-SUPAERO, Jun 2022, Toulouse, France
Accès au bibtex

titre: Baselines and Protocols for Household Speaker Recognition
auteur: Alexey Sholokhov, Xuechen Liu, Md Sahidullah, Tomi Kinnunen
article: The Speaker and Language Recognition Workshop (Odyssey 2022), Jun 2022, Beijing, China. pp.185-192, ⟨10.21437/Odyssey.2022-26⟩
Accès au texte intégral et bibtex

titre: Baseline Systems for the First Spoofing-Aware Speaker Verification Challenge: Score and Embedding Fusion
auteur: Hye-Jin Shim, Hemlata Tak, Xuechen Liu, Hee-Soo Heo, Jee-Weon Jung, Joon Son Chung, Soo-Whan Chung, Ha-Jin Yu, Bong-Jin Lee, Massimiliano Todisco, Héctor Delgado, Kong Aik Lee, Md Sahidullah, Tomi Kinnunen, Nicholas Evans
article: Odyssey 2022 – The Speaker and Language Recognition Workshop, Jun 2022, Beijing, China
Accès au bibtex

titre: Learnable Nonlinear Compression for Robust Speaker Verification
auteur: Xuechen Liu, Md Sahidullah, Tomi Kinnunen
article: ICASSP 2022 – IEEE International Conference on Acoustics, Speech and Signal Processing, May 2022, Singapore, Singapore. ⟨10.1109/ICASSP43922.2022.9747185⟩
Accès au texte intégral et bibtex

titre: Threshold independent evaluation of sound event detection scores
auteur: Janek Ebbers, Reinhold Haeb-Umbach, Romain Serizel
article: ICASSP 2022 – IEEE International Conference on Acoustics, Speech and Signal Processing, May 2022, Singapore, Singapore. ⟨10.1109/ICASSP43922.2022.9747556⟩
Accès au texte intégral et bibtex

titre: Dynamically Refined Regularization for Improving Cross-corpora Hate Speech Detection
auteur: Tulika Bose, Nikolaos Aletras, Irina Illina, Dominique Fohr
article: ACL 2022 – 60th meeting Association for Computational Linguistics Findings, May 2022, Dublin, Ireland. ⟨10.18653/v1/2022.findings-acl.32⟩
Accès au texte intégral et bibtex

titre: On the impact of normalization strategies in unsupervised adversarial domain adaptation for acoustic scene classification
auteur: Michel Olvera, Emmanuel Vincent, Gilles Gasso
article: ICASSP 2022 – IEEE International Conference on Acoustics, Speech and Signal Processing, May 2022, Singapore, Singapore. ⟨10.1109/ICASSP43922.2022.9747540⟩
Accès au texte intégral et bibtex

titre: A benchmark of state-of-the-art sound event detection systems evaluated on synthetic soundscapes
auteur: Francesca Ronchini, Romain Serizel
article: ICASSP 2022 – IEEE International Conference on Acoustics, Speech and Signal Processing, May 2022, Singapore/Virtual, Singapore. ⟨10.1109/ICASSP43922.2022.9747577⟩
Accès au texte intégral et bibtex

titre: The Impact of Removing Head Movements on Audio-visual Speech Enhancement
auteur: Zhiqi Kang, Mostafa Sadeghi, Radu Horaud, Xavier Alameda-Pineda, Jacob Donley, Anurag Kumar
article: ICASSP 2022 – IEEE International Conference on Acoustics, Speech and Signal Processing, IEEE Signal Processing Society, May 2022, Singapore, Singapore. pp.1-5, ⟨10.1109/ICASSP43922.2022.9746401⟩
Accès au texte intégral et bibtex

titre: Perception of German fricatives by French dyslexic subjects
auteur: Stéphanie Deckert, Agnès Piquard-Kipffer, Anne Bonneau
article: New Sounds 2022, 10th International Symposium on the Acquisition of Second Language Speech, Apr 2022, Barcelona, Spain
Accès au bibtex

titre: Evaluation de l’audibilité ressentie des alarmes sonores dans le bruit
auteur: Jean-Pierre Arz, François Effa, Nicolas Grimault, Romain Serizel
article: 16ème Congrès Français d’Acoustique, CFA2022, Société Française d’Acoustique; Laboratoire de Mécanique et d’Acoustique, Apr 2022, Marseille, France
Accès au bibtex

titre: Reconstruction de la forme d’une pièce par super-résolution à l’aide de réponses impulsionnelles
auteur: Tom Sprunck, Khaoula Chahdi, Cédric Foy, Emmanuel Franck, Antoine Deleforge
article: 16ème Congrès Français d’Acoustique, CFA2022, Société Française d’Acoustique; Laboratoire de Mécanique et d’Acoustique, Apr 2022, Marseille, France
Accès au bibtex

titre: Modélisation de la détection d’alarmes sonores dans le bruit
auteur: François Effa, Jean-Pierre Arz, Nicolas Grimault, Ossen El Sawaf, Romain Serizel
article: 16ème Congrès Français d’Acoustique, CFA2022, Société Française d’Acoustique; Laboratoire de Mécanique et d’Acoustique, Apr 2022, Marseille, France
Accès au bibtex

titre: Room Shape Reconstruction Using Acoustic Super-Resolution
auteur: Tom Sprunck, Yannick Privat, Cédric Foy, Antoine Deleforge
article: Proceedings of the 24th International Congress on Acoustics, 2022, Gyeongju, South Korea. ⟨10.1121/10.0001687⟩
Accès au bibtex

Habilitation à diriger des recherches

titre: Contributions to speech processing and ambient sound analysis
auteur: Romain Serizel
article: Computer Science [cs]. Université de Lorraine, 2022
Accès au texte intégral et bibtex

Proceedings

titre: Proceedings of the 7th Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE 2022)
auteur: Mathieu Lagrange, Annamaria Mesaros, Thomas Pellegrini, Gael Richard, Romain Serizel, Dan Stowell
article: Tampere University, pp.1-225, 2022, 978-952-03-2677-7
Accès au texte intégral et bibtex

Theses

titre: Robust sound event detection
auteur: Michel Olvera
article: Computer Science [cs]. Université de Lorraine, 2022. English. ⟨NNT : 2022LORR0324⟩
Accès au texte intégral et bibtex

titre: Expressivity transfer in deep learning based text-to-speech synthesis
auteur: Ajinkya Kulkarni
article: Machine Learning [cs.LG]. Université de Lorraine, 2022. English. ⟨NNT : 2022LORR0122⟩
Accès au texte intégral et bibtex

titre: Expanding the training data for neural network based hate speech classification
auteur: Ashwin Geet d’Sa
article: Computer Science [cs]. Université de Lorraine, 2022. English. ⟨NNT : 2022LORR0081⟩
Accès au texte intégral et bibtex

titre: Leveraging noisy transcriptions for automatic speech recognition
auteur: Adrien Dufraux
article: Computer Science [cs]. Université de Lorraine, 2022. French. ⟨NNT : 2022LORR0032⟩
Accès au texte intégral et bibtex

Preprints, Working Papers, …

titre: The VoicePrivacy 2022 Challenge Evaluation Plan
auteur: Natalia Tomashenko, Xin Wang, Xiaoxiao Miao, Hubert Nourtel, Pierre Champion, Massimiliano Todisco, Emmanuel Vincent, Nicholas Evans, Junichi Yamagishi, Jean-François Bonastre
article: 2022
Accès au texte intégral et bibtex

titre: Supplementary material to the paper The VoicePrivacy 2020 Challenge: Results and findings
auteur: Natalia Tomashenko, Xin Wang, Emmanuel Vincent, Jose Patino, Brij Mohan Lal Srivastava, Paul-Gauthier Noé, Andreas Nautsch, Nicholas Evans, Junichi Yamagishi, Benjamin O’Brien, Anaïs Chanclu, Jean-François Bonastre, Massimiliano Todisco, Mohamed Maouche
article: 2022
Accès au texte intégral et bibtex

titre: Étude d’un algorithme d’optimisation pour le fading temps-fréquence
auteur: Marina Krémé, Bruno Torrésani
article: 2022
Accès au texte intégral et bibtex

titre: Towards an efficient computation of masks for multichannel speech enhancement
auteur: Louis Delebecque, Romain Serizel, Nicolas Furnon
article: 2022
Accès au texte intégral et bibtex

2021

Journal articles

titre: Utterance partitioning for speaker recognition: an experimental review and analysis with new findings under GMM-SVM framework
auteur: Nirmalya Sen, Md Sahidullah, Hemant Patil, Shyamal Kumar das Mandal, Sreenivasa Krothapalli Rao, Tapan Kumar Basu
article: International Journal of Speech Technology, 2021, 24, pp.1067-1088. ⟨10.1007/s10772-021-09862-8⟩
Accès au texte intégral et bibtex

titre: dEchorate: a Calibrated Room Impulse Response Dataset for Echo-aware Signal Processing
auteur: Diego Di Carlo, Pinchas Tandeitnik, Cédric Foy, Nancy Bertin, Antoine Deleforge, Sharon Gannot
article: EURASIP Journal on Audio, Speech, and Music Processing, 2021, 39, ⟨10.1186/s13636-021-00229-0⟩
Accès au texte intégral et bibtex

titre: Multimodal dataset of real-time 2D and static 3D MRI of healthy French speakers
auteur: Karyna Isaieva, Yves Laprie, Justine Leclère, Ioannis K Douros, Jacques Felblinger, Pierre-André Vuissoz
article: Scientific Data, 2021, 8 (1), pp.258. ⟨10.1038/s41597-021-01041-3⟩
Accès au texte intégral et bibtex

titre: A detailed study of the distributed rough set based locality sensitive hashing feature selection technique
auteur: Zaineb Chelly Dagdia, Christine Zarges
article: Fundamenta Informaticae, 2021, 182 (2), pp.111-179. ⟨10.3233/FI-2021-2069⟩
Accès au texte intégral et bibtex

titre: Enabling voice-based apps with European values
auteur: Akira Campbell, Thomas Kleinbauer, Marc Tommasi, Emmanuel Vincent
article: ERCIM News, 2021, 126, pp.38-39
Accès au bibtex

titre: Impact of lip-reading on speech perception in French-speaking children at risk for reading failure assessed from age 5 to 7
auteur: Agnès Piquard-Kipffer, Thalia Cavadini, Liliane Sprenger-Charolles, Edouard Gentaz
article: L’Année psychologique, 2021, 121, pp.3-18. ⟨10.3917/anpsy1.212.0003⟩
Accès au texte intégral et bibtex

titre: Mixture of Inference Networks for VAE-based Audio-visual Speech Enhancement
auteur: Mostafa Sadeghi, Xavier Alameda-Pineda
article: IEEE Transactions on Signal Processing, 2021, 69, pp.1899-1909. ⟨10.1109/TSP.2021.3066038⟩
Accès au texte intégral et bibtex

titre: ASVspoof 2019: Spoofing Countermeasures for the Detection of Synthesized, Converted and Replayed Speech
auteur: Andreas Nautsch, Xin Wang, Nicholas Evans, Tomi Kinnunen, Ville Vestman, Massimiliano Todisco, Hector Delgado, Md Sahidullah, Junichi Yamagishi, Kong Aik Lee
article: IEEE Transactions on Biometrics, Behavior, and Identity Science, 2021, 3 (2), pp.252-265. ⟨10.1109/TBIOM.2021.3059479⟩
Accès au texte intégral et bibtex

titre: Speech Frame Selection for Spoofing Detection with an Application to Partially Spoofed Audio-Data
auteur: Kishore A. Kumar, Dipjyoti Paul, Monisankha Pal, Md Sahidullah, Goutam Saha
article: International Journal of Speech Technology, 2021, ⟨10.1007/s10772-020-09785-w⟩
Accès au texte intégral et bibtex

titre: Mean absorption estimation from room impulse responses using virtually supervised learning
auteur: Cédric Foy, Antoine Deleforge, Diego Di Carlo
article: Journal of the Acoustical Society of America, 2021, 150 (2), pp.1286-1299. ⟨10.1121/10.0005888⟩
Accès au texte intégral et bibtex

titre: Optimizing Multi-Taper Features for Deep Speaker Verification
auteur: Xuechen Liu, Md Sahidullah, Tomi Kinnunen
article: IEEE Signal Processing Letters, 2021, 28, pp.2187-2191. ⟨10.1109/LSP.2021.3122796⟩
Accès au texte intégral et bibtex

titre: DNN-based mask estimation for distributed speech enhancement in spatially unconstrained microphone arrays
auteur: Nicolas Furnon, Romain Serizel, Slim Essid, Irina Illina
article: IEEE/ACM Transactions on Audio, Speech and Language Processing, 2021, 29, pp.2310-2323. ⟨10.1109/TASLP.2021.3092838⟩
Accès au texte intégral et bibtex

titre: Learning emotions latent representation with CVAE for Text-Driven Expressive AudioVisual Speech Synthesis
auteur: Sara Dahmani, Vincent Colotte, Valérian Girard, Slim Ouni
article: Neural Networks, 2021, 141, pp.315-329. ⟨10.1016/j.neunet.2021.04.021⟩
Accès au texte intégral et bibtex

Conference papers

titre: Optimized Power Normalized Cepstral Coefficients Towards Robust Deep Speaker Verification
auteur: Xuechen Liu, Md Sahidullah, Tomi Kinnunen
article: ASRU 2021 – IEEE Automatic Speech Recognition and Understanding Workshop, Dec 2021, Cartagena, Colombia
Accès au texte intégral et bibtex

titre: Parameterized Channel Normalization for Far-field Deep Speaker Verification
auteur: Xuechen Liu, Md Sahidullah, Tomi Kinnunen
article: ASRU 2021 – IEEE Automatic Speech Recognition and Understanding Workshop, Dec 2021, Cartagena, Colombia
Accès au texte intégral et bibtex

titre: On the invertibility of a voice privacy system using embedding alignment
auteur: Pierre Champion, Thomas Thebaud, Gaël Le Lan, Anthony Larcher, Denis Jouvet
article: ASRU 2021 – IEEE Automatic Speech Recognition and Understanding Workshop, Dec 2021, Cartagena, Colombia
Accès au texte intégral et bibtex

titre: Projet LogilecSur : quelles stratégies enseignantes pour guider des élèves sourds vers l’autonomie en compréhension écrite ?
auteur: Manuel Leitao, Elodie Venti, Thomas Sigiez, Christophe Laroche, Marie Perini, Agnès Piquard-Kipffer
article: IDEKI 2021 – 4ème colloque international Didactiques et métiers de l’humain, Dec 2021, Pont-à-Mousson, France
Accès au texte intégral et bibtex

titre: De codes gestuo-manuels à la Langue des Signes Française : usages et enjeux à la maternelle dans le cadre des gestes professionnels inclusifs et des adaptations didactiques
auteur: Olivia Janin, Agnès Piquard-Kipffer
article: IDEKI 2021 – 4ème colloque international Didactiques et métiers de l’humain, IDEKI, Dec 2021, Pont-à-Mousson, France
Accès au texte intégral et bibtex

titre: Automated audio captioning by fine-tuning BART with AudioSet tags
auteur: Félix Gontier, Romain Serizel, Christophe Cerisara
article: DCASE 2021 – 6th Workshop on Detection and Classification of Acoustic Scenes and Events, Nov 2021, Virtual, Spain
Accès au texte intégral et bibtex

titre: Improving Sound Event Detection with Auxiliary Foreground-Background Classification and Domain Adaptation
auteur: Michel Olvera, Emmanuel Vincent, Gilles Gasso
article: DCASE 2021 – 6th Workshop on Detection and Classification of Acoustic Scenes and Events, Nov 2021, Virtual, Spain
Accès au texte intégral et bibtex

titre: The impact of non-target events in synthetic soundscapes for sound event detection
auteur: Francesca Ronchini, Romain Serizel, Nicolas Turpault, Samuele Cornell
article: DCASE 2021 – Detection and Classification of Acoustic Scenes and Events, Nov 2021, Barcelona/Virtual, Spain
Accès au texte intégral et bibtex

titre: Benchmarking and challenges in security and privacy for voice biometrics
auteur: Jean-Francois Bonastre, Hector Delgado, Nicholas Evans, Tomi Kinnunen, Kong Aik Lee, Xuechen Liu, Andreas Nautsch, Paul-Gauthier Noe, Jose Patino, Md Sahidullah, Brij Mohan Lal Srivastava, Massimiliano Todisco, Natalia Tomashenko, Emmanuel Vincent, Xin Wang, Junichi Yamagishi
article: SPSC 2021, 1st ISCA Symposium on Security and Privacy in Speech Communication, ISCA, Nov 2021, Magdeburg, Germany. ⟨10.21437/SPSC.2021-11⟩
Accès au texte intégral et bibtex

titre: Evaluation of Speaker Anonymization on Emotional Speech
auteur: Hubert Nourtel, Pierre Champion, Denis Jouvet, Anthony Larcher, Marie Tahon
article: SPSC 2021 – 1st ISCA Symposium on Security and Privacy in Speech Communication, Nov 2021, Virtual, Germany
Accès au texte intégral et bibtex

titre: Deep Variational Generative Models for Audio-visual Speech Separation
auteur: Viet-Nhat Nguyen, Mostafa Sadeghi, Elisa Ricci, Xavier Alameda-Pineda
article: MLSP 2021 – IEEE International Workshop on Machine Learning for Signal Processing, Oct 2021, Gold Coast, Australia. ⟨10.1109/MLSP52302.2021.9596406⟩
Accès au bibtex

titre: Blind room parameter estimation using multiple multichannel speech recordings
auteur: Prerak Srivastava, Antoine Deleforge, Emmanuel Vincent
article: WASPAA 2021 – IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, Oct 2021, New Paltz, NY, United States
Accès au texte intégral et bibtex

titre: Saladnet: Self-Attentive Multisource Localization in the Ambisonics Domain
auteur: Pierre-Amaury Grumiaux, Srdan Kitić, Prerak Srivastava, Laurent Girin, Alexandre Guérin
article: WASPAA 2021 – IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, Oct 2021, New Paltz / Virtual, United States. pp.336-340, ⟨10.1109/WASPAA52581.2021.9632737⟩
Accès au texte intégral et bibtex

titre: Robust Face Frontalization For Visual Speech Recognition
auteur: Zhiqi Kang, Radu Horaud, Mostafa Sadeghi
article: ICCVW 2021 – International Conference on Computer Vision Workshops, IEEE, Oct 2021, Montreal – Virtual, Canada. pp.2485-2495, ⟨10.1109/ICCVW54120.2021.00281⟩
Accès au texte intégral et bibtex

titre: Du développement du langage aux troubles du langage et des apprentissages, enjeux, défis et perspectives
auteur: Agnès Piquard-Kipffer
article: École Doctorale Sociétés, Communication, Arts, Lettres et Langues, Université Félix Houphouët-Boigny, Oct 2021, Abidjan, Côte d’Ivoire
Accès au bibtex

titre: Covid-19 et port du masque à l’école : mise en difficulté de certains élèves
auteur: Agnès Piquard-Kipffer
article: Journée Scientifique Fédération Charles Hermite “COVID”, Fédération Charles Hermite, Sep 2021, Vandœuvre-lès-Nancy, France
Accès au bibtex

titre: Evaluating X-vector-based Speaker Anonymization under White-box Assessment
auteur: Pierre Champion, Denis Jouvet, Anthony Larcher
article: SPECOM 2021 – 23rd International Conference on Speech and Computer, Sep 2021, Saint Petersburg, Russia
Accès au texte intégral et bibtex

titre: A comparative study of different state-of-the-art NLP models for efficient automatic hate speech detection
auteur: Nicolas Zampieri, Irina Illina, Dominique Fohr
article: Comments, hate speech, disinformation and public communication regulation 2021, Sep 2021, Zagreb, Croatia
Accès au bibtex

titre: ASVspoof 2021: accelerating progress in spoofed and deepfake speech detection
auteur: Junichi Yamagishi, Xin Wang, Massimiliano Todisco, Md Sahidullah, Jose Patino, Andreas Nautsch, Xuechen Liu, Kong Aik Lee, Tomi Kinnunen, Nicholas Evans, Héctor Delgado
article: ASVspoof 2021 Workshop – Automatic Speaker Verification and Spoofing Countermeasures Challenge, Sep 2021, Virtual, France
Accès au texte intégral et bibtex

titre: On Refining BERT Contextualized Embeddings using Semantic Lexicons
auteur: Georgios Zervakis, Emmanuel Vincent, Miguel Couceiro, Marc Schoenauer
article: ECML PKDD 2021 – Machine Learning with Symbolic Methods and Knowledge Graphs co-located with European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, Sep 2021, Online, Spain
Accès au texte intégral et bibtex

titre: Exploring Conditional Language Model Based Data Augmentation Approaches For Hate Speech Classification
auteur: Ashwin Geet d’Sa, Irina Illina, Dominique Fohr, Dietrich Klakow, Dana Ruiter
article: TSD 2021 – 24th International Conference on Text, Speech and Dialogue, Sep 2021, Olomouc, Czech Republic
Accès au texte intégral et bibtex

titre: DNN-based semantic rescoring models for speech recognition
auteur: Irina Illina, Dominique Fohr
article: TSD 2021 – 24th International Conference on Text, Speech and Dialogue, Sep 2021, Olomouc, Czech Republic
Accès au texte intégral et bibtex

titre: Visualizing Classifier Adjacency Relations: A Case Study in Speaker Verification and Voice Anti-Spoofing
auteur: Tomi Kinnunen, Andreas Nautsch, Md Sahidullah, Nicholas Evans, Xin Wang, Massimiliano Todisco, Héctor Delgado, Junichi Yamagishi, Lee Kong Aik
article: INTERSPEECH 2021, Aug 2021, Brno, Czech Republic. ⟨10.21437/Interspeech.2021-1522⟩
Accès au texte intégral et bibtex

titre: Voicing assimilations by French Speakers of German in stop-fricative sequences
auteur: Anne Bonneau
article: INTERSPEECH 2021, Aug 2021, Brno, Czech Republic. ⟨10.21437/Interspeech.2021-601⟩
Accès au texte intégral et bibtex

titre: BERT-based Semantic Model for Rescoring N-best Speech Recognition List
auteur: Dominique Fohr, Irina Illina
article: INTERSPEECH 2021, Aug 2021, Brno, Czech Republic. ⟨10.21437/Interspeech.2021-313⟩
Accès au texte intégral et bibtex

titre: Language recognition on unknown conditions: the LORIA-Inria-MULTISPEECH system for AP20-OLR Challenge
auteur: Raphaël Duroselle, Md Sahidullah, Denis Jouvet, Irina Illina
article: INTERSPEECH 2021, Aug 2021, Brno, Czech Republic. ⟨10.21437/Interspeech.2021-276⟩
Accès au texte intégral et bibtex

titre: Towards the prediction of the vocal tract shape from the sequence of phonemes to be articulated
auteur: Vinicius Ribeiro, Karyna Isaieva, Justine Leclère, Pierre-André Vuissoz, Yves Laprie
article: INTERSPEECH 2021, Aug 2021, Brno, Czech Republic. ⟨10.21437/Interspeech.2021-184⟩
Accès au texte intégral et bibtex

titre: Modeling and training strategies for language recognition systems
auteur: Raphaël Duroselle, Md Sahidullah, Denis Jouvet, Irina Illina
article: INTERSPEECH 2021, Aug 2021, Brno, Czech Republic. ⟨10.21437/Interspeech.2021-277⟩
Accès au texte intégral et bibtex

titre: Data Quality as Predictor of Voice Anti-Spoofing Generalization
auteur: Bhusan Chettri, Rosa González Hautamäki, Md Sahidullah, Tomi Kinnunen
article: INTERSPEECH 2021, Aug 2021, Brno, Czech Republic. ⟨10.21437/Interspeech.2021-1180⟩
Accès au texte intégral et bibtex

titre: Explaining deep learning models for speech enhancement
auteur: Sunit Sivasankaran, Emmanuel Vincent, Dominique Fohr
article: INTERSPEECH 2021, Aug 2021, Brno, Czech Republic. ⟨10.21437/Interspeech.2021-1764⟩
Accès au texte intégral et bibtex

titre: StutterNet: Stuttering Detection Using Time Delay Neural Network
auteur: Shakeel Ahmad Sheikh, Md Sahidullah, Fabrice Hirsch, Slim Ouni
article: EUSIPCO 2021 – 29th European Signal Processing Conference, Aug 2021, Dublin / Virtual, Ireland. ⟨10.23919/EUSIPCO54536.2021.9616063⟩
Accès au texte intégral et bibtex

titre: Improving transfer of expressivity for end-to-end multispeaker text-to-speech synthesis
auteur: Ajinkya Kulkarni, Vincent Colotte, Denis Jouvet
article: EUSIPCO 2021 – 29th European Signal Processing Conference, European Association for Signal Processing (EURASIP), Aug 2021, Dublin / Virtual, Ireland. ⟨10.23919/EUSIPCO54536.2021.9616249⟩
Accès au texte intégral et bibtex

titre: Attention-based distributed speech enhancement for unconstrained microphone arrays with varying number of nodes
auteur: Nicolas Furnon, Romain Serizel, Slim Essid, Irina Illina
article: EUSIPCO 2021 – 29th European Signal Processing Conference, IEEE, Aug 2021, Dublin / Virtual, Ireland. ⟨10.23919/EUSIPCO54536.2021.9616358⟩
Accès au texte intégral et bibtex

titre: Deep scattering network for speech emotion recognition
auteur: Premjeet Singh, Goutam Saha, Md Sahidullah
article: EUSIPCO 2021 – 29th European Signal Processing Conference, Aug 2021, Dublin / Virtual, Ireland. ⟨10.23919/EUSIPCO54536.2021.9615958⟩
Accès au texte intégral et bibtex

titre: Cross-Corpora Language Recognition: A Preliminary Investigation with Indian Languages
auteur: Spandan Dey, Goutam Saha, Md Sahidullah
article: EUSIPCO 2021 – 29th European Signal Processing Conference, Aug 2021, Dublin / Virtual, Ireland. ⟨10.23919/EUSIPCO54536.2021.9616273⟩
Accès au texte intégral et bibtex

titre: Compensate multiple distortions for speaker recognition systems
auteur: Mohammad Mohammadamini, Driss Matrouf, Jean-Francois Bonastre, Romain Serizel, Sandipana Dowerah, Denis Jouvet
article: EUSIPCO 2021 – 29th European Signal Processing Conference, Aug 2021, Dublin / Virtual, Ireland. ⟨10.23919/EUSIPCO54536.2021.9615983⟩
Accès au texte intégral et bibtex

titre: Learning-based estimation of individual absorption profiles from a single room impulse response with known positions of source, sensor and surfaces
auteur: Stéphane Dilungana, Antoine Deleforge, Cédric Foy, Sylvain Faisan
article: 2021 INTER-NOISE and NOISE-CON Congress and Conference, Aug 2021, Internet, United States. pp.5623-5630, ⟨10.3397/IN-2021-3186⟩
Accès au bibtex

titre: Assimilations de voisement et interférences français/allemand
auteur: Anne Bonneau
article: RéaL2 2021 – Colloque International du Réseau d’Acquisition des Langues Secondes, Jul 2021, Toulouse, France
Accès au texte intégral et bibtex

titre: GECko+: a Grammatical and Discourse Error Correction Tool
auteur: Eduardo Calò, Léo Jacqmin, Thibo Rosemplatt, Maxime Amblard, Miguel Couceiro, Ajinkya Kulkarni
article: TALN 2021 – 28e Conférence sur le Traitement Automatique des Langues Naturelles, Jun 2021, Lille / Virtual, France. pp.8-11
Accès au texte intégral et bibtex

titre: A comparative study of different features for efficient automatic hate speech detection
auteur: Nicolas Zampieri, Irina Illina, Dominique Fohr
article: IPrA 2021 – 17th International Pragmatics Conference, Jun 2021, Winterthur, Switzerland
Accès au texte intégral et bibtex

titre: Multiword Expression Features for Automatic Hate Speech Detection
auteur: Nicolas Zampieri, Irina Illina, Dominique Fohr
article: NLDB 2021 – 26th International Conference on Natural Language & Information Systems, Jun 2021, Saarbrücken/Virtual, Germany
Accès au texte intégral et bibtex

titre: Unsupervised Domain Adaptation in Cross-corpora Abusive Language Detection
auteur: Tulika Bose, Irina Illina, Dominique Fohr
article: SocialNLP 2021 – The 9th International Workshop on Natural Language Processing for Social Media, Jun 2021, Virtual, France
Accès au texte intégral et bibtex

titre: Sound Event Detection and Separation: a Benchmark on Desed Synthetic Soundscapes
auteur: Nicolas Turpault, Romain Serizel, Scott Wisdom, Hakan Erdogan, John R Hershey, Eduardo Fonseca, Prem Seetharaman, Justin Salamon
article: ICASSP 2021 – 46th International Conference on Acoustics, Speech, and Signal Processing, Jun 2021, Toronto/Virtual, Canada. ⟨10.1109/ICASSP39728.2021.9414789⟩
Accès au texte intégral et bibtex

titre: What’s All the FUSS About Free Universal Sound Separation Data?
auteur: Scott Wisdom, Hakan Erdogan, Daniel P W Ellis, Romain Serizel, Nicolas Turpault, Eduardo Fonseca, Justin Salamon, Prem Seetharaman, John R Hershey
article: ICASSP 2021 – 46th International Conference on Acoustics, Speech, and Signal Processing, Jun 2021, Toronto/Virtual, Canada. ⟨10.1109/ICASSP39728.2021.9414774⟩
Accès au texte intégral et bibtex

titre: Generalisability of Topic Models in Cross-corpora Abusive Language Detection
auteur: Tulika Bose, Irina Illina, Dominique Fohr
article: NLP4IF 2021 – Workshop Censorship, Disinformation, and Propaganda, Jun 2021, Mexico City / Virtual, Mexico
Accès au texte intégral et bibtex

titre: Switching Variational Auto-Encoders for Noise-Agnostic Audio-visual Speech Enhancement
auteur: Mostafa Sadeghi, Xavier Alameda-Pineda
article: ICASSP 2021 – 46th International Conference on Acoustics, Speech, and Signal Processing, Jun 2021, Toronto / Virtual, Canada. pp.1-5, ⟨10.1109/ICASSP39728.2021.9414097⟩
Accès au texte intégral et bibtex

titre: Distributed speech separation in spatially unconstrained microphone arrays
auteur: Nicolas Furnon, Romain Serizel, Irina Illina, Slim Essid
article: ICASSP 2021 – 46th International Conference on Acoustics, Speech, and Signal Processing, Jun 2021, Toronto / Virtual, Canada. ⟨10.1109/ICASSP39728.2021.9414758⟩
Accès au texte intégral et bibtex

titre: Improving Sound Event Detection Metrics: Insights from DCASE 2020
auteur: Giacomo Ferroni, Nicolas Turpault, Juan Azcarreta, Francesco Tuveri, Romain Serizel, Çagdaş Bilen, Sacha Krstulović
article: ICASSP 2021 – 46th International Conference on Acoustics, Speech, and Signal Processing, Jun 2021, Toronto/Virtual, Canada. ⟨10.1109/ICASSP39728.2021.9414711⟩
Accès au texte intégral et bibtex

titre: Detecting acoustic reflectors using a robot’s ego-noise
auteur: Usama Saqib, Antoine Deleforge, Jesper Rindom Jensen
article: ICASSP 2021 – 46th International Conference on Acoustics, Speech, and Signal Processing, Jun 2021, Toronto / Virtual, Canada. ⟨10.1109/ICASSP39728.2021.9414061⟩
Accès au texte intégral et bibtex

titre: Learnable MFCCs for Speaker Verification
auteur: Xuechen Liu, Md Sahidullah, Tomi Kinnunen
article: ISCAS 2021 – IEEE International Symposium on Circuits and Systems, May 2021, Daegu, South Korea. ⟨10.1109/ISCAS51556.2021.9401593⟩
Accès au texte intégral et bibtex

titre: Non-linear frequency warping using constant-Q transformation for speech emotion recognition
auteur: Premjeet Singh, Goutam Saha, Md Sahidullah
article: ICCCI 2021 – International Conference on Computer Communication and Informatics, Jan 2021, Coimbatore, India. ⟨10.1109/ICCCI50826.2021.9402569⟩
Accès au texte intégral et bibtex

titre: Domain-Dependent Speaker Diarization for the Third DIHARD Challenge
auteur: Kishore A. Kumar, Shefali Waldekar, Goutam Saha, Md Sahidullah
article: DIHARD 2021 – 3rd Speech Diarization Challenge Workshop, Jan 2021, Virtual, France
Accès au texte intégral et bibtex

titre: UIAI System for Short-Duration Speaker Verification Challenge 2020
auteur: Md Sahidullah, Achintya Kumar Sarkar, Ville Vestman, Xuechen Liu, Romain Serizel, Tomi Kinnunen, Zheng-Hua Tan, Emmanuel Vincent
article: SLT 2021 – IEEE Spoken Language Technology Workshop, IEEE, Jan 2021, Shenzhen / Virtual, China. ⟨10.1109/SLT48900.2021.9383596⟩
Accès au texte intégral et bibtex

titre: Foreground-Background Ambient Sound Scene Separation
auteur: Michel Olvera, Emmanuel Vincent, Romain Serizel, Gilles Gasso
article: EUSIPCO 2020 – 28th European Signal Processing Conference, Jan 2021, Amsterdam / Virtual, Netherlands. ⟨10.23919/Eusipco47968.2020.9287436⟩
Accès au texte intégral et bibtex

titre: MRI Vocal Tract Sagittal Slices Estimation during Speech Production of CV
auteur: Ioannis K Douros, Ajinkya Kulkarni, Yu Xie, Chrysanthi Dourou, Jacques Felblinger, Karyna Isaieva, Pierre-André Vuissoz, Yves Laprie
article: EUSIPCO 2020 – 28th European Signal Processing Conference, Jan 2021, Amsterdam / Virtual, Netherlands. ⟨10.23919/Eusipco47968.2020.9287834⟩
Accès au texte intégral et bibtex

titre: Analyzing the impact of speaker localization errors on speech separation for automatic speech recognition
auteur: Sunit Sivasankaran, Emmanuel Vincent, Dominique Fohr
article: EUSIPCO 2020 – 28th European Signal Processing Conference, Jan 2021, Amsterdam / Virtual, Netherlands. ⟨10.23919/Eusipco47968.2020.9287541⟩
Accès au texte intégral et bibtex

Book sections

titre: Histoire des machines parlantes
auteur: Benjamin Elie, Camille Fauth, Melissa Barkat-Defradas
article: Christelle Dodane; Claudia Schweitzer. HISTOIRE DE LA DESCRIPTION DE LA PAROLE : DE L’INTROSPECTION À L’INSTRUMENTATION, Honoré Champion, 2021, 9782745355959
Accès au bibtex

Patents

titre: Audio-driven speech animation using recurrent neural network
auteur: Slim Ouni, Théo Biasutto-Lervat, Sara Dahmani
article: United States, Patent n° : WO2021023861. 2021
Accès au bibtex

Theses

titre: Deep-learning based speech enhancement with ad-hoc microphone arrays
auteur: Nicolas Furnon
article: Informatique [cs]. Université de Lorraine, 2021. Français. ⟨NNT : 2021LORR0277⟩
Accès au texte intégral et bibtex

titre: Robustness of language recognition system to transmission channel
auteur: Raphaël Duroselle
article: Computer Science [cs]. Université de Lorraine, 2021. English. ⟨NNT : 2021LORR0250⟩
Accès au texte intégral et bibtex

titre: Implicit and explicit phase modeling in deep learning-based source separation
auteur: Manuel Pariente
article: Machine Learning [stat.ML]. Université de Lorraine, 2021. English. ⟨NNT : 2021LORR0150⟩
Accès au texte intégral et bibtex

titre: Analysis of scientific challenges in ambient sound recognition in real environments
auteur: Nicolas Turpault
article: Informatique [cs]. Université de Lorraine, 2021. Français. ⟨NNT : 2021LORR0108⟩
Accès au texte intégral et bibtex

titre: Multimodal Coarticulation Modeling: Towards the animation of an intelligible talking head
auteur: Théo Biasutto-Lervat
article: Intelligence artificielle [cs.AI]. Université de Lorraine, 2021. Français. ⟨NNT : 2021LORR0019⟩
Accès au texte intégral et bibtex

Preprints, Working Papers, …

titre: SAMbA: Speech enhancement with Asynchronous ad-hoc Microphone Arrays
auteur: Nicolas Furnon, Romain Serizel, Slim Essid, Irina Illina
article: 2021
Accès au texte intégral et bibtex

titre: Multichannel Speech Enhancement for Speaker Verification in Noisy and Reverberant Environments
auteur: Sandipana Dowerah, Romain Serizel, Denis Jouvet, Mohammad Mohammadamini, Driss Matrouf
article: 2021
Accès au bibtex

titre: Analysis of weak labels for sound event tagging
auteur: Nicolas Turpault, Romain Serizel, Emmanuel Vincent
article: 2021
Accès au texte intégral et bibtex

titre: ABSP System for The Third DIHARD Challenge
auteur: Kishore A. Kumar, Shefali Waldekar, Goutam Saha, Md Sahidullah
article: 2021
Accès au texte intégral et bibtex

2020

Journal articles

titre: Classification of Hate Speech Using Deep Neural Networks
auteur: Ashwin Geet d’Sa, Irina Illina, Dominique Fohr
article: Revue d’Information Scientifique & Technique, 2020, From Data and Information Processing to Knowledge Organization: Architectures, Models and Systems, 25 (01)
Accès au texte intégral et bibtex

titre: Peut-on faire confiance aux IA ?
auteur: Emmanuel Vincent
article: The Conversation France, 2020
Accès au bibtex

titre: Duration modelling and evaluation for Arabic statistical parametric speech synthesis
auteur: Imene Zangar, Zied Mnasri, Vincent Colotte, Denis Jouvet
article: Multimedia Tools and Applications, 2020, ⟨10.1007/s11042-020-09901-7⟩
Accès au texte intégral et bibtex

titre: ASVspoof 2019: A large-scale public database of synthesized, converted and replayed speech
auteur: Xin Wang, Junichi Yamagishi, Massimiliano Todisco, Héctor Delgado, Andreas Nautsch, Nicholas Evans, Md Sahidullah, Ville Vestman, Tomi Kinnunen, Kong Aik Lee, Lauri Juvela, Paavo Alku, Yu-Huai Peng, Hsin-Te Hwang, Yu Tsao, Hsin-Min Wang, Sébastien Le Maguer, Markus Becker, Fergus Henderson, Rob Clark, Yu Zhang, Quan Wang, Ye Jia, Kai Onuma, Koji Mushika, Takashi Kaneda, Yuan Jiang, Li-Juan Liu, Yi-Chiao Wu, Wen-Chin Huang, Tomoki Toda, Kou Tanaka, Hirokazu Kameoka, Ingmar Steiner, Driss Matrouf, Jean-François Bonastre, Avashna Govender, Srikanth Ronanki, Jing-Xuan Zhang, Zhen-Hua Ling
article: Computer Speech and Language, 2020, 64, pp.101114. ⟨10.1016/j.csl.2020.101114⟩
Accès au texte intégral et bibtex

titre: Automatic Tongue Delineation from MRI Images with a Convolutional Neural Network Approach
auteur: Karyna Isaieva, Yves Laprie, Nicolas Turpault, Alexis Houssard, Jacques Felblinger, Pierre-André Vuissoz
article: Applied Artificial Intelligence, 2020, 34 (14), pp.1115-1123. ⟨10.1080/08839514.2020.1824090⟩
Accès au texte intégral et bibtex

titre: Optimization of data-driven filterbank for automatic speaker verification
auteur: Susanta Sarangi, Md Sahidullah, Goutam Saha
article: Digital Signal Processing, 2020, 104, ⟨10.1016/j.dsp.2020.102795⟩
Accès au texte intégral et bibtex

titre: Some consideration on expressive audiovisual speech corpus acquisition using a multimodal platform
auteur: Sara Dahmani, Vincent Colotte, Slim Ouni
article: Language Resources and Evaluation, 2020, ⟨10.1007/s10579-020-09500-w⟩
Accès au texte intégral et bibtex

titre: Joint NN-Supported Multichannel Reduction of Acoustic Echo, Reverberation and Noise
auteur: Guillaume Carbajal, Romain Serizel, Emmanuel Vincent, Eric Humbert
article: IEEE/ACM Transactions on Audio, Speech and Language Processing, 2020, ⟨10.1109/TASLP.2020.3008974⟩
Accès au texte intégral et bibtex

titre: Tandem Assessment of Spoofing Countermeasures and Automatic Speaker Verification: Fundamentals
auteur: Tomi Kinnunen, Héctor Delgado, Nicholas Evans, Kong-Aik Lee, Ville Vestman, Andreas Nautsch, Massimiliano Todisco, Xin Wang, Md Sahidullah, Junichi Yamagishi, Douglas A Reynolds
article: IEEE/ACM Transactions on Audio, Speech and Language Processing, 2020, 28, pp.2195-2210. ⟨10.1109/TASLP.2020.3009494⟩
Accès au texte intégral et bibtex

titre: A scalable and effective rough set theory-based approach for big data pre-processing
auteur: Zaineb Chelly Dagdia, Christine Zarges, Gaël Beck, Mustapha Lebbah
article: Knowledge and Information Systems (KAIS), 2020, 62 (8), pp.3321-3386. ⟨10.1007/s10115-020-01467-y⟩
Accès au texte intégral et bibtex

titre: Measurement of Tongue Tip Velocity from Real-Time MRI and Phase-Contrast Cine-MRI in Consonant Production
auteur: Karyna Isaieva, Yves Laprie, Freddy Odille, Ioannis K Douros, Jacques Felblinger, Pierre-André Pav Vuissoz
article: Journal of Imaging, 2020, 6 (5), pp.31. ⟨10.3390/jimaging6050031⟩
Accès au texte intégral et bibtex

titre: On the Use of Artificial Malicious Patterns for Android Malware Detection
auteur: Manel Jerbi, Zaineb Chelly Dagdia, Slim Bechikh, Mohamed Makhlouf, Lamjed Ben Said
article: Computers & Security, 2020, 92, pp.101743. ⟨10.1016/j.cose.2020.101743⟩
Accès au texte intégral et bibtex

titre: Separation of Alpha-Stable Random Vectors
auteur: Mathieu Fontaine, Roland Badeau, Antoine Liutkus
article: Signal Processing, 2020, pp.107465. ⟨10.1016/j.sigpro.2020.107465⟩
Accès au texte intégral et bibtex

titre: RNN Language Model Estimation for Out-of-Vocabulary Words
auteur: Irina Illina, Dominique Fohr
article: Lecture Notes in Artificial Intelligence, 2020, 12598, ⟨10.1007/978-3-030-66527-2_15⟩
Accès au texte intégral et bibtex

Conference papers

titre: Tracking the tongue contours in rt-MRI films with an autoencoder DNN approach
auteur: Karyna Isaieva, Yves Laprie, Alexis Houssard, Jacques Felblinger, Pierre-André Vuissoz
article: ISSP 2020 – 12th International Seminar on Speech Production, Dec 2020, Providence / Virtual, United States
Accès au texte intégral et bibtex

titre: Synthesize MRI vocal tract data during CV production
auteur: Ioannis K Douros, Chrysanthi Dourou, Yu Xie, Jacques Felblinger, Karyna Isaieva, Pierre-André Vuissoz, Yves Laprie
article: ISSP 2020 – 12th International Seminar on Speech Production, Dec 2020, Providence / Virtual, United States
Accès au texte intégral et bibtex

titre: F1 and F2 measurements for French oral vowel with a new pneumotachograph mask
auteur: Amélie Elmerich, Angelique Amelot, Shinji Maeda, Yves Laprie, Jean Francois Papon, Lise Crevier-Buchman
article: ISSP 2020 – 12th International Seminar on Speech Production, Dec 2020, Providence / Virtual, United States
Accès au texte intégral et bibtex

titre: DNN-Based Parametric Speech Synthesis Enhanced With Articulatory Information
auteur: Anastasiia Tsukanova, Ioannis K Douros, Yves Laprie
article: ISSP 2020 – 12th International Seminar on Speech Production, Dec 2020, Providence / Virtual, United States
Accès au texte intégral et bibtex

titre: Vocal tract sagittal slices estimation from MRI midsagittal slices during speech production of CV
auteur: Ioannis K Douros, Yu Xie, Chrysanthi Dourou, Jacques Felblinger, Karyna Isaieva, Pierre-André Vuissoz, Yves Laprie
article: ISSP 2020 – 12th International Seminar on Speech Production, Dec 2020, Providence / Virtual, United States
Accès au texte intégral et bibtex

titre: Mean Absorption Coefficient Estimation From Impulse Responses: Deep Learning vs. Sabine
auteur: Corto Bastien, Antoine Deleforge, Cédric Foy
article: E-FA 2020 – Forum Acusticum 2020, Dec 2020, Lyon / Virtual, France. pp.2, ⟨10.48465/fa.2020.0785⟩
Accès au texte intégral et bibtex

titre: Label Propagation-Based Semi-Supervised Learning for Hate Speech Classification
auteur: Ashwin Geet d’Sa, Irina Illina, Dominique Fohr, Dietrich Klakow, Dana Ruiter
article: Insights from Negative Results Workshop, EMNLP 2020, Nov 2020, Punta Cana, Dominican Republic
Accès au texte intégral et bibtex

titre: A Study of F0 Modification for X-Vector Based Speech Pseudo-Anonymization Across Gender
auteur: Pierre Champion, Denis Jouvet, Anthony Larcher
article: The Second AAAI Workshop on Privacy-Preserving Artificial Intelligence (PPAI), Nov 2020, online, United States
Accès au texte intégral et bibtex

titre: Task-Aware Separation for the DCASE 2020 Task 4 Sound Event Detection and Separation Challenge
auteur: Samuele Cornell, Michel Olvera, Manuel Pariente, Giovanni Pepe, Emanuele Principi, Leonardo Gabrielli, Stefano Squartini
article: DCASE 2020 – 5th Workshop on Detection and Classification of Acoustic Scenes and Events, Nov 2020, Virtual, Japan
Accès au texte intégral et bibtex

titre: Domain-Adversarial Training and Trainable Parallel Front-end for the DCASE 2020 Task 4 Sound Event Detection Challenge
auteur: Samuele Cornell, Michel Olvera, Manuel Pariente, Giovanni Pepe, Emanuele Principi, Leonardo Gabrielli, Stefano Squartini
article: DCASE 2020 – 5th Workshop on Detection and Classification of Acoustic Scenes and Events, Nov 2020, Virtual, Japan
Accès au texte intégral et bibtex

titre: Unsupervised regularization of the embedding extractor for robust language identification
auteur: Raphaël Duroselle, Denis Jouvet, Irina Illina
article: Odyssey 2020 – The Speaker and Language Recognition Workshop, Nov 2020, Tokyo, Japan
Accès au texte intégral et bibtex

titre: Improving Sound Event Detection In Domestic Environments Using Sound Separation
auteur: Nicolas Turpault, Scott Wisdom, Hakan Erdogan, John R Hershey, Romain Serizel, Eduardo Fonseca, Prem Seetharaman, Justin Salamon
article: DCASE Workshop 2020 – Detection and Classification of Acoustic Scenes and Events, Nov 2020, Tokyo / Virtual, Japan
Accès au texte intégral et bibtex

titre: HUMAN: Hierarchical Universal Modular ANnotator
auteur: Moritz Wolf, Dana Ruiter, Ashwin Geet d’Sa, Liane Reiners, Jan Alexandersson, Dietrich Klakow
article: EMNLP 2020 System Demonstration, Nov 2020, Punta Cana (Virtual), Dominican Republic
Accès au bibtex

titre: Training Sound Event Detection On A Heterogeneous Dataset
auteur: Nicolas Turpault, Romain Serizel
article: DCASE Workshop, Nov 2020, Tokyo, Japan
Accès au texte intégral et bibtex

titre: Metric learning loss functions to reduce domain mismatch in the x-vector space for language recognition
auteur: Raphaël Duroselle, Denis Jouvet, Irina Illina
article: INTERSPEECH 2020, Oct 2020, Shanghai / Virtual, China
Accès au texte intégral et bibtex

titre: Transfer learning of the expressivity using flow metric learning in multispeaker text-to-speech synthesis
auteur: Ajinkya Kulkarni, Vincent Colotte, Denis Jouvet
article: INTERSPEECH 2020, Oct 2020, Shanghai / Virtual, China
Accès au texte intégral et bibtex

titre: Correlation between prosody and pragmatics: case study of discourse markers in French and English
auteur: Lou Lee, Denis Jouvet, Katarina Bartkova, Yvon Keromnes, Mathilde Dargnat
article: INTERSPEECH 2020, Oct 2020, Shanghai, China
Accès au texte intégral et bibtex

titre: Introducing the VoicePrivacy initiative
auteur: Natalia Tomashenko, Brij Mohan Lal Srivastava, Xin Wang, Emmanuel Vincent, Andreas Nautsch, Junichi Yamagishi, Nicholas Evans, Jose Patino, Jean-François Bonastre, Paul-Gauthier Noé, Massimiliano Todisco
article: INTERSPEECH 2020, Oct 2020, Shanghai, China
Accès au texte intégral et bibtex

titre: Kaldi-web: An installation-free, on-device speech recognition system
auteur: Mathieu Hu, Laurent Pierron, Emmanuel Vincent, Denis Jouvet
article: INTERSPEECH 2020 Show & Tell, Oct 2020, Shanghai, China
Accès au texte intégral et bibtex

titre: Design Choices for X-vector Based Speaker Anonymization
auteur: Brij Mohan Lal Srivastava, Natalia Tomashenko, Xin Wang, Emmanuel Vincent, Junichi Yamagishi, Mohamed Maouche, Aurélien Bellet, Marc Tommasi
article: INTERSPEECH 2020, International Speech Communication Association (ISCA), Oct 2020, Shanghai, China
Accès au texte intégral et bibtex

titre: Achieving Multi-Accent ASR via Unsupervised Acoustic Model Adaptation
auteur: Mehmet Ali Tuğtekin Turan, Emmanuel Vincent, Denis Jouvet
article: INTERSPEECH 2020, Oct 2020, Shanghai, China
Accès au texte intégral et bibtex

titre: A Comparative Re-Assessment of Feature Extractors for Deep Speaker Embeddings
auteur: Xuechen Liu, Md Sahidullah, Tomi Kinnunen
article: INTERSPEECH 2020, Oct 2020, Shanghai, China
Accès au texte intégral et bibtex

titre: Asteroid: the PyTorch-based audio source separation toolkit for researchers
auteur: Manuel Pariente, Samuele Cornell, Joris Cosentino, Sunit Sivasankaran, Efthymios Tzinis, Jens Heitkaemper, Michel Olvera, Fabian-Robert Stöter, Mathieu Hu, Juan M. Martín-Doñas, David Ditter, Ariel Frank, Antoine Deleforge, Emmanuel Vincent
article: Interspeech 2020, Oct 2020, Shanghai, China
Accès au texte intégral et bibtex

titre: Using Silence MR Image to Synthesise Dynamic MRI Vocal Tract Data of CV
auteur: Ioannis K Douros, Ajinkya Kulkarni, Chrysanthi Dourou, Yu Xie, Jacques Felblinger, Karyna Isaieva, Pierre-André Vuissoz, Yves Laprie
article: INTERSPEECH 2020, Oct 2020, Shanghai / Virtual, China
Accès au texte intégral et bibtex

titre: Detecting and counting overlapping speakers in distant speech scenarios
auteur: Samuele Cornell, Maurizio Omologo, Stefano Squartini, Emmanuel Vincent
article: INTERSPEECH 2020, Oct 2020, Shanghai, China
Accès au texte intégral et bibtex

titre: On semi-supervised LF-MMI training of acoustic models with limited data
auteur: Imran Sheikh, Emmanuel Vincent, Irina Illina
article: INTERSPEECH 2020, Oct 2020, Shanghai, China
Accès au texte intégral et bibtex

titre: A comparative study of speech anonymization metrics
auteur: Mohamed Maouche, Brij Mohan Lal Srivastava, Nathalie Vauquier, Aurélien Bellet, Marc Tommasi, Emmanuel Vincent
article: INTERSPEECH 2020, Oct 2020, Shanghai, China
Accès au texte intégral et bibtex

titre: Drone audition for search and rescue: Datasets and challenges
auteur: Antoine Deleforge
article: QUIET DRONES International Symposium on UAV/UAS Noise, Oct 2020, Paris, France
Accès au texte intégral et bibtex

titre: Deep variational metric learning for transfer of expressivity in multispeaker text to Speech
auteur: Ajinkya Kulkarni, Vincent Colotte, Denis Jouvet
article: SLSP 2020 – 8th International Conference on Statistical Language and Speech Processing, Oct 2020, Cardiff / Virtual, United Kingdom
Accès au texte intégral et bibtex

titre: Introduction of semantic model to help speech recognition
auteur: Stephane Level, Irina Illina, Dominique Fohr
article: TSD 2020 – Twenty-third International Conference on Text, Speech and Dialogue, Sep 2020, Brno, Czech Republic
Accès au texte intégral et bibtex

titre: Embedding Formal Contexts Using Unordered Composition
auteur: Esteban Marquer, Ajinkya Kulkarni, Miguel Couceiro
article: FCA4AI – 8th International Workshop “What can FCA do for Artificial Intelligence?” (colocated with ECAI2020), Aug 2020, Santiago de Compostela, Spain
Accès au texte intégral et bibtex

titre: Adaptation de domaine non supervisée pour la reconnaissance de la langue par régularisation d’un réseau de neurones
auteur: Raphaël Duroselle, Denis Jouvet, Irina Illina
article: 6e conférence conjointe Journées d’Études sur la Parole (JEP, 33e édition), Traitement Automatique des Langues Naturelles (TALN, 27e édition), Rencontre des Étudiants Chercheurs en Informatique pour le Traitement Automatique des Langues (RÉCITAL, 22e édition). Volume 1 : Journées d’Études sur la Parole, Jun 2020, Nancy, France. pp.190-198
Accès au texte intégral et bibtex

titre: AMIS project : automatic summarization and translation of video
auteur: Mohamed Amine Menacer, Dominique Fohr, Denis Jouvet, Karima Abidi, David Langlois, Kamel Smaïli
article: 6e conférence conjointe Journées d’Études sur la Parole (JEP, 33e édition), Traitement Automatique des Langues Naturelles (TALN, 27e édition), Rencontre des Étudiants Chercheurs en Informatique pour le Traitement Automatique des Langues (RÉCITAL, 22e édition). Volume 4 : Démonstrations et résumés d’articles internationaux, Jun 2020, Nancy, France. pp.53-56
Accès au texte intégral et bibtex

titre: Introduction d’informations sémantiques dans un système de reconnaissance de la parole
auteur: Stephane Level, Irina Illina, Dominique Fohr
article: 6e conférence conjointe Journées d’Études sur la Parole (JEP, 33e édition), Traitement Automatique des Langues Naturelles (TALN, 27e édition), Rencontre des Étudiants Chercheurs en Informatique pour le Traitement Automatique des Langues (RÉCITAL, 22e édition). Volume 1 : Journées d’Études sur la Parole, Jun 2020, Nancy, France. pp.362-369
Accès au texte intégral et bibtex

titre: Étude comparative des paramètres d’entrée pour la synthèse expressive audiovisuelle de la parole par DNNs
auteur: Sara Dahmani, Vincent Colotte, Slim Ouni
article: 6e conférence conjointe Journées d’Études sur la Parole (JEP, 33e édition), Traitement Automatique des Langues Naturelles (TALN, 27e édition), Rencontre des Étudiants Chercheurs en Informatique pour le Traitement Automatique des Langues (RÉCITAL, 22e édition). Volume 1 : Journées d’Études sur la Parole, Jun 2020, Nancy, France. pp.127-135
Accès au texte intégral et bibtex

titre: Étude comparative de corrélats prosodiques de marqueurs discursifs français et anglais selon leur fonction pragmatique
auteur: Lou Lee, Denis Jouvet, Katarina Bartkova, Yvon Keromnes, Mathilde Dargnat
article: 6e conférence conjointe Journées d’Études sur la Parole (JEP, 33e édition), Traitement Automatique des Langues Naturelles (TALN, 27e édition), Rencontre des Étudiants Chercheurs en Informatique pour le Traitement Automatique des Langues (RÉCITAL, 22e édition). Volume 1 : Journées d’Études sur la Parole, Jun 2020, Nancy, France. pp.335-343
Accès au texte intégral et bibtex

titre: Towards Non-Toxic Landscapes: Automatic Toxic Comment Detection Using DNN
auteur: Ashwin Geet d’Sa, Irina Illina, Dominique Fohr
article: TRAC-2020, Second Workshop on Trolling, Aggression and Cyberbullying (LREC, 2020), May 2020, Marseille, France
Accès au texte intégral et bibtex

titre: DNN-Based Distributed Multichannel Mask Estimation for Speech Enhancement in Microphone Arrays
auteur: Nicolas Furnon, Romain Serizel, Irina Illina, Slim Essid
article: ICASSP 2020 – 45th International Conference on Acoustics, Speech, and Signal Processing, May 2020, Barcelona, Spain
Accès au texte intégral et bibtex

titre: SLOGD: Speaker Location Guided Deflation Approach to Speech Separation
auteur: Sunit Sivasankaran, Emmanuel Vincent, Dominique Fohr
article: ICASSP 2020 – 45th International Conference on Acoustics, Speech, and Signal Processing, May 2020, Barcelona, Spain
Accès au texte intégral et bibtex

titre: Sound event detection in synthetic domestic environments
auteur: Romain Serizel, Nicolas Turpault, Ankit Shah, Justin Salamon
article: ICASSP 2020 – 45th International Conference on Acoustics, Speech, and Signal Processing, May 2020, Barcelona, Spain
Accès au texte intégral et bibtex

titre: Evaluating Voice Conversion-based Privacy Protection against Informed Attackers
auteur: Brij Mohan Lal Srivastava, Nathalie Vauquier, Md Sahidullah, Aurélien Bellet, Marc Tommasi, Emmanuel Vincent
article: ICASSP 2020 – 45th International Conference on Acoustics, Speech, and Signal Processing, IEEE Signal Processing Society, May 2020, Barcelona, Spain. pp.2802-2806
Accès au texte intégral et bibtex

titre: Limitations of weak labels for embedding and tagging
auteur: Nicolas Turpault, Romain Serizel, Emmanuel Vincent
article: ICASSP 2020 – 45th International Conference on Acoustics, Speech, and Signal Processing, May 2020, Barcelona, Spain
Accès au texte intégral et bibtex

titre: Filterbank design for end-to-end speech separation
auteur: Manuel Pariente, Samuele Cornell, Antoine Deleforge, Emmanuel Vincent
article: ICASSP 2020 – 45th International Conference on Acoustics, Speech, and Signal Processing, May 2020, Barcelona, Spain
Accès au texte intégral et bibtex

titre: CHiME-6 Challenge: Tackling multispeaker speech recognition for unsegmented recordings
auteur: Shinji Watanabe, Michael Mandel, Jon Barker, Emmanuel Vincent, Ashish Arora, Xuankai Chang, Sanjeev Khudanpur, Vimal Manohar, Daniel Povey, Desh Raj, David Snyder, Aswin Shanmugam Subramanian, Jan Trmal, Bar Ben Yair, Christoph Boeddeker, Zhaoheng Ni, Yusuke Fujita, Shota Horiguchi, Naoyuki Kanda, Takuya Yoshioka, Neville Ryant
article: CHiME 2020 – 6th International Workshop on Speech Processing in Everyday Environments, May 2020, Barcelona / Virtual, Spain
Accès au texte intégral et bibtex

titre: BLASTER: An Off-Grid Method for Blind and Regularized Acoustic Echoes Retrieval — with supplementary material
auteur: Diego Di Carlo, Clément Elvira, Antoine Deleforge, Nancy Bertin, Rémi Gribonval
article: ICASSP 2020 – IEEE International Conference on Acoustics, Speech and Signal Processing, IEEE, May 2020, Barcelona, Spain
Accès au texte intégral et bibtex

titre: Automatic rule extraction from access rules using Genetic Programming
auteur: Paloma de Las Cuevas, Pablo Garcia-Sanchez, Zaineb Chelly Dagdia, Maria-Isabel Garcia-Arenas, Juan Julian Merelo
article: EvoCOP 2020 – 20th European Conference on Evolutionary Computation in Combinatorial Optimisation, Apr 2020, Seville, Spain
Accès au texte intégral et bibtex

titre: Semantic Context Model for Efficient Speech Recognition
auteur: Stephane Level, Irina Illina, Dominique Fohr
article: ICCAS 2020 – The first International Conference on Cognitive Aircraft Systems, Mar 2020, Toulouse, France
Accès au bibtex

titre: BERT and fastText Embeddings for Automatic Detection of Toxic Speech
auteur: Ashwin Geet d’Sa, Irina Illina, Dominique Fohr
article: SIIE 2020 – Information Systems and Economic Intelligence; International Multi-Conference on: “Organization of Knowledge and Advanced Technologies” (OCTA), Feb 2020, Tunis, Tunisia
Accès au texte intégral et bibtex

titre: A brief introduction to multichannel noise reduction with deep neural networks
auteur: Romain Serizel
article: SpiN 2020 – 12th Speech in Noise Workshop, Jan 2020, Toulouse, France
Accès au texte intégral et bibtex

titre: Reconnaissance automatique de la parole : génération des prononciations non natives pour l’enrichissement du lexique
auteur: Ismael Bada, Dominique Fohr, Irina Illina
article: 6e conférence conjointe Journées d’Études sur la Parole (JEP, 33e édition), Traitement Automatique des Langues Naturelles (TALN, 27e édition), Rencontre des Étudiants Chercheurs en Informatique pour le Traitement Automatique des Langues (RÉCITAL, 22e édition). Volume 1 : Journées d’Études sur la Parole, 2020, Nancy, France. pp.27-35
Accès au texte intégral et bibtex

Book sections

titre: Importance of Dataspace Embeddings when Evaluating Text Clustering Methods
auteur: Alain Lelu, Martine Cadot
article: Data Analysis and Rationality in a Complex World, In press
Accès au texte intégral et bibtex

titre: When Evolutionary Computing Meets Astro- and Geoinformatics
auteur: Zaineb Chelly Dagdia, Miroslav Mirchev
article: Knowledge Discovery in Big Data from Astronomy and Earth Observation, pp.283-306, 2020
Accès au texte intégral et bibtex

Proceedings

titre: Actes de la 6e conférence conjointe Journées d’Études sur la Parole (JEP, 33e édition), Traitement Automatique des Langues Naturelles (TALN, 27e édition), Rencontre des Étudiants Chercheurs en Informatique pour le Traitement Automatique des Langues (RÉCITAL, 22e édition). Volume 2 : Traitement Automatique des Langues Naturelles
auteur: Christophe Benzitoun, Chloé Braud, Laurine Huber, David Langlois, Slim Ouni, Sylvain Pogodalla, Stéphane Schneider
article: 2 : Traitement Automatique des Langues Naturelles, ATALA; AFCP, pp.1-395, 2020
Accès au texte intégral et bibtex

titre: Actes de la 6e conférence conjointe Journées d’Études sur la Parole (JEP, 33e édition), Traitement Automatique des Langues Naturelles (TALN, 27e édition), Rencontre des Étudiants Chercheurs en Informatique pour le Traitement Automatique des Langues (RÉCITAL, 22e édition). Volume 1 : Journées d’Études sur la Parole
auteur: Christophe Benzitoun, Chloé Braud, Laurine Huber, David Langlois, Slim Ouni, Sylvain Pogodalla, Stéphane Schneider
article: 1 : Journées d’Études sur la Parole, ATALA; AFCP, 2020
Accès au texte intégral et bibtex

titre: Actes de la 6e conférence conjointe Journées d’Études sur la Parole (JEP, 33e édition), Traitement Automatique des Langues Naturelles (TALN, 27e édition), Rencontre des Étudiants Chercheurs en Informatique pour le Traitement Automatique des Langues (RÉCITAL, 22e édition). Volume 4 : Démonstrations et résumés d’articles internationaux
auteur: Christophe Benzitoun, Chloé Braud, Laurine Huber, David Langlois, Slim Ouni, Sylvain Pogodalla, Stéphane Schneider
article: 4 : Démonstrations et résumés d’articles internationaux, ATALA; AFCP, pp.1-88, 2020
Accès au texte intégral et bibtex

titre: Actes de la 6e conférence conjointe Journées d’Études sur la Parole (JEP, 33e édition), Traitement Automatique des Langues Naturelles (TALN, 27e édition), Rencontre des Étudiants Chercheurs en Informatique pour le Traitement Automatique des Langues (RÉCITAL, 22e édition). Volume 3 : Rencontre des Étudiants Chercheurs en Informatique pour le TAL
auteur: Christophe Benzitoun, Chloé Braud, Laurine Huber, David Langlois, Slim Ouni, Sylvain Pogodalla, Stéphane Schneider
article: 3 : Rencontre des Étudiants Chercheurs en Informatique pour le TAL, ATALA; AFCP, pp.1-230, 2020
Accès au texte intégral et bibtex

Reports

titre: Speaker information modification in the VoicePrivacy 2020 toolchain
auteur: Pierre Champion, Denis Jouvet, Anthony Larcher
article: [Research Report] INRIA Nancy, équipe Multispeech; LIUM – Laboratoire d’Informatique de l’Université du Mans. 2020
Accès au texte intégral et bibtex

titre: The VoicePrivacy 2020 Challenge Evaluation Plan
auteur: Natalia Tomashenko, Brij Mohan Lal Srivastava, Xin Wang, Emmanuel Vincent, Andreas Nautsch, Junichi Yamagishi, Nicholas Evans, Jose Patino, Jean-François Bonastre, Paul-Gauthier Noé, Massimiliano Todisco
article: LIA – Laboratoire Informatique d’Avignon; MULTISPEECH – Speech Modeling for Facilitating Oral-Based Communication Inria Nancy – Grand Est, LORIA – NLPKD – Department of Natural Language Processing & Knowledge Discovery; Eurecom [Sophia Antipolis]; University of Edinburgh. 2020
Accès au texte intégral et bibtex

Software

titre: voiceHome-2 corpus – automatic speech recognition baseline – scripts
auteur: Sunit Sivasankaran, Irina Illina, Emmanuel Vincent
article: 2020, ⟨swh:1:dir:e61ed9084af0d3e8542cd4ab3a990d24314a6724;origin=https://hal.archives-ouvertes.fr/hal-02963802;visit=swh:1:snp:b958e3aa64f6b1663929789c8cf28d019f55f57d;anchor=swh:1:rev:6b9bf3964385d0c16d262796d9e4a3a30a52dafd;path=/⟩
Accès au texte intégral et bibtex

Theses

titre: Echo-aware signal processing for audio scene analysis
auteur: Diego Di Carlo
article: Signal and Image processing. UNIVERSITÉ DE RENNES 1; INRIA – IRISA – PANAMA, 2020. English.
Accès au texte intégral et bibtex

titre: Automatic speech recognition and machine translation of Arabic and dialectal videos
auteur: Mohamed Amine Menacer
article: Computation and Language [cs.CL]. Université de Lorraine, 2020. French. ⟨NNT : 2020LORR0157⟩
Accès au texte intégral et bibtex

titre: Audiovisual synthesis of expressive speech: modeling of emotions with deep learning
auteur: Sara Dahmani
article: Computer Science [cs]. Université de Lorraine, 2020. French. ⟨NNT : 2020LORR0137⟩
Accès au texte intégral et bibtex

titre: Localization guided speech separation
auteur: Sunit Sivasankaran
article: Machine Learning [cs.LG]. Université de Lorraine, 2020. English. ⟨NNT : 2020LORR0078⟩
Accès au texte intégral et bibtex

titre: Towards a 3 dimensional dynamic generic speaker model to study geometry simplifications of the vocal tract using magnetic resonance imaging data
auteur: Ioannis K Douros
article: Computation and Language [cs.CL]. Université de Lorraine, 2020. English. ⟨NNT : 2020LORR0115⟩
Accès au texte intégral et bibtex

titre: End-to-end deep learning for speech enhancement
auteur: Guillaume Carbajal
article: Computer Science [cs]. Université de Lorraine, 2020. French. ⟨NNT : 2020LORR0017⟩
Accès au texte intégral et bibtex

titre: Parametric synthesis of Arabic speech
auteur: Amal Houidhek
article: Signal and Image processing [eess.SP]. Université de Lorraine; Université de Tunis El Manar (Tunisia), 2020. French. ⟨NNT : 2020LORR0116⟩
Accès au texte intégral et bibtex

Preprints, Working Papers, …

titre: Emotion recognition from phoneme-duration information
auteur: Ajinkya Kulkarni, Ioannis K Douros, Vincent Colotte, Denis Jouvet
article: 2020
Accès au texte intégral et bibtex

titre: LibriMix: An open-source dataset for generalizable speech separation
auteur: Joris Cosentino, Manuel Pariente, Samuele Cornell, Antoine Deleforge, Emmanuel Vincent
article: 2020
Accès au texte intégral et bibtex

2019

Journal articles

titre: Motion planning for robot audition
auteur: van Quan Nguyen, Francis Colas, Emmanuel Vincent, François Charpillet
article: Autonomous Robots, 2019, 43 (8), pp.2293-2317. ⟨10.1007/s10514-019-09880-1⟩
Accès au texte intégral et bibtex

titre: Audio-Based Search and Rescue with a Drone: Highlights from the IEEE Signal Processing Cup 2019 Student Competition
auteur: Antoine Deleforge, Diego Di Carlo, Martin Strauss, Romain Serizel, Lucio Marcenaro
article: IEEE Signal Processing Magazine, 2019, 36 (5), pp.138-144. ⟨10.1109/MSP.2019.2924687⟩
Accès au texte intégral et bibtex

titre: Summarizing videos into a target language: Methodology, architectures and evaluation
auteur: Kamel Smaïli, Dominique Fohr, Carlos-Emiliano González-Gallardo, Michał L Grega, Lucjan Janowski, Denis Jouvet, Arian Koźbiał, David Langlois, Mikołaj Leszczuk, Odile Mella, Mohamed-Amine Menacer, Amaia Mendez, Elvys Linhares L Pontes, Eric Sanjuan, Juan-Manuel Torres-Moreno, Begona Garcia-Zapirain
article: Journal of Intelligent and Fuzzy Systems, 2019, 1, pp.1-12. ⟨10.3233/JIFS-179350⟩
Accès au texte intégral et bibtex

titre: Sound event detection in the DCASE 2017 Challenge
auteur: Annamaria Mesaros, Aleksandr Diment, Benjamin Elizalde, Toni Heittola, Emmanuel Vincent, Bhiksha Raj, Tuomas Virtanen
article: IEEE/ACM Transactions on Audio, Speech and Language Processing, 2019, 27 (6), pp.992-1006. ⟨10.1109/TASLP.2019.2907016⟩
Accès au texte intégral et bibtex

titre: Voice Mimicry Attacks Assisted by Automatic Speaker Verification
auteur: Ville Vestman, Tomi Kinnunen, Rosa González Hautamäki, Md Sahidullah
article: Computer Speech and Language, 2019, 59, pp.36-54. ⟨10.1016/j.csl.2019.05.005⟩
Accès au texte intégral et bibtex

titre: CRNN-based multiple DoA estimation using acoustic intensity features for Ambisonics recordings
auteur: Lauréline Perotin, Romain Serizel, Emmanuel Vincent, Alexandre Guérin
article: IEEE Journal of Selected Topics in Signal Processing, 2019, Special Issue on Acoustic Source Localization and Tracking in Dynamic Real-life Scenes, 13 (1), pp.22-33. ⟨10.1109/jstsp.2019.2900164⟩
Accès au texte intégral et bibtex

titre: VoiceHome-2, an extended corpus for multichannel speech processing in real homes
auteur: Nancy Bertin, Ewen Camberlein, Romain Lebarbenchon, Emmanuel Vincent, Sunit Sivasankaran, Irina Illina, Frédéric Bimbot
article: Speech Communication, 2019, 106, pp.68-78. ⟨10.1016/j.specom.2018.11.002⟩
Accès au texte intégral et bibtex

titre: Quality Measures for Speaker Verification with Short Utterances
auteur: Arnab Poddar, Md Sahidullah, Goutam Saha
article: Digital Signal Processing, 2019, 88, pp.66-79. ⟨10.1016/j.dsp.2019.01.023⟩
Accès au texte intégral et bibtex

titre: Learning of Hierarchical Temporal Structures for Guided Improvisation
auteur: Ken Déguernel, Emmanuel Vincent, Jérôme Nika, Gerard Assayag, Kamel Smaïli
article: Computer Music Journal, 2019, 43 (2), ⟨10.1162/comj_a_00521⟩
Accès au texte intégral et bibtex

Conference papers

titre: Lead2Gold: Towards exploiting the full potential of noisy transcriptions for speech recognition
auteur: Adrien Dufraux, Emmanuel Vincent, Awni Hannun, Armelle Brun, Matthijs Douze
article: ASRU 2019 – IEEE Automatic Speech Recognition and Understanding Workshop, Dec 2019, Singapore, Singapore
Accès au texte intégral et bibtex

titre: Grands défis scientifiques et technologiques en traitement de la parole: quelles initiatives chez Inria et au niveau européen?
auteur: Emmanuel Vincent
article: Voice Tech Paris 2019, Nov 2019, Paris, France
Accès au bibtex

titre: Regression versus classification for neural network based audio source localization
auteur: Lauréline Perotin, Alexandre Défossez, Emmanuel Vincent, Romain Serizel, Alexandre Guérin
article: WASPAA 2019 – IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, IEEE, Oct 2019, New Paltz, United States
Accès au texte intégral et bibtex

titre: Extractive Text-Based Summarization of Arabic videos: Issues, Approaches and Evaluations
auteur: M A Menacer, C E González-Gallardo, K Abidi, Dominique Fohr, Denis Jouvet, D Langlois, Odile Mella, F Sadat, J M Torres-Moreno, Kamel Smaïli
article: ICALP: International Conference on Arabic Language Processing, Oct 2019, Nancy, France. pp.65-78, ⟨10.1007/978-3-030-32959-4_5⟩
Accès au texte intégral et bibtex

titre: A Fine-grained Multilingual Analysis Based on the Appraisal Theory: Application to Arabic and English Videos
auteur: Karima Abidi, Dominique Fohr, Denis Jouvet, David Langlois, Odile Mella, Kamel Smaïli
article: ICALP: International Conference on Arabic Language Processing, Oct 2019, Nancy, France. pp.49-61, ⟨10.1007/978-3-030-32959-4_4⟩
Accès au texte intégral et bibtex

titre: COMPRISE
auteur: Emmanuel Vincent
article: META-FORUM 2019 – Cost-effective, Multilingual, Privacy-driven voice-enabled Services, Oct 2019, Brussels, Belgium
Accès au bibtex

titre: Sound event detection in domestic environments with weakly labeled data and soundscape synthesis
auteur: Nicolas Turpault, Romain Serizel, Ankit Parag Shah, Justin Salamon
article: Workshop on Detection and Classification of Acoustic Scenes and Events, Oct 2019, New York City, United States
Accès au texte intégral et bibtex

titre: MODALISA une plateforme intégrative pour capturer l’orchestration des gestes et de la parole
auteur: Christelle Dodane, Dominique Boutet, Fabrice Hirsch, Slim Ouni, Aliyah Morgenstern
article: Défi Instrumentation aux Limites, Colloque de restitution, CNRS, Sep 2019, Paris, France
Accès au bibtex

titre: Privacy-Preserving Adversarial Representation Learning in ASR: Reality or Illusion?
auteur: Brij Mohan Lal Srivastava, Aurélien Bellet, Marc Tommasi, Emmanuel Vincent
article: INTERSPEECH 2019 – 20th Annual Conference of the International Speech Communication Association, Sep 2019, Graz, Austria
Accès au texte intégral et bibtex

titre: A Multimodal Real-Time MRI Articulatory Corpus of French for Speech Research
auteur: Ioannis K Douros, Jacques Felblinger, Jens Frahm, Karyna Isaieva, Arun Joseph, Yves Laprie, Freddy Odille, Anastasiia Tsukanova, Dirk Voit, Pierre-André Vuissoz
article: INTERSPEECH 2019 – 20th Annual Conference of the International Speech Communication Association, Sep 2019, Graz, Austria
Accès au texte intégral et bibtex

titre: Towards a method of dynamic vocal tract shapes generation by combining static 3D and dynamic 2D MRI speech data
auteur: Ioannis K Douros, Anastasiia Tsukanova, Karyna Isaieva, Pierre-André Vuissoz, Yves Laprie
article: INTERSPEECH 2019 – 20th Annual Conference of the International Speech Communication Association, Sep 2019, Graz, Austria
Accès au texte intégral et bibtex

titre: ASVspoof 2019: Future Horizons in Spoofed and Fake Audio Detection
auteur: Massimiliano Todisco, Xin Wang, Ville Vestman, Md Sahidullah, Héctor Delgado, Andreas Nautsch, Junichi Yamagishi, Nicholas Evans, Tomi Kinnunen, Kong Aik Lee
article: INTERSPEECH 2019 – 20th Annual Conference of the International Speech Communication Association, Sep 2019, Graz, Austria
Accès au texte intégral et bibtex

titre: I4U Submission to NIST SRE 2018: Leveraging from a Decade of Shared Experiences
auteur: Kong Aik Lee, Ville Hautamäki, Tomi Kinnunen, Hitoshi Yamamoto, Koji Okabe, Ville Vestman, Jing Huang, Guohong Ding, Hanwu Sun, Anthony Larcher, Rohan Kumar Das, Haizhou Li, Mickaël Rouvier, Pierre-Michel Bousquet, Wei Rao, Qing Wang, Chunlei Zhang, Fahimeh Bahmaninezhad, Héctor Delgado, Jose Patino, Qiongqiong Wang, Ling Guo, Takafumi Koshinaka, Jiacen Zhang, Koichi Shinoda, Trung Ngo Trong, Md Sahidullah, Fan Lu, Yun Tang, Ming Tu, Kah Kuan Teh, Huy Dat Tran, Kuruvachan K George, Ivan Kukanov, Florent Desnous, Jichen Yang, Emre Yilmaz, Longting Xu, Jean-François Bonastre, Chenglin Xu, Zhi Hao Lim, Siong Chng, Shivesh Ranjan, John H. L. Hansen, Massimiliano Todisco, Nicholas Evans
article: INTERSPEECH 2019 – 20th Annual Conference of the International Speech Communication Association, Sep 2019, Graz, Austria
Accès au texte intégral et bibtex

titre: A Statistically Principled and Computationally Efficient Approach to Speech Enhancement using Variational Autoencoders
auteur: Manuel Pariente, Antoine Deleforge, Emmanuel Vincent
article: INTERSPEECH 2019 – 20th Annual Conference of the International Speech Communication Association, Sep 2019, Graz, Austria
Accès au texte intégral et bibtex

titre: Modeling Labial Coarticulation with Bidirectional Gated Recurrent Networks and Transfer Learning
auteur: Théo Biasutto-Lervat, Sara Dahmani, Slim Ouni
article: INTERSPEECH 2019 – 20th Annual Conference of the International Speech Communication Association, Sep 2019, Graz, Austria
Accès au texte intégral et bibtex

titre: Conditional Variational Auto-Encoder for Text-Driven Expressive AudioVisual Speech Synthesis
auteur: Sara Dahmani, Vincent Colotte, Valérian Girard, Slim Ouni
article: INTERSPEECH 2019 – 20th Annual Conference of the International Speech Communication Association, Sep 2019, Graz, Austria
Accès au texte intégral et bibtex

titre: An integrative platform to capture the orchestration of gesture and speech
auteur: Christelle Dodane, Dominique Boutet, Ivana Didirkova, Fabrice Hirsch, Slim Ouni, Aliyah Morgenstern
article: GeSpIn 2019 – Gesture and Speech in Interaction, Sep 2019, Paderborn, Germany
Accès au texte intégral et bibtex

titre: Speech Processing and Prosody
auteur: Denis Jouvet
article: TSD 2019 – 22nd International Conference of Text, Speech and Dialogue, Sep 2019, Ljubljana, Slovenia
Accès au texte intégral et bibtex

titre: Glottal Opening Measurements in VCV and VCCV Sequences
auteur: Benjamin Elie, Angelique Amelot, Yves Laprie, Shinji Maeda
article: ICA 2019 – 23rd International Congress on Acoustics, Sep 2019, Aachen, Germany
Accès au texte intégral et bibtex

titre: Acoustic Evaluation of Simplifying Hypotheses Used in Articulatory Synthesis
auteur: Ioannis K Douros, Yves Laprie, Pierre-André Vuissoz, Benjamin Elie
article: ICA 2019 – 23rd International Congress on Acoustics, Sep 2019, Aachen, Germany
Accès au texte intégral et bibtex

titre: Cauchy Multichannel Speech Enhancement with a Deep Speech Prior
auteur: Mathieu Fontaine, Aditya Arie Nugraha, Roland Badeau, Kazuyoshi Yoshii, Antoine Liutkus
article: EUSIPCO 2019 – 27th European Signal Processing Conference, Sep 2019, A Coruña, Spain. ⟨10.23919/EUSIPCO.2019.8903091⟩
Accès au texte intégral et bibtex

titre: Evaluation of text clustering methods and their dataspace embeddings: an exploration
auteur: Alain Lelu, Martine Cadot
article: IFCS 2019 – 16th Conference of the International Federation of Classification Societies, Aug 2019, Thessaloniki, Greece
Accès au texte intégral et bibtex

titre: Can prosody meet pragmatics? Case of discourse particles in French
auteur: Lou Lee, Katarina Bartkova, Denis Jouvet, Mathilde Dargnat, Yvon Keromnes
article: ICPhS 2019 – International Congress of Phonetic Sciences, Aug 2019, Melbourne, Australia
Accès au texte intégral et bibtex

titre: Effect of head posture on phonation of French vowels
auteur: Ioannis K Douros, Pierre-André Vuissoz, Yves Laprie
article: ICPhS 2019 – Proceedings of International Congress of Phonetic Sciences, Aug 2019, Melbourne, Australia
Accès au texte intégral et bibtex

titre: Comparison between 2D and 3D models for speech production: a study of French vowels
auteur: Ioannis K Douros, Pierre-André Vuissoz, Yves Laprie
article: ICPhS 2019 – International Congress of Phonetic Sciences, Aug 2019, Melbourne, Australia
Accès au texte intégral et bibtex

titre: German obstruent sequences by French L2 learners
auteur: Anne Bonneau
article: ICPhS 2019 – International Congress of Phonetic Sciences, Aug 2019, Melbourne, Australia
Accès au texte intégral et bibtex

titre: Can static vocal tract positions represent articulatory targets in continuous speech? Matching static MRI captures against real-time MRI for the French language
auteur: Anastasiia Tsukanova, Ioannis K Douros, Anastasia Shimorina, Yves Laprie
article: ICPhS 2019 – International Congress of Phonetic Sciences, Aug 2019, Melbourne, Australia
Accès au texte intégral et bibtex

titre: Acoustic impacts of geometric approximation at the level of velum and epiglottis on French vowels
auteur: Ioannis K Douros, Pierre-André Vuissoz, Yves Laprie
article: ICPhS 2019 – International Congress of Phonetic Sciences, Aug 2019, Melbourne, Australia
Accès au texte intégral et bibtex

titre: Robust non-linear regression approach for generalized inverse problems in a high dimensional setting
auteur: Florence Forbes, Antoine Deleforge, Radu Horaud, Emeline Perthame
article: AIP 2019 – Applied Inverse Problems Conference, Jul 2019, Grenoble, France
Accès au bibtex

titre: Sound Event Detection from Partially Annotated Data: Trends and Challenges
auteur: Romain Serizel, Nicolas Turpault
article: IcETRAN conference, Jun 2019, Srebrno Jezero, Serbia
Accès au texte intégral et bibtex

titre: Machine Translation on a parallel Code-Switched Corpus
auteur: Mohamed Menacer, David Langlois, Denis Jouvet, Dominique Fohr, Odile Mella, Kamel Smaïli
article: Canadian AI 2019 – 32nd Conference on Canadian Artificial Intelligence, May 2019, Ontario, Canada
Accès au texte intégral et bibtex

titre: Layer adaptation for transfer of expressivity in speech synthesis
auteur: Ajinkya Kulkarni, Vincent Colotte, Denis Jouvet
article: LTC’19 – 9th Language & Technology Conference, May 2019, Poznan, Poland
Accès au texte intégral et bibtex

titre: L’impact du trouble du spectre de l’autisme sur le bien-être psychologique des parents
auteur: Tamara Léonova, Géraldine Coffe, Anaïs Tarasconi, Agnès Piquard-Kipffer, Delphine Sardin, Aline Gosse, Julie Boré
article: XVIIIème Congrès de l’Association Internationale de Formation et de Recherche en Éducation Familiale, May 2019, Schoelcher, Martinique, France
Accès au bibtex

titre: Semi-supervised triplet loss based learning of ambient audio embeddings
auteur: Nicolas Turpault, Romain Serizel, Emmanuel Vincent
article: ICASSP 2019, May 2019, Brighton, United Kingdom
Accès au texte intégral et bibtex

titre: Mirage: 2D Source Localization Using Microphone Pair Augmentation with Echoes
auteur: Diego Di Carlo, Antoine Deleforge, Nancy Bertin
article: ICASSP 2019 – IEEE International Conference on Acoustics, Speech and Signal Processing, May 2019, Brighton, United Kingdom. pp.775-779, ⟨10.1109/ICASSP.2019.8683534⟩
Accès au texte intégral et bibtex

titre: An improved uncertainty propagation method for robust i-vector based speaker recognition
auteur: Dayana Ribas, Emmanuel Vincent
article: ICASSP 2019 – 44th International Conference on Acoustics, Speech, and Signal Processing, May 2019, Brighton, United Kingdom
Accès au texte intégral et bibtex

titre: Can We Use Speaker Recognition Technology to Attack Itself? Enhancing Mimicry Attacks Using Automatic Target Speaker Selection
auteur: Tomi Kinnunen, Rosa González Hautamäki, Ville Vestman, Md Sahidullah
article: ICASSP 2019 – 44th International Conference on Acoustics, Speech, and Signal Processing, May 2019, Brighton, United Kingdom
Accès au texte intégral et bibtex

titre: F0 modeling using DNN for Arabic parametric speech synthesis
auteur: Imene Zangar, Zied Mnasri, Vincent Colotte, Denis Jouvet
article: INNSBDDL 2019 – INNS Big Data and Deep Learning, Apr 2019, Sestri Levante, Italy
Accès au texte intégral et bibtex

titre: Parole & deep learning : succès et grands défis
auteur: Emmanuel Vincent
article: Journée IA, Langage et Citoyens, Mar 2019, Nancy, France
Accès au bibtex

Book sections

titre: Bibliometric delineation of scientific fields
auteur: Michel Zitt, Alain Lelu, Martine Cadot, Guillaume Cabanac
article: Wolfgang Glänzel; Henk F. Moed; Ulrich Schmoch; Mike Thelwall. Handbook of Science and Technology Indicators, Springer International Publishing, pp.25-68, 2019, Handbook of Science and Technology Indicators, 978-3-030-02510-6. ⟨10.1007/978-3-030-02511-3_2⟩
Accès au texte intégral et bibtex

titre: Introduction to Voice Presentation Attack Detection and Recent Advances
auteur: Md Sahidullah, Héctor Delgado, Massimiliano Todisco, Tomi Kinnunen, Nicholas Evans, Junichi Yamagishi, Kong Aik Lee
article: Sébastien Marcel; Mark S. Nixon; Julian Fierrez; Nicholas Evans. Handbook of Biometric Anti-Spoofing: Presentation Attack Detection, Springer, pp.321-361, 2019, Advances in Computer Vision and Pattern Recognition, 978-3-319-92626-1. ⟨10.1007/978-3-319-92627-8_15⟩
Accès au texte intégral et bibtex

Poster communications

titre: BENEPHIDIRE : un projet de recherche en phonétique, en informatique et en neurologie sur le bégaiement
auteur: Fabrice Hirsch, Guillaume Herbet, Slim Ouni, Rudolph Sock, Christelle Dodane, Agata Jackiewicz, Ivana Didirková, Dodji Gbedahou, Sylvie Gasser, Yves Laprie, Béatrice Vaxelaire, Camille Fauth, Bernard Harmegnies, Marie-Claude Monfrais-Pfauwadel, Bernadette Piérart, Marine Pendeliau, Géraldine Hilaire, Sylvia Topouzkhanian
article: 8è Journées de Phonétique Clinique, May 2019, Mons, Belgium
Accès au texte intégral et bibtex

Reports

titre: Joint NN-Supported Multichannel Reduction of Acoustic Echo, Reverberation and Noise: Supporting Document
auteur: Guillaume Carbajal, Romain Serizel, Emmanuel Vincent, Eric Humbert
article: [Research Report] RR-9303, INRIA Nancy; Invoxia SAS. 2019
Accès au texte intégral et bibtex

titre: A Statistically Principled and Computationally Efficient Approach to Speech Enhancement using Variational Autoencoders: Supporting Document
auteur: Manuel Pariente, Antoine Deleforge, Emmanuel Vincent
article: [Research Report] RR-9268, INRIA. 2019, pp.1-8
Accès au texte intégral et bibtex

titre: I4U Submission to NIST SRE 2018: Leveraging from a Decade of Shared Experiences
auteur: Kong Aik Lee, Ville Hautamäki, Tomi Kinnunen, Hitoshi Yamamoto, Koji Okabe, Ville Vestman, Jing Huang, Guohong Ding, Hanwu Sun, Anthony Larcher, Rohan Kumar Das, Haizhou Li, Mickaël Rouvier, Pierre-Michel Bousquet, Wei Rao, Qing Wang, Chunlei Zhang, Fahimeh Bahmaninezhad, Héctor Delgado, Jose Patino, Qiongqiong Wang, Ling Guo, Takafumi Koshinaka, Jiacen Zhang, Koichi Shinoda, Trung Ngo Trong, Md Sahidullah, Fan Lu, Yun Tang, Ming Tu, Kah Kuan Teh, Huy Dat Tran, Kuruvachan K George, Ivan Kukanov, Florent Desnous, Jichen Yang, Emre Yilmaz, Longting Xu, Jean-François Bonastre, Chenglin Xu, Zhi Hao Lim, Siong Chng, Shivesh Ranjan, John H. L. Hansen, Massimiliano Todisco, Nicholas Evans
article: [Research Report] I4U Consortium. 2019
Accès au texte intégral et bibtex

titre: AI in the media and creative industries
auteur: Baptiste Caramiaux, Fabien Lotte, Joost Geurts, Giuseppe Amato, Malte Behrmann, Frédéric Bimbot, Fabrizio Falchi, Ander Garcia, Jaume Gibert, Guillaume Gravier, Hadmut Holken, Hartmut Koenitz, Sylvain Lefebvre, Antoine Liutkus, Andrew Perkis, Rafael Redondo, Enrico Turrin, Thierry Viéville, Emmanuel Vincent
article: [Research Report] New European Media (NEM). 2019, pp.1-35
Accès au texte intégral et bibtex

Software

titre: Underdetermined Reverberant Source Separation
auteur: Matthieu Kowalski, Emmanuel Vincent, Rémi Gribonval
article: 2019, ⟨swh:1:dir:ec4ae097465d9ea51589537ea94b2ea50e8d134d;origin=https://hal.archives-ouvertes.fr/hal-02309043;visit=swh:1:snp:e35494fd4cb57af0b22131ab8c4a4d8bd5cffcc6;anchor=swh:1:rev:2d23c3e68b755b720ecca8ddd5e1f8fe99909be2;path=/⟩
Accès au texte intégral et bibtex

Theses

titre: Articulatory speech synthesis
auteur: Anastasiia Tsukanova
article: Computation and Language [cs.CL]. Université de Lorraine, 2019. English. ⟨NNT : 2019LORR0166⟩
Accès au texte intégral et bibtex

titre: Localization and enhancement of speech from the Ambisonics format
auteur: Lauréline Perotin
article: Signal and Image processing [eess.SP]. Université de Lorraine, 2019. French. ⟨NNT : 2019LORR0124⟩
Accès au texte intégral et bibtex

titre: Alpha-stable processes for signal processing
auteur: Mathieu Fontaine
article: Signal and Image processing [eess.SP]. Université de Lorraine, 2019. French. ⟨NNT : 2019LORR0037⟩
Accès au texte intégral et bibtex

Preprints, Working Papers, …

titre: The Speed Submission to DIHARD II: Contributions & Lessons Learned
auteur: Md Sahidullah, Jose Patino, Samuele Cornell, Ruiqing Yin, Sunit Sivasankaran, Hervé Bredin, Pavel Korshunov, Alessio Brutti, Romain Serizel, Emmanuel Vincent, Nicholas Evans, Sébastien Marcel, Stefano Squartini, Claude Barras
article: 2019
Accès au texte intégral et bibtex

2018

Journal articles

titre: Evaluation of speech unit modelling for HMM-based speech synthesis for Arabic
auteur: Amal Houidhek, Vincent Colotte, Zied Mnasri, Denis Jouvet
article: International Journal of Speech Technology, 2018, pp.1-12. ⟨10.1007/s10772-018-09558-6⟩
Accès au texte intégral et bibtex

titre: Probabilistic Factor Oracles for Multidimensional Machine Improvisation
auteur: Ken Déguernel, Emmanuel Vincent, Gérard Assayag
article: Computer Music Journal, 2018, 42 (2), pp.52-66. ⟨10.1162/comj_a_00460⟩
Accès au texte intégral et bibtex

titre: Rank-1 Constrained Multichannel Wiener Filter for Speech Recognition in Noisy Environments
auteur: Ziteng Wang, Emmanuel Vincent, Romain Serizel, Yonghong Yan
article: Computer Speech and Language, 2018, 49, pp.37-51. ⟨10.1016/j.csl.2017.11.003⟩
Accès au texte intégral et bibtex

titre: Adaptation of speech recognition vocabularies for improved transcription of YouTube videos
auteur: Denis Jouvet, David Langlois, Mohamed Amine Menacer, Dominique Fohr, Odile Mella, Kamel Smaïli
article: Journal of International Science and General Applications, 2018, 1 (1), pp.1-9
Accès au texte intégral et bibtex

titre: Dynamic Lip Animation from a Limited number of Control Points: Towards an Effective Audiovisual Spoken Communication
auteur: Slim Ouni, Guillaume Gris
article: Speech Communication, 2018, 96, pp.49-57. ⟨10.1016/j.specom.2017.11.006⟩
Accès au texte intégral et bibtex

titre: DNN Uncertainty Propagation using GMM-Derived Uncertainty Features for Noise Robust ASR
auteur: Karan Nathwani, Emmanuel Vincent, Irina Illina
article: IEEE Signal Processing Letters, 2018, ⟨10.1109/LSP.2018.2791534⟩
Accès au texte intégral et bibtex

Conference papers

titre: Dynamic Extension of ASR Lexicon Using Wikipedia Data
auteur: Badr Abdullah, Irina Illina, Dominique Fohr
article: IEEE Workshop on Spoken and Language Technology (SLT), Dec 2018, Athens, Greece
Accès au texte intégral et bibtex

titre: Transforming acoustic characteristics to deceive playback spoofing countermeasures of speaker verification systems
auteur: Fuming Fang, Junichi Yamagishi, Isao Echizen, Md Sahidullah, Tomi Kinnunen
article: WIFS 2018 – IEEE International Workshop on Information Forensics and Security, Dec 2018, Hong Kong, Hong Kong SAR China
Accès au texte intégral et bibtex

titre: MULAN: A Blind and Off-Grid Method for Multichannel Echo Retrieval
auteur: Helena Peic Tukuljac, Antoine Deleforge, Rémi Gribonval
article: NeurIPS 2018 – Thirty-second Conference on Neural Information Processing Systems, Dec 2018, Montréal, Canada. pp.1-11
Accès au texte intégral et bibtex

titre: Large-Scale Weakly Labeled Semi-Supervised Sound Event Detection in Domestic Environments
auteur: Romain Serizel, Nicolas Turpault, Hamid Eghbal-Zadeh, Ankit Parag Shah
article: Workshop on Detection and Classification of Acoustic Scenes and Events, Nov 2018, Woking, United Kingdom
Accès au texte intégral et bibtex

titre: DNN-Based Speech Synthesis for Arabic: Modelling and Evaluation
auteur: Amal Houidhek, Vincent Colotte, Zied Mnasri, Denis Jouvet
article: SLSP 2018 – 6th International Conference on Statistical Language and Speech Processing, Oct 2018, Mons, Belgium
Accès au texte intégral et bibtex

titre: DREGON: Dataset and Methods for UAV-Embedded Sound Source Localization
auteur: Martin Strauss, Pol Mordel, Victor Miguet, Antoine Deleforge
article: IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2018), Oct 2018, Madrid, Spain. pp.5735-5742, ⟨10.1109/IROS.2018.8593581⟩
Accès au texte intégral et bibtex

titre: CRNN-based joint azimuth and elevation localization with the Ambisonics intensity vector
auteur: Lauréline Perotin, Romain Serizel, Emmanuel Vincent, Alexandre Guérin
article: IWAENC 2018 – 16th International Workshop on Acoustic Signal Enhancement, Sep 2018, Tokyo, Japan
Accès au texte intégral et bibtex

titre: Evaluation of an Open-Source Implementation of the SRP-PHAT Algorithm within the 2018 Locata Challenge
auteur: Romain Lebarbenchon, Ewen Camberlein, Diego Di Carlo, Clément Gaultier, Antoine Deleforge, Nancy Bertin
article: LOCATA Challenge Workshop, a satellite event of IWAENC 2018, Sep 2018, Tokyo, Japan
Accès au texte intégral et bibtex

titre: A Proposed Methodology for Subjective Evaluation of Video and Text Summarization
auteur: Begona Garcia-Zapirain, Cristian Castillo, Aritz Badiola, Sofia Zahia, Amaia Mendez, David Langlois, Denis Jouvet, Juan-Manuel Torres-Moreno, Mikołaj Leszczuk, Kamel Smaïli
article: MISSI 2018 – 11th edition of the International Conference on Multimedia and Network Information Systems, Sep 2018, Wrocław, Poland. pp.396-404, ⟨10.1007/978-3-319-98678-4_40⟩
Accès au texte intégral et bibtex

titre: A First Summarization System of a Video in a Target Language
auteur: Kamel Smaïli, Dominique Fohr, Carlos González-Gallardo, Michal Grega, Lucjan Janowski, Denis Jouvet, Artur Komorowski, Arian Kozbial, David Langlois, Mikołaj Leszczuk, Odile Mella, Mohamed Amine Menacer, Amaia Mendez, Elvys Linhares Pontes, Eric Sanjuan, Damian Swist, Juan-Manuel Torres-Moreno, Begona Garcia-Zapirain
article: MISSI 2018 – 11th edition of the International Conference on Multimedia and Network Information Systems, Sep 2018, Wrocław, Poland. pp.1-12
Accès au texte intégral et bibtex

titre: An Integrated AMIS Prototype for Automated Summarization and Translation of Newscasts and Reports
auteur: Michał L Grega, Kamel Smaïli, Mikołaj Leszczuk, Carlos-Emiliano González-Gallardo, Juan-Manuel Torres-Moreno, Elvys Linhares Pontes, Dominique Fohr, Odile Mella, Mohamed Amine Menacer, Denis Jouvet
article: MISSI 2018 – 11th International Conference on Multimedia and Network Information Systems, Sep 2018, Wrocław, Poland. pp.415-423, ⟨10.1007/978-3-319-98678-4_42⟩
Accès au texte intégral et bibtex

titre: The VocADom Project: Speech Interaction for Well-being and Reliance Improvement
auteur: Michel Vacher, Emmanuel Vincent, Marc-Eric Bobillier Chaumon, Thierry Joubert, François Portet, Dominique Fohr, Sybille Caffiau, Thierry Desot
article: MobileHCI 2018 – 20th International Conference on Human-Computer Interaction with Mobile Devices and Services, Sep 2018, Barcelona, Spain
Accès au texte intégral et bibtex

titre: Centerline articulatory models of the velum and epiglottis for articulatory synthesis of speech
auteur: Yves Laprie, Benjamin Elie, Anastasiia Tsukanova, Pierre-André Vuissoz
article: 26th European Signal Processing Conference (EUSIPCO 2018), Sep 2018, Rome, Italy. ⟨10.23919/eusipco.2018.8553416⟩
Accès au texte intégral et bibtex

titre: Integrated Presentation Attack Detection and Automatic Speaker Verification: Common Features and Gaussian Back-end Fusion
auteur: Massimiliano Todisco, Héctor Delgado, Kong Aik Lee, Md Sahidullah, Nicholas Evans, Tomi Kinnunen, Junichi Yamagishi
article: Interspeech 2018 – 19th Annual Conference of the International Speech Communication Association, Sep 2018, Hyderabad, India. ⟨10.21437/Interspeech.2018-2289⟩
Accès au texte intégral et bibtex

titre: A French-Spanish Multimodal Speech Communication Corpus Incorporating Acoustic Data, Facial, Hands and Arms Gestures Information
auteur: Lucas Terissi, Gonzalo Sad, Mauricio Cerda, Slim Ouni, Rodrigo Galvez, Juan B. Gómez, Bernard Girau, Nancy Hitschfeld-Kahler
article: Interspeech 2018 – 19th Annual Conference of the International Speech Communication Association, Sep 2018, Hyderabad, India
Accès au texte intégral et bibtex

titre: Keyword-based speaker localization: Localizing a target speaker in a multi-speaker environment
auteur: Sunit Sivasankaran, Emmanuel Vincent, Dominique Fohr
article: Interspeech 2018 – 19th Annual Conference of the International Speech Communication Association, Sep 2018, Hyderabad, India
Accès au texte intégral et bibtex

titre: Phoneme-to-Articulatory mapping using bidirectional gated RNN
auteur: Théo Biasutto-Lervat, Slim Ouni
article: Interspeech 2018 – 19th Annual Conference of the International Speech Communication Association, Sep 2018, Hyderabad, India
Accès au texte intégral et bibtex

titre: The fifth ‘CHiME’ Speech Separation and Recognition Challenge: Dataset, task and baselines
auteur: Jon Barker, Shinji Watanabe, Emmanuel Vincent, Jan Trmal
article: Interspeech 2018 – 19th Annual Conference of the International Speech Communication Association, Sep 2018, Hyderabad, India
Accès au texte intégral et bibtex

titre: Prosodic and Pragmatic Values of Discourse Particles in French
auteur: Lou Lee, Katarina Bartkova, Mathilde Dargnat, Denis Jouvet
article: ExLing 2018 – 9th Tutorial and Research Workshop on Experimental Linguistics, Aug 2018, Paris, France
Accès au texte intégral et bibtex

titre: Analysis of prosodic correlates of emotional speech data
auteur: Katarina Bartkova, Denis Jouvet
article: ExLing 2018 – 9th Tutorial and Research Workshop on Experimental Linguistics, Aug 2018, Paris, France
Accès au texte intégral et bibtex

titre: Phone Merging for Code-switched Speech Recognition
auteur: Sunit Sivasankaran, Brij Mohan Lal Srivastava, Sunayana Sitaram, Kalika Bali, Monojit Choudhury
article: Third Workshop on Computational Approaches to Linguistic Code-switching, collocated with ACL 2018, Jul 2018, Melbourne, Australia
Accès au texte intégral et bibtex

titre: Audiovisual Synchrony Detection with Optimized Audio Features
auteur: Sami Sieranoja, Md Sahidullah, Tomi Kinnunen, Jukka Komulainen, Abdenour Hadid
article: ICSIP 2018 – 3rd International Conference on Signal and Image Processing, Jul 2018, Shenzhen, China
Accès au texte intégral et bibtex

titre: Multichannel Audio Modeling with Elliptically Stable Tensor Decomposition
auteur: Mathieu Fontaine, Fabian-Robert Stöter, Antoine Liutkus, Umut Şimşekli, Romain Serizel, Roland Badeau
article: LVA/ICA 2018 – 14th International Conference on Latent Variable Analysis and Signal Separation, Jul 2018, Surrey, United Kingdom. pp.13-23, ⟨10.1007/978-3-319-93764-9_2⟩
Accès au texte intégral et bibtex

titre: ASVspoof 2017 Version 2.0: meta-data analysis and baseline enhancements
auteur: Héctor Delgado, Massimiliano Todisco, Md Sahidullah, Nicholas Evans, Tomi Kinnunen, Kong Aik Lee, Junichi Yamagishi
article: Odyssey 2018 – The Speaker and Language Recognition Workshop, Jun 2018, Les Sables d’Olonne, France
Accès au texte intégral et bibtex

titre: t-DCF: a Detection Cost Function for the Tandem Assessment of Spoofing Countermeasures and Automatic Speaker Verification
auteur: Tomi Kinnunen, Kong Aik Lee, Héctor Delgado, Nicholas Evans, Massimiliano Todisco, Md Sahidullah, Junichi Yamagishi, Douglas A Reynolds
article: Odyssey 2018 – The Speaker and Language Recognition Workshop, Jun 2018, Les Sables d’Olonne, France
Accès au texte intégral et bibtex

titre: Impact of fluency and segmental categorization in L2: the case of French final fricatives uttered by German speakers
auteur: Anne Bonneau
article: Speech Prosody 2018, Jun 2018, Poznań, Poland. ⟨10.21437/speechprosody.2018-189⟩
Accès au texte intégral et bibtex

titre: Duration modeling using DNN for Arabic speech synthesis
auteur: Imene Zangar, Zied Mnasri, Vincent Colotte, Denis Jouvet, Amal Houidhek
article: 9th International Conference on Speech Prosody, Jun 2018, Poznań, Poland
Accès au texte intégral et bibtex

titre: Exploration de dépendances structurelles mélodiques par réseaux de neurones récurrents
auteur: Nathan Libermann, Frédéric Bimbot, Emmanuel Vincent
article: JIM 2018 – Journées d’Informatique Musicale, May 2018, Amiens, France. pp.81-86
Accès au texte intégral et bibtex

titre: Semi-supervised learning with deep neural networks for relative transfer function inverse regression
auteur: Ziteng Wang, Junfeng Li, Yonghong Yan, Emmanuel Vincent
article: ICASSP 2018 – IEEE International Conference on Acoustics, Speech and Signal Processing, Apr 2018, Calgary, Canada
Accès au texte intégral et bibtex

titre: Multiple-input neural network-based residual echo suppression
auteur: Guillaume Carbajal, Romain Serizel, Emmanuel Vincent, Eric Humbert
article: ICASSP 2018 – IEEE International Conference on Acoustics, Speech and Signal Processing, Apr 2018, Calgary, Canada. pp.1-5
Accès au texte intégral et bibtex

titre: Multichannel speech separation with recurrent neural networks from high-order ambisonics recordings
auteur: Lauréline Perotin, Romain Serizel, Emmanuel Vincent, Alexandre Guérin
article: 43rd IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2018), Apr 2018, Calgary, Canada
Accès au texte intégral et bibtex

titre: Interference reduction on full-length live recordings
auteur: Diego Di Carlo, Antoine Liutkus, Ken Déguernel
article: ICASSP: International Conference on Acoustics, Speech, and Signal Processing, Apr 2018, Calgary, Canada. pp.736-740, ⟨10.1109/ICASSP.2018.8462621⟩
Accès au texte intégral et bibtex

titre: Separake: Source Separation with a Little Help From Echoes
auteur: Robin Scheibler, Diego Di Carlo, Antoine Deleforge, Ivan Dokmanić
article: ICASSP 2018 – IEEE International Conference on Acoustics, Speech and Signal Processing, Apr 2018, Calgary, Canada. pp.6897-6901, ⟨10.1109/ICASSP.2018.8461345⟩
Accès au texte intégral et bibtex

titre: Blind Source Separation Using Mixtures of Alpha-Stable Distributions
auteur: Nicolas Keriven, Antoine Deleforge, Antoine Liutkus
article: ICASSP: International Conference on Acoustics, Speech and Signal Processing, Apr 2018, Calgary, Canada. pp.771-775, ⟨10.1109/ICASSP.2018.8462095⟩
Accès au texte intégral et bibtex

titre: Audio source separation with magnitude priors: the BEADS model
auteur: Antoine Liutkus, Christian Rohlfing, Antoine Deleforge
article: ICASSP: International Conference on Acoustics, Speech and Signal Processing, Apr 2018, Calgary, Canada. pp.56-60, ⟨10.1109/ICASSP.2018.8462515⟩
Accès au texte intégral et bibtex

Book sections

titre: Audio-Motor Integration for Robot Audition
auteur: Antoine Deleforge, Alexander Schmidt, Walter Kellermann
article: Multimodal Behavior Analysis in the Wild, Academic Press, pp.1-27, 2018
Accès au texte intégral et bibtex

titre: Spectral masking and filtering
auteur: Timo Gerkmann, Emmanuel Vincent
article: Emmanuel Vincent; Tuomas Virtanen; Sharon Gannot. Audio source separation and speech enhancement, Wiley, 2018, 978-1-119-27989-1
Accès au texte intégral et bibtex

titre: Introduction
auteur: Emmanuel Vincent, Sharon Gannot, Tuomas Virtanen
article: Emmanuel Vincent; Tuomas Virtanen; Sharon Gannot. Audio source separation and speech enhancement, Wiley, 2018, 978-1-119-27989-1
Accès au texte intégral et bibtex

titre: Time-frequency processing – Spectral properties
auteur: Tuomas Virtanen, Emmanuel Vincent, Sharon Gannot
article: Emmanuel Vincent; Tuomas Virtanen; Sharon Gannot. Audio source separation and speech enhancement, Wiley, 2018, 978-1-119-27989-1
Accès au texte intégral et bibtex

titre: Acoustics – Spatial properties
auteur: Emmanuel Vincent, Sharon Gannot, Tuomas Virtanen
article: Emmanuel Vincent; Tuomas Virtanen; Sharon Gannot. Audio source separation and speech enhancement, Wiley, 2018, 978-1-119-27989-1
Accès au texte intégral et bibtex

titre: Perspectives
auteur: Emmanuel Vincent, Tuomas Virtanen, Sharon Gannot
article: Emmanuel Vincent; Tuomas Virtanen; Sharon Gannot. Audio source separation and speech enhancement, Wiley, 2018, 978-1-119-27989-1
Accès au texte intégral et bibtex

titre: An introduction to multichannel NMF for audio source separation
auteur: Alexey Ozerov, Cédric Févotte, Emmanuel Vincent
article: Audio Source Separation, Springer, 2018, Signals and Communication Technology
Accès au texte intégral et bibtex

titre: Single-channel audio source separation with NMF: divergences, constraints and algorithms
auteur: Cédric Févotte, Emmanuel Vincent, Alexey Ozerov
article: Audio Source Separation, Springer, 2018
Accès au texte intégral et bibtex

titre: Deep neural network based multichannel audio source separation
auteur: Aditya Arie Nugraha, Antoine Liutkus, Emmanuel Vincent
article: Audio Source Separation, Springer, pp.157-195, 2018, 978-3-319-73030-1. ⟨10.1007/978-3-319-73031-8_7⟩
Accès au texte intégral et bibtex

titre: Articulatory Speech Synthesis from Static Context-Aware Articulatory Targets
auteur: Anastasiia Tsukanova, Benjamin Elie, Yves Laprie
article: Qiang Fang; Jianwu Dang; Pascal Perrier; Jianguo Wei; Longbiao Wang; Nan Yan. Studies on Speech Production, 10733, Springer, pp.37-47, 2018, Lecture Notes in Computer Science, 978-3-030-00125-4. ⟨10.1007/978-3-030-00126-1_4⟩
Accès au texte intégral et bibtex

Books

titre: Audio source separation and speech enhancement
auteur: Emmanuel Vincent, Tuomas Virtanen, Sharon Gannot
article: Wiley, pp.504, 2018, 9781119279860. ⟨10.1002/9781119279860⟩
Accès au bibtex

Patents

titre: Image processing device
auteur: Slim Ouni, Guillaume Gris
article: United States, Patent n° : US2018/0061109 A1. 2018
Accès au texte intégral et bibtex

Poster communications

titre: MODALISA : MultiMODalité lors de l’Acquisition du Langage : Interaction entre le Signal de parole et la gestuAlité
auteur: Christelle Dodane, Fabrice Hirsch, Slim Ouni, Dominique Boutet, Aliyah Morgenstern
article: Colloque de restitution, Instrumentation aux Limites CNRS, May 2018, Paris, France
Accès au bibtex

Reports

titre: Benchmarking seventeen clustering methods on a text dataset
auteur: Martine Cadot, Alain Lelu, Michel Zitt
article: [Research Report] LORIA. 2018
Accès au texte intégral et bibtex

Theses

titre: Acoustic control of wind farms
auteur: Baldwin Dumortier
article: Systems and Control [cs.SY]. Université de Lorraine, 2018. French. ⟨NNT : 2018LORR0131⟩
Accès au texte intégral et bibtex

titre: Learning of musical structures in the context of improvisation
auteur: Ken Déguernel
article: Artificial Intelligence [cs.AI]. Université de Lorraine, 2018. French. ⟨NNT : 2018LORR0011⟩
Accès au texte intégral et bibtex

Preprints, Working Papers, …

titre: Can We Use Speaker Recognition Technology to Attack Itself? Enhancing Mimicry Attacks Using Automatic Target Speaker Selection
auteur: Tomi Kinnunen, Rosa González Hautamäki, Ville Vestman, Md Sahidullah
article: 2018
Accès au texte intégral et bibtex

2017

Journal articles

titre: Simulating alveolar trills using a two-mass model of the tongue tip
auteur: Benjamin Elie, Yves Laprie
article: Journal of the Acoustical Society of America, 2017, 142 (5), pp.3245-3256. ⟨10.1121/1.5012688⟩
Accès au texte intégral et bibtex

titre: Scolarité et handicap : parcours de 170 jeunes dysphasiques ou dyslexiques-dysorthographiques âgés de 6 à 20 ans
auteur: Agnès Piquard-Kipffer, Tamara Léonova
article: A.N.A.E. Approche neuropsychologique des apprentissages chez l’enfant, 2017
Accès au texte intégral et bibtex

titre: Inclusive education for students with specific language disorders: What schooling according to country and language
auteur: Tamara Léonova, Agnès Piquard-Kipffer, Askar Jumageldinov, Marie Robert, Mikhaïl Berebin
article: A.N.A.E. Approche neuropsychologique des apprentissages chez l’enfant, 2017, n°147 – Troubles de l’apprentissage du langage écrit et prise en charge multidisciplinaire : De la science à la salle de classe, 29 (2)
Accès au texte intégral et bibtex

titre: Inclusive education: a particular system of teaching with dyslexic and dysphasic children, in a specialized school
auteur: Céline Leclerc, Agnès Piquard-Kipffer, C Rosin, M Wernet
article: A.N.A.E. Approche neuropsychologique des apprentissages chez l’enfant, 2017
Accès au texte intégral et bibtex

titre: Acoustic impact of the gradual glottal abduction on the production of fricatives: A numerical study
auteur: Benjamin Elie, Yves Laprie
article: Journal of the Acoustical Society of America, 2017, 142 (3), pp.1303-1317. ⟨10.1121/1.5000232⟩
Accès au texte intégral et bibtex

titre: A combined evaluation of established and new approaches for speech recognition in varied reverberation conditions
auteur: Sunit Sivasankaran, Emmanuel Vincent, Irina Illina
article: Computer Speech and Language, 2017, 46, pp.444-460. ⟨10.1016/j.csl.2017.02.003⟩
Accès au texte intégral et bibtex

titre: The third ‘CHiME’ speech separation and recognition challenge: Analysis and outcomes
auteur: Jon Barker, Ricard Marxer, Emmanuel Vincent, Shinji Watanabe
article: Computer Speech and Language, 2017, 46, pp.605-626. ⟨10.1016/j.csl.2016.10.005⟩
Accès au texte intégral et bibtex

titre: Multi-microphone speech recognition in everyday environments
auteur: Jon Barker, Ricard Marxer, Emmanuel Vincent, Shinji Watanabe
article: Computer Speech and Language, 2017, 46, pp.386-387. ⟨10.1016/j.csl.2017.02.007⟩
Accès au texte intégral et bibtex

titre: An analysis of environment, microphone and data simulation mismatches in robust speech recognition
auteur: Emmanuel Vincent, Shinji Watanabe, Aditya Arie Nugraha, Jon Barker, Ricard Marxer
article: Computer Speech and Language, 2017, 46, pp.535-557. ⟨10.1016/j.csl.2016.11.005⟩
Accès au texte intégral et bibtex

titre: Feature Learning with Matrix Factorization Applied to Acoustic Scene Classification
auteur: Victor Bisot, Romain Serizel, Slim Essid, Gael Richard
article: IEEE/ACM Transactions on Audio, Speech and Language Processing, 2017, 25 (6), pp.1216-1229. ⟨10.1109/TASLP.2017.2690570⟩
Accès au texte intégral et bibtex

titre: A consolidated perspective on multi-microphone speech enhancement and source separation
auteur: Sharon Gannot, Emmanuel Vincent, Shmulik Markovich-Golan, Alexey Ozerov
article: IEEE/ACM Transactions on Audio, Speech and Language Processing, 2017, 25 (4), pp.692-730. ⟨10.1109/TASLP.2016.2647702⟩
Accès au texte intégral et bibtex

titre: Modelling Semantic Context of OOV Words in Large Vocabulary Continuous Speech Recognition
auteur: Imran Ahamad Sheikh, Dominique Fohr, Irina Illina, Georges Linares
article: IEEE/ACM Transactions on Audio, Speech and Language Processing, 2017, 25 (3), pp.598-610. ⟨10.1109/TASLP.2017.2651361⟩
Accès au texte intégral et bibtex

titre: A preliminary study of the temporal organization of the labial closure in fluent speech produced by persons who stutter.
auteur: Ivana Didirkova, Camille Fauth, Slim Ouni, Fabrice Hirsch
article: Glossa, 2017, Spécial Montpellier, 121, pp.1-14
Accès au bibtex

titre: Estimating the structural segmentation of popular music pieces under regularity constraints
auteur: Gabriel Sargent, Frédéric Bimbot, Emmanuel Vincent
article: IEEE/ACM Transactions on Audio, Speech and Language Processing, 2017, ⟨10.1109/TASLP.2016.2635031⟩
Accès au texte intégral et bibtex

Conference papers

titre: Consistent DNN Uncertainty Training and Decoding for Robust ASR
auteur: Karan Nathwani, Emmanuel Vincent, Irina Illina
article: 2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), Dec 2017, Okinawa, Japan
Accès au texte intégral et bibtex

titre: Topic segmentation in ASR transcripts using bidirectional RNNs for change detection
auteur: Imran Sheikh, Dominique Fohr, Irina Illina
article: ASRU 2017 – IEEE Automatic Speech Recognition and Understanding Workshop, Dec 2017, Okinawa, Japan
Accès au texte intégral et bibtex

titre: Is statistical machine translation approach dead?
auteur: Mohamed Amine Menacer, David Langlois, Odile Mella, Dominique Fohr, Denis Jouvet, Kamel Smaïli
article: ICNLSSP 2017 – International Conference on Natural Language, Signal and Speech Processing, ISGA, Dec 2017, Casablanca, Morocco. pp.1-5
Accès au texte intégral et bibtex

titre: About vocabulary adaptation for automatic speech recognition of video data
auteur: Denis Jouvet, David Langlois, Mohamed Amine Menacer, Dominique Fohr, Odile Mella, Kamel Smaïli
article: ICNLSSP’2017 – International Conference on Natural Language, Signal and Speech Processing, Dec 2017, Casablanca, Morocco. pp.1-5
Accès au texte intégral et bibtex

titre: Data Selection in the Framework of Automatic Speech Recognition
auteur: Ismael Bada, Juan Karsten, Dominique Fohr, Irina Illina
article: ICNLSSP 2017 – International conference on natural language, signal and speech processing 2017, Dec 2017, Casablanca, Morocco. pp.1-5
Accès au texte intégral et bibtex

titre: Statistical modelling of speech units in HMM-based speech synthesis for Arabic
auteur: Amal Houidhek, Vincent Colotte, Zied Mnasri, Denis Jouvet, Imene Zangar
article: LTC 2017 – 8th Language & Technology Conference, Nov 2017, Poznań, Poland. pp.1-5
Accès au texte intégral et bibtex

titre: Out-of-Vocabulary Word Probability Estimation using RNN Language Model
auteur: Irina Illina, Dominique Fohr
article: 8th Language & Technology Conference, Nov 2017, Poznań, Poland
Accès au texte intégral et bibtex

titre: DCASE 2017 Challenge setup: Tasks, datasets and baseline system
auteur: Annamaria Mesaros, Toni Heittola, Aleksandr Diment, Benjamin Elizalde, Ankit Shah, Emmanuel Vincent, Bhiksha Raj, Tuomas Virtanen
article: DCASE 2017 – Workshop on Detection and Classification of Acoustic Scenes and Events, Nov 2017, Munich, Germany
Accès au texte intégral et bibtex

titre: Nonnegative Feature Learning Methods for Acoustic Scene Classification
auteur: Victor Bisot, Romain Serizel, Slim Essid, Gael Richard
article: DCASE 2017 – Workshop on Detection and Classification of Acoustic Scenes and Events, Nov 2017, Munich, Germany
Accès au texte intégral et bibtex

titre: Acoustic correlates of L2 prosodic boundaries by German learners of French
auteur: Anne Bonneau
article: SLaP3 2017 – 3rd Workshop on Second Language Prosody, Nov 2017, Bangor, United Kingdom. pp.1
Accès au bibtex

titre: Development of the Arabic Loria Automatic Speech Recognition system (ALASR) and its evaluation for Algerian dialect
auteur: Mohamed Amine Menacer, Odile Mella, Dominique Fohr, Denis Jouvet, David Langlois, Kamel Smaïli
article: ACLing 2017 – 3rd International Conference on Arabic Computational Linguistics, Nov 2017, Dubai, United Arab Emirates. pp.1-8
Accès au texte intégral et bibtex

titre: Analysis and Automatic Classification of Some Discourse Particles on a Large Set of French Spoken Corpora
auteur: Denis Jouvet, Katarina Bartkova, Mathilde Dargnat, Lou Lee
article: SLSP’2017, 5th International Conference on Statistical Language and Speech Processing, Oct 2017, Le Mans, France
Accès au texte intégral et bibtex

titre: Articulatory model of the epiglottis
auteur: Yves Laprie, Benjamin Elie, Pierre-André Vuissoz, Anastasiia Tsukanova
article: The 11th International Seminar on Speech Production, Oct 2017, Tianjin, China
Accès au texte intégral et bibtex

titre: Articulatory Speech Synthesis from Static Context-Aware Articulatory Targets
auteur: Anastasiia Tsukanova, Benjamin Elie, Yves Laprie
article: ISSP 2017 – 11th International Seminar on Speech Production, Oct 2017, Tianjin, China
Accès au texte intégral et bibtex

titre: DYCI2 agents: merging the “free”, “reactive”, and “scenario-based” music generation paradigms
auteur: Jérôme Nika, Ken Déguernel, Axel Chemla–Romeu-Santos, Emmanuel Vincent, Gérard Assayag
article: International Computer Music Conference, Oct 2017, Shanghai, China
Accès au texte intégral et bibtex

titre: Lévy NMF for Robust Nonnegative Source Separation
auteur: Paul Magron, Roland Badeau, Antoine Liutkus
article: IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA 2017), IEEE, Oct 2017, New Paltz, NY, United States
Accès au texte intégral et bibtex

titre: Explaining the Parameterized Wiener Filter with Alpha-Stable Processes
auteur: Mathieu Fontaine, Antoine Liutkus, Laurent Girin, Roland Badeau
article: WASPAA 2017 – IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, Oct 2017, New Paltz, New York, United States
Accès au texte intégral et bibtex

titre: Leveraging deep neural networks with nonnegative representations for improved environmental sound classification
auteur: Victor Bisot, Romain Serizel, Slim Essid, Gael Richard
article: IEEE International Workshop on Machine Learning for Signal Processing MLSP, Sep 2017, Tokyo, Japan
Accès au texte intégral et bibtex

titre: A diagonal plus low-rank covariance model for computationally efficient source separation
auteur: Antoine Liutkus, Kazuyoshi Yoshii
article: IEEE international workshop on machine learning for signal processing (MLSP), Sep 2017, Tokyo, Japan
Accès au texte intégral et bibtex

titre: Etre parent d’enfant atteint des troubles du spectre de l’autisme : Le stress parental à travers l’analyse interprétative phénoménologique
auteur: Tamara Léonova, Delphine Sardin, Aline Gosse, Marie Robert, Agnès Piquard-Kipffer, Philippe Claudon, Stéphanie Claudel, Stéphanie Caharel
article: 14ème congrès international de recherche sur le handicap, Sep 2017, Geneva, Switzerland
Accès au texte intégral et bibtex

titre: When mismatched training data outperform matched data
auteur: Emmanuel Vincent
article: Systematic approaches to deep learning methods for audio, Sep 2017, Vienna, Austria
Accès au texte intégral et bibtex

titre: Lévy NMF : un modèle robuste de séparation de sources non-négatives
auteur: Paul Magron, Roland Badeau, Antoine Liutkus
article: Colloque GRETSI, Sep 2017, Juan-Les-Pins, France
Accès au texte intégral et bibtex

titre: L’anxiété et les symptômes dépressifs chez les parents d’enfants atteints de syndrome de Dravet
auteur: Tamara Léonova, Anne de Saint-Martin, Rima Nabbout, Stéphane Auvin, Marie Robert, Stéphanie Caharel, Nathalie Coqué, Agnès Piquard-Kipffer
article: SFP 2017 – 58ème Congrès Annuel de la Société Française de Psychologie, Aug 2017, Nice, France. pp.1-2
Accès au texte intégral et bibtex

titre: Performance Analysis of Several Pitch Detection Algorithms on Simulated and Real Noisy Speech Data
auteur: Denis Jouvet, Yves Laprie
article: EUSIPCO’2017, 25th European Signal Processing Conference, Aug 2017, Kos, Greece
Accès au texte intégral et bibtex

titre: Scalable Source Localization with Multichannel Alpha-Stable Distributions
auteur: Mathieu Fontaine, Charles Vanwynsberghe, Antoine Liutkus, Roland Badeau
article: 25th European Signal Processing Conference (EUSIPCO), Aug 2017, Kos, Greece. pp.11-15
Accès au texte intégral et bibtex

titre: On the quality of an expressive audiovisual corpus: a case study of acted speech
auteur: Slim Ouni, Sara Dahmani, Vincent Colotte
article: The 14th International Conference on Auditory-Visual Speech Processing, KTH, Aug 2017, Stockholm, Sweden
Accès au texte intégral et bibtex

titre: End-to-End Acoustic Feedback in Language Learning for Correcting Devoiced French Final-Fricatives
auteur: Sucheta Ghosh, Camille Fauth, Yves Laprie, Aghilas Sini
article: Interspeech 2017, Aug 2017, Stockholm, Sweden. pp.1-5, ⟨10.21437/Interspeech.2017-1031⟩
Accès au texte intégral et bibtex

titre: Glottal Opening and Strategies of Production of Fricatives
auteur: Benjamin Elie, Yves Laprie
article: Interspeech 2017, Aug 2017, Stockholm, Sweden. pp.206-209, ⟨10.21437/Interspeech.2017-1039⟩
Accès au texte intégral et bibtex

titre: Generating Equivalent Chord Progressions to Enrich Guided Improvisation : Application to Rhythm Changes
auteur: Ken Déguernel, Jérôme Nika, Emmanuel Vincent, Gérard Assayag
article: SMC 2017 – 14th Sound and Music Computing Conference, Jul 2017, Espoo, Finland. pp.8
Accès au texte intégral et bibtex

titre: Annotation of discourse particles in French over a large variety of speech corpora
auteur: Katarina Bartkova, Mathilde Dargnat, Denis Jouvet, Lou Lee
article: ACor4French – Les corpus annotés du français, TALN’2017 – Traitement Automatique des Langues Naturelles, Jun 2017, Orléans, France
Accès au texte intégral et bibtex

titre: Gaussian framework for interference reduction in live recordings
auteur: Diego Di Carlo, Ken Déguernel, Antoine Liutkus
article: AES International Conference on Semantic Audio, Jun 2017, Erlangen, Germany
Accès au texte intégral et bibtex

titre: Segmentation and Classification of Opinions with Recurrent Neural Networks
auteur: Imran Ahamad Sheikh, Irina Illina, Dominique Fohr
article: IEEE Information Systems and Economic Intelligence, May 2017, Al Hoceima, Morocco
Accès au texte intégral et bibtex

titre: New Paradigm in Speech Recognition: Deep Neural Networks
auteur: Dominique Fohr, Odile Mella, Irina Illina
article: IEEE International Conference on Information Systems and Economic Intelligence, Apr 2017, Marrakech, Morocco
Accès au texte intégral et bibtex

titre: An enhanced automatic speech recognition system for Arabic
auteur: Mohamed Amine Menacer, Odile Mella, Dominique Fohr, Denis Jouvet, David Langlois, Kamel Smaïli
article: The third Arabic Natural Language Processing Workshop – EACL 2017, Apr 2017, Valencia, Spain
Accès au texte intégral et bibtex

titre: Supervised Group Nonnegative Matrix Factorisation With Similarity Constraints And Applications To Speaker Identification
auteur: Romain Serizel, Victor Bisot, Slim Essid, Gael Richard
article: IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Mar 2017, New Orleans, United States
Accès au texte intégral et bibtex

titre: Discriminative importance weighting of augmented training data for acoustic model training
auteur: Sunit Sivasankaran, Emmanuel Vincent, Irina Illina
article: 42nd International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2017), Mar 2017, New Orleans, United States
Accès au texte intégral et bibtex

titre: Towards Confidence Measures on Fundamental Frequency Estimations
auteur: Boyuan Deng, Denis Jouvet, Yves Laprie, Ingmar Steiner, Aghilas Sini
article: IEEE International Conference on Acoustics, Speech and Signal Processing, Mar 2017, New Orleans, United States
Accès au texte intégral et bibtex

titre: Recursive Bayesian estimation of the acoustic noise emitted by wind farms
auteur: Baldwin Dumortier, Emmanuel Vincent, Madalina Deaconu
article: 2017 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Mar 2017, New Orleans, United States
Accès au texte intégral et bibtex

titre: Very Low Bitrate Spatial Audio Coding with Dimensionality Reduction
auteur: Christian Rohlfing, Jeremy E Cohen, Antoine Liutkus
article: 42nd International Conference on Acoustics, Speech and Signal Processing (ICASSP), Mar 2017, New Orleans, United States
Accès au texte intégral et bibtex

titre: Quantization-aware Parameter Estimation for Audio Upmixing
auteur: Christian Rohlfing, Antoine Liutkus, Julian M Becker
article: 42nd International Conference on Acoustics, Speech and Signal Processing (ICASSP), Mar 2017, New Orleans, United States
Accès au texte intégral et bibtex

titre: A multi-resolution approach to common fate-based audio separation
auteur: Fatemeh Pishdadian, Bryan Pardo, Antoine Liutkus
article: 42nd International Conference on Acoustics, Speech and Signal Processing (ICASSP), Mar 2017, New Orleans, United States
Accès au texte intégral et bibtex

titre: User Assisted Separation of Repeating Patterns in Time and Frequency using Magnitude Projections
auteur: Derry Fitzgerald, Zafar Rafii, Antoine Liutkus
article: 42nd International Conference on Acoustics, Speech and Signal Processing (ICASSP), Mar 2017, New Orleans, United States
Accès au texte intégral et bibtex

titre: Alpha-Stable Multichannel Audio Source Separation
auteur: Simon Leglaive, Umut Şimşekli, Antoine Liutkus, Roland Badeau, Gael Richard
article: 42nd International Conference on Acoustics, Speech and Signal Processing (ICASSP), IEEE, Mar 2017, New Orleans, United States
Accès au texte intégral et bibtex

titre: An extended experimental investigation of DNN uncertainty propagation for noise robust ASR
auteur: Karan Nathwani, Juan A Morales-Cordovilla, Sunit Sivasankaran, Irina Illina, Emmanuel Vincent
article: 5th Joint Workshop on Hands-free Speech Communication and Microphone Arrays (HSCMA 2017), Mar 2017, San Francisco, United States
Accès au texte intégral et bibtex

titre: Long-term robot motion planning for active sound source localization with Monte Carlo tree search
auteur: Quan Nguyen Van, Francis Colas, Emmanuel Vincent, François Charpillet
article: HSCMA 2017 – Hands-free Speech Communication and Microphone Arrays, Mar 2017, San Francisco, United States
Accès au texte intégral et bibtex

titre: Sketching for nearfield acoustic imaging of heavy-tailed sources
auteur: Mathieu Fontaine, Charles Vanwynsberghe, Antoine Liutkus, Roland Badeau
article: 13th International Conference on Latent Variable Analysis and Signal Separation (LVA/ICA 2017), Feb 2017, Grenoble, France. pp.80-88, ⟨10.1007/978-3-319-53547-0_8⟩
Accès au texte intégral et bibtex

titre: The 2016 Signal Separation Evaluation Campaign
auteur: Antoine Liutkus, Fabian-Robert Stöter, Zafar Rafii, Daichi Kitamura, Bertrand Rivet, Nobutaka Ito, Nobutaka Ono, Julie Fontecave
article: LVA/ICA 2017 – 13th International Conference on Latent Variable Analysis and Signal Separation, Feb 2017, Grenoble, France. pp.323-332, ⟨10.1007/978-3-319-53547-0_31⟩
Accès au texte intégral et bibtex

Book sections

titre: The CHiME challenges: Robust speech recognition in everyday environments
auteur: Jon Barker, Ricard Marxer, Emmanuel Vincent, Shinji Watanabe
article: New era for robust speech recognition – Exploiting deep learning, Springer, pp.327-344, 2017
Accès au texte intégral et bibtex

titre: Multiview approaches to event detection and scene analysis
auteur: Slim Essid, Sanjeel Parekh, Ngoc Q. K. Duong, Romain Serizel, Alexey Ozerov, Fabio Antonacci, Augusto Sarti
article: Tuomas Virtanen; Mark D. Plumbley; Dan Ellis. Computational Analysis of Sound Scenes and Events, Springer, pp.243-276, 2017, 978-3-319-63449-4. ⟨10.1007/978-3-319-63450-0_9⟩
Accès au texte intégral et bibtex

titre: Acoustic Features for Environmental Sound Analysis
auteur: Romain Serizel, Victor Bisot, Slim Essid, Gael Richard
article: Tuomas Virtanen; Mark D. Plumbley; Dan Ellis. Computational Analysis of Sound Scenes and Events, Springer International Publishing AG, pp.71-101, 2017, 978-3-319-63449-4. ⟨10.1007/978-3-319-63450-0_4⟩
Accès au texte intégral et bibtex

titre: Multiview Approaches to Event Detection and Scene Analysis
auteur: Slim Essid, Sanjeel Parekh, Ngoc Q. K. Duong, Romain Serizel, Alexey Ozerov, Fabio Antonacci, Augusto Sarti
article: Computational Analysis of Sound Scenes and Events, Springer International Publishing AG, 2017
Accès au bibtex

Master thesis

titre: Apprentissage par renforcement pour l’improvisation musicale automatique
auteur: Rémi Decelle
article: Artificial Intelligence [cs.AI]. 2017
Accès au texte intégral et bibtex

Other publications

titre: La musique comme une langue
auteur: Ken Déguernel, Nathan Libermann, Emmanuel Vincent
article: 2017
Accès au texte intégral et bibtex

Books

titre: Segmental, prosodic and fluency features in phonetic learner corpora: Special issue of the International Journal of Learner Corpus Research 3:2
auteur: Jürgen Trouvain, Frank Zimmerer, Bernd Möbius, Maria Gosy, Anne Bonneau
article: John Benjamins Publishing Company, 3 (2), pp.176, 2017, Segmental, prosodic and fluency features in phonetic learner corpora, ⟨10.1075/ijlcr.3.2⟩
Accès au bibtex

Proceedings

titre: The proceedings of the 14th International Conference on Auditory-Visual Speech Processing
auteur: Slim Ouni, Chris Davis, Alexandra Jesse, Jonas Beskow
article: The 14th International Conference on Auditory-Visual Speech Processing (AVSP2017), Aug 2017, Stockholm, Sweden. 2017
Accès au bibtex

Theses

titre: Deep neural networks for source separation and noise-robust speech recognition
auteur: Aditya Arie Nugraha
article: Signal and Image Processing. Université de Lorraine, 2017. English. ⟨NNT : 2017LORR0212⟩
Accès au texte intégral et bibtex

titre: Mapping of a sound environment by a mobile robot
auteur: van Quan Nguyen
article: Robotics [cs.RO]. Université de Lorraine, 2017. English. ⟨NNT : 2017LORR0172⟩
Accès au texte intégral et bibtex

2016

Journal articles

titre: Extension of the single-matrix formulation of the vocal tract: consideration of bilateral channels and connection of self-oscillating models of the vocal folds with a glottal chink
auteur: Benjamin Elie, Yves Laprie
article: Speech Communication, 2016, 82, pp.85-96. ⟨10.1016/j.specom.2016.06.002⟩
Accès au texte intégral et bibtex

titre: Variational Bayesian Inference for Source Separation and Robust Feature Extraction
auteur: Kamil Adiloğlu, Emmanuel Vincent
article: IEEE Transactions on Audio, Speech and Language Processing, 2016, ⟨10.1109/TASLP.2016.2583794⟩
Accès au texte intégral et bibtex

titre: Multichannel audio source separation with deep neural networks
auteur: Aditya Arie Nugraha, Antoine Liutkus, Emmanuel Vincent
article: IEEE/ACM Transactions on Audio, Speech and Language Processing, 2016, 24 (10), pp.1652-1664. ⟨10.1109/TASLP.2016.2580946⟩
Accès au texte intégral et bibtex

titre: System & Contrast: A Polymorphous Model of the Inner Organization of Structural Segments within Music Pieces
auteur: Frédéric Bimbot, Emmanuel Deruty, Gabriel Sargent, Emmanuel Vincent
article: Music Perception, 2016, 33 (5), pp.631-661. ⟨10.1525/mp.2016.33.5.631⟩
Accès au texte intégral et bibtex

titre: Is markerless acquisition of speech production accurate?
auteur: Slim Ouni, Sara Dahmani
article: Journal of the Acoustical Society of America, 2016, 139 (6), pp.EL234. ⟨10.1121/1.4954497⟩
Accès au texte intégral et bibtex

titre: Projection-based demixing of spatial audio
auteur: Derry Fitzgerald, Antoine Liutkus, Roland Badeau
article: IEEE Transactions on Audio, Speech and Language Processing, 2016, ⟨10.1109/TASLP.2016.2570945⟩
Accès au texte intégral et bibtex

titre: Fusion methods for speech enhancement and audio source separation
auteur: Xabier Jaureguiberry, Emmanuel Vincent, Gael Richard
article: IEEE Transactions on Audio, Speech and Language Processing, 2016, ⟨10.1109/TASLP.2016.2553441⟩
Accès au texte intégral et bibtex

titre: Faire voir une histoire : Louis et son incroyable chien Noisette
auteur: Agnès Piquard-Kipffer
article: Les Cahiers Pédagogiques, 2016, Dossier Lire et écrire avec la littérature numérique coordonné par Yaël Boublil et Jacques Crinon., Hors série numérique N°42, pp.7
Accès au bibtex

titre: Démixer la musique
auteur: Antoine Liutkus, Emmanuel Vincent
article: Interstices, 2016
Accès au bibtex

titre: Extraction d’un modèle articulatoire à partir d’une analyse tri-directionnelle de cinéradiographies d’un locuteur
auteur: Martine Cadot, Yves Laprie
article: Revue des Nouvelles Technologies de l’Information, 2016, Fouille de Données Complexes (RNTI-E-31), pp.73-92
Accès au bibtex

titre: Multimodal acquisition of articulatory data: Geometrical and temporal registration
auteur: Michaël Aron, Marie-Odile Berger, Erwan Kerrien, Brigitte Wrobel-Dautcourt, Blaise Potard, Yves Laprie
article: Journal of the Acoustical Society of America, 2016, 139 (2), pp.13. ⟨10.1121/1.4940666⟩
Accès au texte intégral et bibtex

Conference papers

titre: Dynamic adjustment of language models for automatic speech recognition using word similarity
auteur: Anna Currey, Irina Illina, Dominique Fohr
article: IEEE Workshop on Spoken Language Technology (SLT 2016), Dec 2016, San Diego, CA, United States
Accès au texte intégral et bibtex

titre: A study of speech distortion conditions in real scenarios for speech processing applications
auteur: Dayana Ribas, Emmanuel Vincent, José Ramón Calvo
article: 2016 IEEE Workshop on Spoken Language Technology, Dec 2016, San Diego, United States
Accès au texte intégral et bibtex

titre: Weakly-supervised text-to-speech alignment confidence measure
auteur: Guillaume Serrière, Christophe Cerisara, Dominique Fohr, Odile Mella
article: International Conference on Computational Linguistics (COLING), Dec 2016, Osaka, Japan
Accès au texte intégral et bibtex

titre: Storytelling with a digital album that use an avatar as narrator
auteur: Agnès Piquard-Kipffer
article: XVIèmes rencontres internationales en orthophonie – Orthophonie et technologies innovantes, Dec 2016, Paris, France
Accès au texte intégral et bibtex

titre: Acoustic and Visual Analysis of Expressive Speech: A Case Study of French Acted Speech
auteur: Slim Ouni, Vincent Colotte, Sara Dahmani, Soumaya Azzi
article: Interspeech 2016, ISCA, Nov 2016, San Francisco, United States. pp.580-584, ⟨10.21437/Interspeech.2016-730⟩
Accès au texte intégral et bibtex

titre: Localizing an intermittent and moving sound source using a mobile robot
auteur: van Quan Nguyen, Francis Colas, Emmanuel Vincent, François Charpillet
article: International Conference on Intelligent Robots and Systems (IROS), Oct 2016, Daejeon, South Korea
Accès au texte intégral et bibtex

titre: Machine listening techniques as a complement to video image analysis in forensics
auteur: Romain Serizel, Victor Bisot, Slim Essid, Gael Richard
article: IEEE International Conference on Image Processing, Sep 2016, Phoenix, AZ, United States. pp.948-952, ⟨10.1109/ICIP.2016.7532497⟩
Accès au texte intégral et bibtex

titre: Mini-batch stochastic approaches for accelerated multiplicative updates in nonnegative matrix factorisation with beta-divergence
auteur: Romain Serizel, Slim Essid, Gael Richard
article: IEEE International Workshop on Machine Learning for Signal Processing (MLSP 2016), Sep 2016, Salerno, Italy
Accès au texte intégral et bibtex

titre: L1-L2 Interference: The case of final devoicing of French voiced fricatives in final position by German learners
auteur: Sucheta Ghosh, Camille Fauth, Aghilas Sini, Yves Laprie
article: Interspeech 2016, Sep 2016, San Francisco, United States. pp.3156-3160, ⟨10.21437/Interspeech.2016-954⟩
Accès au texte intégral et bibtex

titre: A French corpus for distant-microphone speech processing in real homes
auteur: Nancy Bertin, Ewen Camberlein, Emmanuel Vincent, Romain Lebarbenchon, Stéphane Peillon, Éric Lamandé, Sunit Sivasankaran, Frédéric Bimbot, Irina Illina, Ariane Tom, Sylvain Fleury, Eric Jamet
article: Interspeech 2016, Sep 2016, San Francisco, United States
Accès au texte intégral et bibtex

titre: Improved Neural Bag-of-Words Model to Retrieve Out-of-Vocabulary Words in Speech Recognition
auteur: Imran Sheikh, Irina Illina, Dominique Fohr, Georges Linares
article: Interspeech 2016, Sep 2016, San Francisco, United States. ⟨10.21437/Interspeech.2016-1219⟩
Accès au texte intégral et bibtex

titre: Copy synthesis of running speech based on vocal tract imaging and audio recording
auteur: Benjamin Elie, Yves Laprie
article: 22nd International Congress on Acoustics (ICA), Sep 2016, Buenos Aires, Argentina
Accès au texte intégral et bibtex

titre: Robust tonal and noise separation in presence of colored noise, and application to voiced fricatives
auteur: Benjamin Elie, Gilles Chardon
article: 22nd International Congress on Acoustics (ICA), Sep 2016, Buenos Aires, Argentina
Accès au texte intégral et bibtex

titre: Experiments on the DCASE Challenge 2016: Acoustic scene classification and sound event detection in real life recording
auteur: Benjamin Elizalde, Anurag Kumar, Ankit Shah, Rohan Badlani, Emmanuel Vincent, Bhiksha Raj, Ian Lane
article: DCASE2016 Workshop on Detection and Classification of Acoustic Scenes and Events, Sep 2016, Budapest, Hungary
Accès au texte intégral et bibtex

titre: Using Multidimensional Sequences For Improvisation In The OMax Paradigm
auteur: Ken Déguernel, Emmanuel Vincent, Gérard Assayag
article: 13th Sound and Music Computing Conference, Aug 2016, Hamburg, Germany
Accès au texte intégral et bibtex

titre: Copy synthesis of phrase-level utterances
auteur: Benjamin Elie, Yves Laprie
article: EUSIPCO2016, Aug 2016, Budapest, Hungary
Accès au texte intégral et bibtex

titre: Multichannel Music Separation with Deep Neural Networks
auteur: Aditya Arie Nugraha, Antoine Liutkus, Emmanuel Vincent
article: European Signal Processing Conference (EUSIPCO), Aug 2016, Budapest, Hungary. pp.1748-1752
Accès au texte intégral et bibtex

titre: High spatiotemporal cineMRI films using compressed sensing for acquiring articulatory data
auteur: Benjamin Elie, Yves Laprie, Pierre-André Vuissoz, Freddy Odille
article: 24th European Signal Processing Conference – EUSIPCO2016, Aug 2016, Budapest, Hungary. ⟨10.1109/EUSIPCO.2016.7760469⟩
Accès au texte intégral et bibtex

titre: Evaluation of Audio Source Separation Models Using Hypothesis-Driven Non-Parametric Statistical Methods
auteur: Andrew J R Simpson, Gerard Roma, Emad M Grais, Russell D Mason, Chris Hummersone, Antoine Liutkus, Mark D Plumbley
article: European Signal Processing Conference, EURASIP, Aug 2016, Budapest, Hungary
Accès au texte intégral et bibtex

titre: Learning Word Importance with the Neural Bag-of-Words Model
auteur: Imran Sheikh, Irina Illina, Dominique Fohr, Georges Linares
article: ACL, Representation Learning for NLP (Repl4NLP) workshop, Aug 2016, Berlin, Germany
Accès au texte intégral et bibtex

titre: Methods of investigating vowel interferences of French learners of German
auteur: Frank Zimmerer, Jürgen Trouvain, Anne Bonneau
article: New Sounds 2016, Jun 2016, Aarhus, Denmark
Accès au texte intégral et bibtex

titre: Influence of L1 prominence on L2 production: French and German speakers
auteur: Frank Zimmerer, Anne Bonneau, Bistra Andreeva
article: Speech Prosody 2016, May 2016, Boston, United States. pp.370-374, ⟨10.21437/SpeechProsody.2016-76⟩
Accès au texte intégral et bibtex

titre: Prosodic Parameters and Prosodic Structures of French Emotional Data
auteur: Katarina Bartkova, Denis Jouvet, Elisabeth Delais-Roussarie
article: Speech Prosody 2016, May 2016, Boston, United States
Accès au texte intégral et bibtex

titre: How Diachronic Text Corpora Affect Context based Retrieval of OOV Proper Names for Audio News
auteur: Imran Sheikh, Irina Illina, Dominique Fohr
article: LREC 2016, May 2016, Portorož, Slovenia
Accès au texte intégral et bibtex

titre: The IFCASL Corpus of French and German Non-native and Native Read Speech
auteur: Jürgen Trouvain, Anne Bonneau, Vincent Colotte, Camille Fauth, Dominique Fohr, Denis Jouvet, Jeanin Jügler, Yves Laprie, Odile Mella, Bernd Möbius, Frank Zimmerer
article: LREC’2016, 10th edition of the Language Resources and Evaluation Conference, May 2016, Portorož, Slovenia
Accès au texte intégral et bibtex

titre: Acquisition temps-réel de données articulatoires par IRM : application à la synthèse par copie
auteur: Benjamin Elie, Yves Laprie, Pierre-André Vuissoz
article: 13ème Congrès Français d’Acoustique (CFA 2016), SFA, Apr 2016, Le Mans, France
Accès au texte intégral et bibtex

titre: Séparation de sources: quand l’acoustique rencontre le machine learning
auteur: Emmanuel Vincent
article: 13e Congrès Français d’Acoustique, Apr 2016, Le Mans, France
Accès au texte intégral et bibtex

titre: A glottal chink model for the synthesis of voiced fricatives
auteur: Benjamin Elie, Yves Laprie
article: International Conference on Acoustics, Speech and Signal Processing (ICASSP), IEEE, Mar 2016, Shanghai, China
Accès au texte intégral et bibtex

titre: Document Level Semantic Context for Retrieving OOV Proper Names
auteur: Imran Sheikh, Irina Illina, Dominique Fohr, Georges Linares
article: 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Mar 2016, Shanghai, China. pp.6050-6054, ⟨10.1109/ICASSP.2016.7472839⟩
Accès au texte intégral et bibtex

titre: Du fichier audio à l’intonation en Français : Graphes pour l’apprentissage de 3 classes intonatives
auteur: Martine Cadot, Anne Bonneau
article: Fouille de données complexes (FDC@EGC2016), Jan 2016, Reims, France
Accès au texte intégral et bibtex

titre: PROJET – Spatial Audio Separation Using Projections
auteur: Derry Fitzgerald, Antoine Liutkus, Roland Badeau
article: 41st International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2016, Shanghai, China
Accès au texte intégral et bibtex

titre: Common Fate Model for Unison source Separation
auteur: Fabian-Robert Stöter, Antoine Liutkus, Roland Badeau, Bernd Edler, Paul Magron
article: 41st International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2016, Shanghai, China
Accès au texte intégral et bibtex

Book sections

titre: Recoder les variables pour obtenir un modèle implicatif optimal
auteur: Martine Cadot
article: Régis Gras. L’Analyse Statistique Implicative, Cépaduès, 2016
Accès au texte intégral et bibtex

titre: Temporal and Lexical Context of Diachronic Text Documents for Automatic Out-Of-Vocabulary Proper Name Retrieval
auteur: Irina Illina, Dominique Fohr, Georges Linares, Imane Nkairi
article: Zygmunt Vetulani; Hans Uszkoreit; Marek Kubis. Human Language Technology. Challenges for Computer Science and Linguistics, 9561, Springer, pp.41-54, 2016, Lecture Notes in Computer Science, 978-3-319-43808-5. ⟨10.1007/978-3-319-43808-5_4⟩
Accès au texte intégral et bibtex

Patents

titre: Dispositif de traitement d’image
auteur: Slim Ouni, Guillaume Gris
article: France, Patent No.: 15 52058. 2016
Accès au bibtex

Poster communications

titre: Improvisation musicale multidimensionnelle dans le paradigme OMax
auteur: Ken Déguernel, Emmanuel Vincent, Gérard Assayag
article: Journées Jeunes Chercheurs en Acoustique, Audition et Signal, Nov 2016, Paris, France. 2016
Accès au texte intégral et bibtex

Proceedings

titre: Instrumentations pour l’étude des consonnes géminées du tarifit
auteur: Fayssal Bouarourou, Béatrice Vaxelaire, Yves Laprie, Rachid Ridouane, Rudolph Sock
article: Actes de la conférence internationale sur les Technologies d’Information et de Communication pour l’AMazighe (TICAM 2016), 2016
Accès au bibtex

Reports

titre: Supplementary material to the article: Estimating the structural segmentation of popular music pieces under regularity constraints
auteur: Gabriel Sargent, Frédéric Bimbot, Emmanuel Vincent
article: [Research Report] IRISA-INRIA, Campus de Beaulieu, 35042 Rennes cedex; INRIA Nancy, équipe Multispeech. 2016
Accès au texte intégral et bibtex

titre: Generalized Wiener filtering for positive alpha-stable random variables
auteur: Paul Magron, Roland Badeau, Antoine Liutkus
article: [Research Report] 2016D000, Télécom ParisTech. 2016
Accès au texte intégral et bibtex

Theses

titre: Exploiting Semantic and Topic Context to Improve Recognition of Proper Names in Diachronic Audio Documents
auteur: Imran Sheikh
article: Human-Computer Interaction [cs.HC]. Université de Lorraine, 2016. English. ⟨NNT : 2016LORR0260⟩
Accès au texte intégral et bibtex

Preprints, Working Papers, …

titre: Efficient optimisation of wind power under acoustic constraints
auteur: Baldwin Dumortier, Emmanuel Vincent, Madalina Deaconu, Patrice Cornu
article: 2016
Accès au texte intégral et bibtex

2015

Journal articles

titre: Nonparametric uncertainty estimation and propagation for noise robust ASR
auteur: Dung T. Tran, Emmanuel Vincent, Denis Jouvet
article: IEEE/ACM Transactions on Audio, Speech and Language Processing, 2015, 23 (11), pp.1835-1846. ⟨10.1109/TASLP.2015.2450497⟩
Accès au texte intégral et bibtex

titre: A preliminary study on improving the recognition of esophageal speech using a hybrid system based on statistical voice conversion
auteur: Othman Lachhab, Joseph Di Martino, El Hassane Ibn Elhaj, Ahmed Hammouch
article: SpringerPlus, 2015, ⟨10.1186/s40064-015-1428-2⟩
Accès au bibtex

titre: An architectural comparison of signal reconstruction algorithms from short-time Fourier transform magnitude spectra
auteur: Mouhcine Chami, Maryem Immassi, Joseph Di Martino
article: International Journal of Speech Technology, 2015, 18 (3), pp.9. ⟨10.1007/s10772-015-9281-9⟩
Accès au bibtex

titre: Alpha-Stable Matrix Factorization
auteur: Umut Şimşekli, Antoine Liutkus, Taylan Cemgil
article: IEEE Signal Processing Letters, 2015, 22 (12), pp.2289-2293. ⟨10.1109/LSP.2015.2477535⟩
Accès au texte intégral et bibtex

titre: Multi-channel audio source separation using multiple deformed references
auteur: Nathan Souviraà-Labastie, Anaik Olivero, Emmanuel Vincent, Frédéric Bimbot
article: IEEE Transactions on Audio, Speech and Language Processing, 2015, 23 (11), pp.1775-1787. ⟨10.1109/taslp.2015.2450494⟩
Accès au texte intégral et bibtex

titre: Blind suppression of nonstationary diffuse noise based on spatial covariance matrix decomposition
auteur: Nobutaka Ito, Emmanuel Vincent, Tomohiro Nakatani, Nobutaka Ono, Shoko Araki, Shigeki Sagayama
article: Journal of Signal Processing Systems, 2015, 79 (2), pp.145-157. ⟨10.1007/s11265-014-0922-z⟩
Accès au texte intégral et bibtex

titre: Reference-less measurement of the transmission matrix of a highly scattering material using a DMD and phase retrieval techniques
auteur: Angélique Drémeau, Antoine Liutkus, David Martina, Ori Katz, Christophe Schülke, Florent Krzakala, Sylvain Gigan, Laurent Daudet
article: Optics Express, 2015, 23 (9), pp.11898-11911. ⟨10.1364/OE.23.011898⟩
Accès au texte intégral et bibtex

titre: Random Calibration for Accelerating MR-ARFI Guided Ultrasonic Focusing in Transcranial Therapy
auteur: Na Liu, Antoine Liutkus, Jean-François Aubry, Laurent Marsac, Mickael Tanter, Laurent Daudet
article: Physics in Medicine and Biology, 2015, 60 (3), pp.21. ⟨10.1088/0031-9155/60/3/1069⟩
Accès au texte intégral et bibtex

Conference papers

titre: Different word representations and their combination for proper name retrieval from diachronic documents
auteur: Irina Illina, Dominique Fohr
article: IEEE Automatic Speech Recognition and Understanding Workshop (ASRU 2015), Dec 2015, Scottsdale, United States
Accès au bibtex

titre: The third 'CHiME' Speech Separation and Recognition Challenge: Dataset, task and baselines
auteur: Jon Barker, Ricard Marxer, Emmanuel Vincent, Shinji Watanabe
article: 2015 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU 2015), Dec 2015, Scottsdale, AZ, United States
Accès au texte intégral et bibtex

titre: Robust ASR using neural network based speech enhancement and feature simulation
auteur: Sunit Sivasankaran, Aditya A Nugraha, Emmanuel Vincent, Juan Andrés Morales Cordovilla, Siddharth Dalmia, Irina Illina, Antoine Liutkus
article: ASRU, Dec 2015, Arizona, United States
Accès au texte intégral et bibtex

titre: Terminal portable de communication et affichage de la reconnaissance vocale. Enjeux et rapports à l’écrit. Étude préliminaire auprès d’adultes déficients auditifs
auteur: Agnès Piquard-Kipffer, Odile Mella, Jérémy Miranda, Denis Jouvet, Luiza Orosanu
article: Ideki 2015 – 3ème colloque international “Didactiques, Métiers de l’Humain, Intelligence collective : construction de savoirs et de dispositifs didactiques”, Dec 2015, Colmar, France. pp.1-15
Accès au texte intégral et bibtex

titre: Neural Networks Revisited for Proper Name Retrieval from Diachronic Documents
auteur: Irina Illina, Dominique Fohr
article: LTC Language & Technology Conference, Nov 2015, Poznan, Poland. pp.120-124
Accès au texte intégral et bibtex

titre: Discourse Particles In French: Prosodic Parameters Extraction and Analysis
auteur: Mathilde Dargnat, Katarina Bartkova, Denis Jouvet
article: International Conference on Statistical Language and Speech Processing, Nov 2015, Budapest, Hungary
Accès au texte intégral et bibtex

titre: Combining lexical and prosodic features for automatic detection of sentence modality in French
auteur: Luiza Orosanu, Denis Jouvet
article: International Conference on Statistical Language and Speech Processing, Nov 2015, Budapest, Hungary
Accès au texte intégral et bibtex

titre: Acoustical Frame Rate and Pronunciation Variant Statistics
auteur: Denis Jouvet, Katarina Bartkova
article: International Conference on Statistical Language and Speech Processing, Nov 2015, Budapest, Hungary
Accès au texte intégral et bibtex

titre: Toward Realistic Expressive Audiovisual Speech Synthesis
auteur: Slim Ouni
article: Expressive Virtual Actors workshop, Gipsa-Lab, Nov 2015, Grenoble, France
Accès au bibtex

titre: Acoustic control of wind farms
auteur: Baldwin Dumortier, Emmanuel Vincent, Madalina Deaconu
article: Ewea 2015 – The European Wind Energy Association Conference, Nov 2015, Paris, France
Accès au texte intégral et bibtex

titre: Pourquoi et comment transformer des variables quantitatives en catégorielles ? Application à l’intonation de la langue française.
auteur: Martine Cadot, Anne Bonneau
article: ASI-8, Nov 2015, Radès, Tunisia
Accès au bibtex

titre: Adding new words into a language model using parameters of known words with similar behavior
auteur: Luiza Orosanu, Denis Jouvet
article: International Conference on Natural Language and Speech Processing, Oct 2015, Algiers, Algeria
Accès au texte intégral et bibtex

titre: Source Separation for Target Enhancement of Food Intake Acoustics from Noisy Recordings
auteur: Antoine Liutkus, Temiloluwa Olubanjo, Elliot Moore, Maysam Ghovanloo
article: IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), Oct 2015, New Paltz, NY, United States
Accès au texte intégral et bibtex

titre: Cauchy Nonnegative Matrix Factorization
auteur: Antoine Liutkus, Derry Fitzgerald, Roland Badeau
article: IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), Oct 2015, New Paltz, NY, United States
Accès au texte intégral et bibtex

titre: Detection of sentence modality on French automatic speech-to-text transcriptions
auteur: Luiza Orosanu, Denis Jouvet
article: International Conference on Natural Language and Speech Processing, Oct 2015, Algiers, Algeria
Accès au texte intégral et bibtex

titre: Textual Data Selection for Language Modelling in the Scope of Automatic Speech Recognition
auteur: Freha Mezzoudj, David Langlois, Denis Jouvet, Abdelkader Benyettou
article: International Conference on Natural Language and Speech Processing, Oct 2015, Algiers, Algeria
Accès au texte intégral et bibtex

titre: The timing of geminate consonants in Tarifit Berber
auteur: Fayssal Bouarourou, Béatrice Vaxelaire, Yves Laprie, Rachid Ridouane, Rudolph Sock
article: 1st International Conference on Natural Language and Speech Processing, Oct 2015, Algiers, Algeria
Accès au texte intégral et bibtex

titre: Qualitative investigation of the display of speech recognition results for communication with deaf people
auteur: Agnès Piquard-Kipffer, Odile Mella, Jérémy Miranda, Denis Jouvet, Luiza Orosanu
article: 6th Workshop on Speech and Language Processing for Assistive Technologies, SIG-SLPAT, Sep 2015, Dresden, Germany. pp.7
Accès au texte intégral et bibtex

titre: German non-native realizations of French voiced fricatives in final position of a group of words
auteur: Anne Bonneau, Martine Cadot
article: Interspeech 2015, Möller, S., Ney, H., Moebius, B., Nöth, E., Sep 2015, Dresden, Germany
Accès au bibtex

titre: Accuracy of a markerless acquisition technique for studying speech articulators
auteur: Andrea Bandini, Slim Ouni, Piero Cosi, Silvia Orlandi, Claudia Manfredi
article: Interspeech 2015, Sep 2015, Dresden, Germany
Accès au texte intégral et bibtex

titre: Continuous Word Representation using Neural Networks for Proper Name Retrieval from Diachronic Documents
auteur: Dominique Fohr, Irina Illina
article: Interspeech 2015, Sep 2015, Dresden, Germany
Accès au bibtex

titre: Uncertainty propagation through deep neural networks
auteur: Ahmed Hussen Abdelaziz, Shinji Watanabe, John R. Hershey, Emmanuel Vincent, Dorothea Kolossa
article: Interspeech 2015, Sep 2015, Dresden, Germany
Accès au texte intégral et bibtex

titre: Full multicondition training for robust i-vector based speaker recognition
auteur: Dayana Ribas, Emmanuel Vincent, José Ramon Calvo
article: Interspeech 2015, Sep 2015, Dresden, Germany
Accès au texte intégral et bibtex

titre: Study of Entity-Topic Models for OOV Proper Name Retrieval
auteur: Imran Sheikh, Irina Illina, Dominique Fohr
article: Interspeech 2015, Sep 2015, Dresden, Germany
Accès au bibtex

titre: Uncertainty propagation for noise robust speaker recognition: the case of NIST-SRE
auteur: Dayana Ribas, Emmanuel Vincent, José Ramon Calvo
article: Interspeech 2015, Sep 2015, Dresden, Germany. pp.5
Accès au texte intégral et bibtex

titre: Analysis of phone confusion matrices in a manually annotated French-German learner corpus
auteur: Denis Jouvet, Anne Bonneau, Jürgen Trouvain, Frank Zimmerer, Yves Laprie, Bernd Möbius
article: Workshop on Speech and Language Technology in Education, Sep 2015, Leipzig, Germany
Accès au texte intégral et bibtex

titre: De l’importance de l’homogénéisation des conventions de transcription pour l’alignement automatique de corpus oraux de parole spontanée
auteur: Dominique Fohr, Odile Mella, Denis Jouvet
article: 8es Journées Internationales de Linguistique de Corpus (JLC2015), Sep 2015, Orléans, France
Accès au texte intégral et bibtex

titre: Detection of Phone Boundaries for Non-Native Speech using French-German Models
auteur: Dominique Fohr, Odile Mella
article: Workshop on Speech and Language Technology in Education, Sep 2015, Leipzig, Germany
Accès au texte intégral et bibtex

titre: Inter-annotator agreement for a speech corpus pronounced by French and German language learners
auteur: Odile Mella, Dominique Fohr, Anne Bonneau
article: Workshop on Speech and Language Technology in Education, ISCA Special Interest Group (SIG) on Speech and Language Technology in Education, Sep 2015, Leipzig, Germany
Accès au texte intégral et bibtex

titre: Evaluation of PNCC and extended spectral subtraction methods for robust speech recognition
auteur: Thibaut Fux, Denis Jouvet
article: EUSIPCO 2015 – 23rd European Signal Processing Conference, Aug 2015, Nice, France
Accès au texte intégral et bibtex

titre: The 2015 Signal Separation Evaluation Campaign
auteur: Nobutaka Ono, Zafar Rafii, Daichi Kitamura, Nobutaka Ito, Antoine Liutkus
article: International Conference on Latent Variable Analysis and Signal Separation (LVA/ICA), Aug 2015, Liberec, Czech Republic. pp.387-395, ⟨10.1007/978-3-319-22482-4_45⟩
Accès au texte intégral et bibtex

titre: Extraction of Temporal Patterns in Multi-rate and Multi-modal Datasets
auteur: Antoine Liutkus, Umut Şimşekli, Taylan Cemgil
article: International Conference on Latent Variable Analysis and Signal Separation (LVA/ICA), Aug 2015, Liberec, Czech Republic
Accès au texte intégral et bibtex

titre: Speech enhancement with LSTM recurrent neural networks and its application to noise-robust ASR
auteur: Felix Weninger, Hakan Erdogan, Shinji Watanabe, Emmanuel Vincent, Jonathan Le Roux, John R. Hershey, Björn Schuller
article: 12th International Conference on Latent Variable Analysis and Signal Separation (LVA/ICA), Aug 2015, Liberec, Czech Republic
Accès au texte intégral et bibtex

titre: One corpus, one research question, three methods “German vowels produced by French speakers”
auteur: Frank Zimmerer, Jürgen Trouvain, Anne Bonneau
article: Workshop on Phonetic learner corpora. Satellite meeting of ICPhS 2015, Trouvain, J., Zimmerer, F., Gosy, M., Bonneau, A., Aug 2015, Glasgow, United Kingdom
Accès au bibtex

titre: Impact of frame rate on automatic speech-text alignment for corpus-based phonetic studies
auteur: Katarina Bartkova, Denis Jouvet
article: ICPhS’2015 – 18th International Congress of Phonetic Sciences, Aug 2015, Glasgow, United Kingdom
Accès au texte intégral et bibtex

titre: 2D Articulatory Velum Modeling Applied to Copy Synthesis of Sentences Containing Nasal Phonemes
auteur: Yves Laprie, Benjamin Elie, Anastasiia Tsukanova
article: International Congress of Phonetic Sciences, Aug 2015, Glasgow, United Kingdom
Accès au texte intégral et bibtex

titre: Realizations of French voiced fricatives by German learners as a function of speaker level and prosodic boundaries
auteur: Anne Bonneau
article: 18th International Congress of Phonetic Sciences, ICPhS 2015, University of Glasgow, Aug 2015, Glasgow, United Kingdom. pp.5
Accès au bibtex

titre: Experience of an International Intensive Project with First Year Programming Students
auteur: James Paterson, Markku Karhu, Walter Cazzola, Irina Illina, Dario Malchiodi, Marisa Maximiano, Catarina Silva
article: IEEE International Computers, Software & Applications Conference, COMPSAC2015, Jul 2015, Taichung, Taiwan. ⟨10.1109/COMPSAC.2015.49⟩
Accès au bibtex

titre: Reconnaissance de la parole, application aux personnes sourdes et malentendantes
auteur: Agnès Piquard-Kipffer
article: Journées scientifiques d’Inria, Inria, Jun 2015, Villers-lès-Nancy, France
Accès au bibtex

titre: An articulatory model of the velum developed from cineradiographic data
auteur: Yves Laprie
article: 169th Meeting: Acoustical Society of America, May 2015, Pittsburgh, United States
Accès au bibtex

titre: Contribution of the acoustic cues to the non-native accent
auteur: Yves Laprie
article: 169th meeting: Acoustical Society of America, May 2015, Pittsburgh, United States
Accès au bibtex

titre: Audio source localization by optimal control of a mobile robot
auteur: Emmanuel Vincent, Aghilas Sini, François Charpillet
article: IEEE 2015 International Conference on Acoustics, Speech and Signal Processing (ICASSP), Apr 2015, Brisbane, Australia
Accès au texte intégral et bibtex

titre: Kernel additive modeling for interference reduction in multi-channel music recordings
auteur: Thomas Prätzlich, Rachel Bittner, Antoine Liutkus, Meinard Müller
article: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Apr 2015, Brisbane, Australia
Accès au texte intégral et bibtex

titre: Generalized Wiener filtering with fractional power spectrograms
auteur: Antoine Liutkus, Roland Badeau
article: 40th International Conference on Acoustics, Speech and Signal Processing (ICASSP), Apr 2015, Brisbane, Australia
Accès au texte intégral et bibtex

titre: Discriminative uncertainty estimation for noise robust ASR
auteur: Dung Tien Tran, Emmanuel Vincent, Denis Jouvet
article: 40th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2015, Apr 2015, Brisbane, Queensland, Australia
Accès au texte intégral et bibtex

titre: A simple user interface system for recovering patterns repeating in time and frequency in mixtures of sounds
auteur: Zafar Rafii, Antoine Liutkus, Bryan Pardo
article: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Apr 2015, Brisbane, Australia
Accès au texte intégral et bibtex

titre: Music separation guided by cover tracks: designing the joint NMF model
auteur: Nathan Souviraà-Labastie, Emmanuel Vincent, Frédéric Bimbot
article: 40th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2015, Apr 2015, Brisbane, Australia
Accès au texte intégral et bibtex

titre: MICbots: collecting large realistic datasets for speech and audio research using mobile robots
auteur: Jonathan Le Roux, Emmanuel Vincent, John R. Hershey, Daniel P.W. Ellis
article: IEEE 2015 International Conference on Acoustics, Speech and Signal Processing (ICASSP), Apr 2015, Brisbane, Australia
Accès au texte intégral et bibtex

titre: Scalable audio separation with light kernel additive modelling
auteur: Antoine Liutkus, Derry Fitzgerald, Zafar Rafii
article: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), IEEE, Apr 2015, Brisbane, Australia
Accès au texte intégral et bibtex

titre: Fast DNN training based on auxiliary function technique
auteur: Dung T. Tran, Nobutaka Ono, Emmanuel Vincent
article: ICASSP 2015 – 40th IEEE International Conference on Acoustics, Speech and Signal Processing, Apr 2015, Brisbane, Queensland, Australia
Accès au texte intégral et bibtex

titre: OOV Proper Name Retrieval using Topic and Lexical Context Model
auteur: Imran Sheikh, Irina Illina, Dominique Fohr, Georges Linarès
article: IEEE International Conference on Acoustics, Speech and Signal Processing, 2015, Brisbane, Australia
Accès au bibtex

titre: Recognition of OOV Proper Names in Diachronic Audio News
auteur: Imran Sheikh, Irina Illina, Dominique Fohr
article: IEEE International Conference on Information Systems and Economic Intelligence, 2015, Hammamet, Tunisia
Accès au bibtex

titre: Neural Networks for Proper Name Retrieval in the Framework of Automatic Speech Recognition
auteur: Dominique Fohr, Irina Illina
article: IEEE International Conference on Information Systems and Economic Intelligence, 2015, Hammamet, Tunisia
Accès au bibtex

Master thesis

titre: Apprentissage de structures multi-dimensionnelles pour l’improvisation musicale
auteur: Ken Deguernel
article: Computation and Language [cs.CL]. 2015
Accès au texte intégral et bibtex

Other publications

titre: La dyslexie du point de vue des chercheurs et des praticiens
auteur: Agnès Piquard-Kipffer
article: 2015
Accès au bibtex

titre: Is audio signal processing still useful in the era of machine learning?
auteur: Emmanuel Vincent
article: 2015
Accès au bibtex

titre: Advanced spatial speech and audio processing
auteur: Emmanuel Vincent, Emanuël A. P. Habets
article: 2015
Accès au bibtex

titre: Les troubles Dys : la dyslexie-dysorthographie
auteur: Agnès Piquard-Kipffer
article: 2015
Accès au bibtex

Poster communications

titre: Improvements for a German Vowel Trainer CAPT Tool
auteur: Patrick Carroll, Jürgen Trouvain, Frank Zimmerer, Yves Laprie, Odile Mella, Dominique Fohr
article: Individualized Feedback for Computer-Assisted Spoken Language Learning, Nov 2015, Tholey, Germany. 2015
Accès au bibtex

titre: Dynamic realistic lip animation using a limited number of control points
auteur: Slim Ouni, Guillaume Gris
article: SIGGRAPH 2015, Aug 2015, Los Angeles, California, United States. ACM, Proceedings of SIGGRAPH ’15: ACM SIGGRAPH 2015 Posters, pp.1, 2015. ⟨10.1145/2787626.2787628⟩
Accès au bibtex

titre: Sound synchronization and motion compensated reconstruction for speech Cine MRI
auteur: Pierre-André Vuissoz, Freddy Odille, Yves Laprie, Emmanuel Vincent, Jacques Felblinger
article: ISMRM 2015 Annual Meeting, May 2015, Toronto, Canada
Accès au texte intégral et bibtex

titre: Synchronisation vocale et mouvement compensé en reconstruction pour une ciné IRM de la parole
auteur: Pierre-André Vuissoz, Freddy Odille, Emmanuel Vincent, Jacques Felblinger, Yves Laprie
article: 2e Congrès de la SFRMBM (Société Française de Résonance Magnétique en Biologie et Médecine), Mar 2015, Grenoble, France
Accès au texte intégral et bibtex

Proceedings

titre: LNCS 9237 – Proceedings of the 12th International Conference on Latent Variable Analysis and Signal Separation
auteur: Emmanuel Vincent, Arie Yeredor, Zbynek Koldovsky, Petr Tichavsky
article: 12th International Conference, LVA/ICA 2015, Aug 2015, Liberec, Czech Republic. Springer, 2015, 978-3-319-22481-7
Accès au bibtex

Reports

titre: JCorpusRecorder
auteur: Vincent Colotte, Emilien Casano
article: [Technical Report] Université de Lorraine. 2015
Accès au bibtex

titre: Combining blockwise and multi-coefficient stepwise approaches in a general framework for online audio source separation
auteur: Laurent S. R. Simon, Emmanuel Vincent
article: [Research Report] RR-8766, Inria. 2015, pp.18
Accès au texte intégral et bibtex

titre: Listening to features
auteur: Manuel Moussallam, Antoine Liutkus, Laurent Daudet
article: [Research Report] Institut Langevin, ESPCI – CNRS – Paris Diderot University – UPMC. 2015, pp.24
Accès au texte intégral et bibtex

titre: Scale-Space Peak Picking
auteur: Antoine Liutkus
article: [Research Report] Inria Nancy – Grand Est (Villers-lès-Nancy, France). 2015
Accès au texte intégral et bibtex

Theses

titre: Speech recognition as a communication aid for deaf and hearing impaired people
auteur: Luiza Orosanu
article: Signal and Image Processing [eess.SP]. Université de Lorraine, 2015. French. ⟨NNT : 2015LORR0172⟩
Accès au texte intégral et bibtex

titre: Audio motif spotting for guided source separation. Application to movie soundtracks.
auteur: Nathan Souviraà-Labastie
article: Sound [cs.SD]. Université de Rennes 1, 2015. French.
Accès au texte intégral et bibtex

titre: Uncertainty learning for noise robust ASR
auteur: Dung Tien Tran
article: Sound [cs.SD]. Université de Lorraine, 2015. English. ⟨NNT : 2015LORR0236⟩
Accès au texte intégral et bibtex

Preprints, Working Papers, …

titre: A unifying description of dark energy
auteur: Jérôme Gleyzes, David Langlois, Filippo Vernizzi
article: 2015
Accès au bibtex

2014

Journal articles

titre: Quand les sons se séparent
auteur: Emmanuel Vincent, Joanna Jongwane
article: Interstices, 2014
Accès au bibtex

titre: Constitution d’un Corpus de Français Langue Etrangère destiné aux Apprenants Allemands
auteur: Camille Fauth, Anne Bonneau, Odile Mella, Vincent Colotte, Dominique Fohr, Denis Jouvet, Yves Laprie, Jürgen Trouvain
article: SHS Web of Conferences, 2014, 4e Congrès Mondial de Linguistique Française, 8, pp.14. ⟨10.1051/shsconf/20140801186⟩
Accès au texte intégral et bibtex

titre: Comment faire parler les images aux rayons X du conduit vocal ?
auteur: Yves Laprie, Rudolph Sock, Béatrice Vaxelaire, Benjamin Elie
article: SHS Web of Conferences, 2014, 4e Congrès Mondial de Linguistique Française, 8, pp.14. ⟨10.1051/shsconf/20140801344⟩
Accès au texte intégral et bibtex

titre: Imaging With Nature: Compressive Imaging Using a Multiply Scattering Medium
auteur: Antoine Liutkus, David Martina, Sébastien Popoff, Gilles Chardon, Ori Katz, Geoffroy Lerosey, Sylvain Gigan, Laurent Daudet, Igor Carron
article: Scientific Reports, 2014, 4, pp.14. ⟨10.1038/srep05552⟩
Accès au texte intégral et bibtex

titre: Kernel Additive Models for Source Separation
auteur: Antoine Liutkus, Derry Fitzgerald, Zafar Rafii, Bryan Pardo, Laurent Daudet
article: IEEE Transactions on Signal Processing, 2014, pp.14. ⟨10.1109/TSP.2014.2332434⟩
Accès au texte intégral et bibtex

titre: From blind to guided audio source separation: How models and side information can improve the separation of sound
auteur: Emmanuel Vincent, Nancy Bertin, Rémi Gribonval, Frédéric Bimbot
article: IEEE Signal Processing Magazine, 2014, 31 (3), pp.107-115. ⟨10.1109/MSP.2013.2297440⟩
Accès au texte intégral et bibtex

titre: Genre-based music language modelling with latent hierarchical Pitman-Yor process allocation
auteur: Stanislaw Raczynski, Emmanuel Vincent
article: IEEE/ACM Transactions on Audio, Speech and Language Processing, 2014, 22 (3), pp.672-681. ⟨10.1109/TASLP.2014.2300344⟩
Accès au texte intégral et bibtex

titre: Convex regularizations for the simultaneous recording of room impulse responses
auteur: Alexis Benichoux, Laurent S. R. Simon, Emmanuel Vincent, Rémi Gribonval
article: IEEE Transactions on Signal Processing, 2014, ⟨10.1109/TSP.2014.2303431⟩
Accès au texte intégral et bibtex

titre: Ajout de nouveaux noms propres au vocabulaire d’un système de transcription en utilisant un corpus diachronique
auteur: Irina Illina, Dominique Fohr, Georges Linarès
article: Revue TAL : traitement automatique des langues, 2014, 55 (2), pp.47-72
Accès au bibtex

titre: Modal Overlap Factor of a beam with an acoustic black hole termination
auteur: Vivien Denis, Adrien Pelat, François Gautier, Benjamin Elie
article: Journal of Sound and Vibration, 2014, 333 (12), pp.2475-2488. ⟨10.1016/j.jsv.2014.02.005⟩
Accès au texte intégral et bibtex

titre: Tongue control and its implication in pronunciation training
auteur: Slim Ouni
article: Computer Assisted Language Learning, 2014, 27 (5), pp.439-453. ⟨10.1080/09588221.2012.761637⟩
Accès au texte intégral et bibtex

Conference papers

titre: 3D Visual Speech Animation from Image Sequences
auteur: Utpala Musti, Slim Ouni, Zhou Ziheng
article: Indian Conference on Computer Vision, Graphics and Image Processing (ICVGIP), Dec 2014, Bangalore, India
Accès au bibtex

titre: Improving the recognition of pathological voice using the discriminant HLDA transformation
auteur: Othman Lachhab, Joseph Di Martino, El Hassane Ibn Elhaj, Ahmed Hammouch
article: 3rd International IEEE Colloquium on Information Science and Technology, Oct 2014, Tetuan-Chefchaouen, Morocco
Accès au texte intégral et bibtex

titre: Structured GMM Based on Unsupervised Clustering for Recognizing Adult and Child Speech
auteur: Arseniy Gorin, Denis Jouvet
article: SLSP 2014, 2nd International Conference on Statistical Language and Speech Processing, Oct 2014, Grenoble, France. pp.108-119, ⟨10.1007/978-3-319-11397-5_8⟩
Accès au texte intégral et bibtex

titre: An investigation of likelihood normalization for robust ASR
auteur: Emmanuel Vincent, Aggelos Gkiokas, Dominik Schnitzer, Arthur Flexer
article: Interspeech, Sep 2014, Singapore, Singapore
Accès au texte intégral et bibtex

titre: Component Structuring and Trajectory Modeling for Speech Recognition
auteur: Arseniy Gorin, Denis Jouvet
article: Interspeech, Sep 2014, Singapore, Singapore
Accès au texte intégral et bibtex

titre: Pronunciation variation in read and conversational Austrian German
auteur: Barbara Schuppler, Martine Adda-Decker, Juan Andrés Morales Cordovilla
article: 15th Annual Conference of the International Speech Communication Association (INTERSPEECH 2014), Sep 2014, Singapore, Singapore. pp.1453-1457
Accès au bibtex

titre: About Combining Forward and Backward-Based Decoders for Selecting Data for Unsupervised Training of Acoustic Models
auteur: Denis Jouvet, Dominique Fohr
article: INTERSPEECH 2014, 15th Annual Conference of the International Speech Communication Association, Sep 2014, Singapore, Singapore
Accès au texte intégral et bibtex

titre: Hybrid language models for speech transcription
auteur: Luiza Orosanu, Denis Jouvet
article: INTERSPEECH 2014, 15th Annual Conference of the International Speech Communication Association, Sep 2014, Singapore, Singapore
Accès au texte intégral et bibtex

titre: Proper Name Retrieval from Diachronic Documents for Automatic Speech Transcription using Lexical and Temporal Context
auteur: Irina Illina, Dominique Fohr, Georges Linarès
article: Workshop on Speech, Language and Audio in Multimedia, Sep 2014, Penang, Malaysia
Accès au texte intégral et bibtex

titre: Compressed sensing under strong noise. Application to imaging through multiply scattering media
auteur: Antoine Liutkus, David Martina, Sylvain Gigan, Laurent Daudet
article: European Signal Processing Conference (EUSIPCO), Sep 2014, Lisbon, Portugal
Accès au texte intégral et bibtex

titre: Audio source separation using multiple deformed references
auteur: Nathan Souviraà-Labastie, Anaik Olivero, Emmanuel Vincent, Frédéric Bimbot
article: Eusipco, Sep 2014, Lisbon, Portugal
Accès au texte intégral et bibtex

titre: Audiovisual to area and length functions inversion of human tract
auteur: Benjamin Elie, Yves Laprie
article: Eusipco 2014, Sep 2014, Lisbon, Portugal
Accès au texte intégral et bibtex

titre: Perceptual coding-based informed source separation
auteur: Serap Kirbiz, Alexey Ozerov, Antoine Liutkus, Laurent Girin
article: EUSIPCO 2014 – 22nd European Signal Processing Conference, Sep 2014, Lisbon, Portugal
Accès au texte intégral et bibtex

titre: OOPS: une approche orientée objet pour l’interrogation et l’analyse linguistique de l’interface prosodie/syntaxe/discours
auteur: Julie Beliao, Antoine Liutkus
article: 4e Congrès Mondial de Linguistique Française, Jul 2014, Berlin, Germany. pp.2565-2581, ⟨10.1051/shsconf/20140801273⟩
Accès au texte intégral et bibtex

titre: Variational Bayesian model averaging for audio source separation
auteur: Xabier Jaureguiberry, Emmanuel Vincent, Gael Richard
article: SSP (IEEE Workshop on Statistical Signal Processing), Jun 2014, Gold Coast, Australia. pp.4
Accès au texte intégral et bibtex

titre: Multiple-order non-negative matrix factorization for speech enhancement
auteur: Xabier Jaureguiberry, Emmanuel Vincent, Gael Richard
article: Interspeech, Jun 2014, Singapore, Singapore. pp.4
Accès au texte intégral et bibtex

titre: Extension du vocabulaire d’un système de transcription avec de nouveaux noms propres en utilisant un corpus diachronique
auteur: Irina Illina, Dominique Fohr, Georges Linarès
article: Journées d’Etude sur la parole, Jun 2014, Le Mans, France
Accès au texte intégral et bibtex

titre: Explicit trajectories and speaker class modeling for child and adult speech recognition
auteur: Arseniy Gorin, Denis Jouvet
article: XXXème édition des Journées d’Etudes sur la Parole, Jun 2014, Le Mans, France
Accès au texte intégral et bibtex

titre: Combining words and syllables for speech transcription
auteur: Luiza Orosanu, Denis Jouvet
article: XXXème édition des Journées d’Etudes sur la Parole, Jun 2014, Le Mans, France
Accès au texte intégral et bibtex

titre: Harmonic/Percussive Separation Using Kernel Additive Modelling
auteur: Derry Fitzgerald, Antoine Liutkus, Zafar Rafii, Bryan Pardo, Laurent Daudet
article: IET Irish Signals & Systems Conference 2014, Jun 2014, Limerick, Ireland
Accès au texte intégral et bibtex

titre: Designing a Bilingual Speech Corpus for French and German Language Learners: a Two-Step Process
auteur: Camille Fauth, Anne Bonneau, Frank Zimmerer, Jürgen Trouvain, Bistra Andreeva, Vincent Colotte, Dominique Fohr, Denis Jouvet, Jeanin Jügler, Yves Laprie, Odile Mella, Bernd Möbius
article: LREC – 9th Language Resources and Evaluation Conference, The European Language Resources Association, May 2014, Reykjavik, Iceland
Accès au texte intégral et bibtex

titre: Links between Manual Punctuation Marks and Automatically Detected Prosodic Structures
auteur: Katarina Bartkova, Denis Jouvet
article: Speech Prosody 2014, May 2014, Dublin, Ireland
Accès au bibtex

titre: Investigating Stranded GMM for Improving Automatic Speech Recognition
auteur: Arseniy Gorin, Denis Jouvet, Emmanuel Vincent, Dung Tran
article: 4th Joint Workshop on Hands-free Speech Communication and Microphone Arrays (HSCMA 2014), May 2014, Nancy, France
Accès au texte intégral et bibtex

titre: Kernel Spectrogram models for source separation
auteur: Antoine Liutkus, Zafar Rafii, Bryan Pardo, Derry Fitzgerald, Laurent Daudet
article: HSCMA, May 2014, Nancy, France
Accès au texte intégral et bibtex

titre: Geometric articulatory model adapted to the production of consonants
auteur: Yves Laprie, Béatrice Vaxelaire, Martine Cadot
article: 10th International Seminar on Speech Production (ISSP), May 2014, Köln, Germany
Accès au texte intégral et bibtex

titre: Plosive and fricative geminates in Tarifit: an articulatory and acoustic study
auteur: Fayssal Bouarourou, Béatrice Vaxelaire, Yves Laprie, Rachid Ridouane, Marion Bechet, Rudolph Sock
article: International, ISSP 2014, May 2014, Köln, Germany
Accès au texte intégral et bibtex

titre: Investigating the effects of posture and noise on speech production
auteur: Ingmar Steiner, Peter Knopp, Sebastian Musche, Astrid Schmiedel, Angelika Braun, Slim Ouni
article: 10th International Seminar on Speech Production (ISSP), Susanne Fuchs, Martine Grice, Anne Hermes, Leonardo Lancia, Doris Mücke, May 2014, Cologne, Germany
Accès au bibtex

titre: Studying MRI acquisition protocols of sustained sounds with a multimodal acquisition system
auteur: Yves Laprie, Michael Aron, Marie-Odile Berger, Brigitte Wrobel-Dautcourt
article: 10th International Seminar on Speech Production (ISSP), May 2014, Köln, Germany
Accès au texte intégral et bibtex

titre: Extension of uncertainty propagation to dynamic MFCCs for noise robust ASR
auteur: Dung Tien Tran, Emmanuel Vincent, Denis Jouvet
article: 2014 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), May 2014, Florence, Italy
Accès au texte intégral et bibtex

titre: Fusion of Multiple Uncertainty Estimators and Propagators for Noise Robust ASR
auteur: Dung Tien Tran, Denis Jouvet, Emmanuel Vincent
article: 2014 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), May 2014, Florence, Italy
Accès au texte intégral et bibtex

titre: Blind RT60 estimation robust across room sizes and source distances
auteur: Baldwin Dumortier, Emmanuel Vincent
article: 2014 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), May 2014, Florence, Italy
Accès au texte intégral et bibtex

titre: Estimation de la longueur du conduit vocal pour l’inversion acoustique-articulatoire
auteur: Benjamin Elie, Yves Laprie
article: Congrès Français d’Acoustique, SFA, Apr 2014, Poitiers, France
Accès au texte intégral et bibtex

titre: L1-L2 interference: the case of devoicing of French voiced obstruents in final position by German learners – Pilot study
auteur: Camille Fauth, Anne Bonneau
article: International Workshop on Multilinguality in Speech Research: Data, Methods and Models, Bernd Möbius and Jürgen Trouvain, Saarland University, Germany, Apr 2014, Dagstuhl, Germany
Accès au bibtex

titre: Spatial properties of the DEMAND noise recordings
auteur: Joachim Thiemann, Emmanuel Vincent, Steven van de Par
article: 40th Annual German Congress on Acoustics (DAGA 2014), Mar 2014, Oldenburg, Germany
Accès au texte intégral et bibtex

titre: Méthodologie 3-way d’extraction d’un modèle articulatoire de la parole à partir des données d’un locuteur
auteur: Martine Cadot, Yves Laprie
article: Atelier Fouille de Données Complexes des 14èmes Journées Francophones “Extraction et Gestion des Connaissances”, Jan 2014, Rennes, France. pp.1-12
Accès au texte intégral et bibtex

titre: Semiotic Description of Music Structure: an Introduction to the Quaero/Metiss Structural Annotations
auteur: Frédéric Bimbot, Gabriel Sargent, Emmanuel Deruty, Corentin Guichaoua, Emmanuel Vincent
article: AES 53rd International Conference on Semantic Audio, Jan 2014, London, United Kingdom. pp.P1-1
Accès au texte intégral et bibtex

Book sections

titre: REPET for Background/Foreground Separation in Audio
auteur: Zafar Rafii, Antoine Liutkus, Bryan Pardo
article: G.R. Naik and W. Wang. Blind Source Separation, Springer Berlin Heidelberg, pp.395-411, 2014, 978-3-642-55015-7. ⟨10.1007/978-3-642-55016-4_14⟩
Accès au texte intégral et bibtex

Other publications

titre: Critères d’évaluation d’un album numérique pour des enfants en difficulté de langage
auteur: Agnès Piquard-Kipffer
article: 2014, pp.287-309
Accès au bibtex

titre: Les sons à domicile
auteur: Emmanuel Vincent
article: 2014
Accès au bibtex

titre: Evaluation campaigns and reproducibility
auteur: Emmanuel Vincent
article: 2014
Accès au bibtex

titre: Poursuivre une scolarité avec une langue déficiente
auteur: Agnès Piquard-Kipffer
article: 2014
Accès au bibtex

Poster communications

titre: Speech Cine SSFP with optical microphone synchronization and motion compensated reconstruction
auteur: Pierre-André Vuissoz, Freddy Odille, Yves Laprie, Emmanuel Vincent, Gabriela Hossu, Jacques Felblinger
article: ISMRM Workshop on Motion Correction in MRI, Jul 2014, Tromso, Norway. 2014
Accès au texte intégral et bibtex

titre: The Flexible Audio Source Separation Toolbox Version 2.0
auteur: Yann Salaün, Emmanuel Vincent, Nancy Bertin, Nathan Souviraà-Labastie, Xabier Jaureguiberry, Dung T. Tran, Frédéric Bimbot
article: ICASSP, May 2014, Florence, Italy. 2014
Accès au texte intégral et bibtex

Documents associated with scientific events

titre: Phonetic variation in non-native speech
auteur: Anne Bonneau
article: Spring School : “Individual-centered Approaches to Speech Processing”, Apr 2014, Dagstuhl, Germany
Accès au bibtex

Reports

titre: Proof of Wiener-like linear regression of isotropic complex symmetric alpha-stable random variables
auteur: Roland Badeau, Antoine Liutkus
article: 2014
Accès au texte intégral et bibtex

titre: A categorization of robust speech processing datasets
auteur: Jonathan Le Roux, Emmanuel Vincent
article: [Technical Report] Mitsubishi Electric Research Labs TR2014-116, 2014
Accès au texte intégral et bibtex

Theses

titre: Acoustic Model Structuring for Improving Automatic Speech Recognition Performance
auteur: Arseniy Gorin
article: Sound [cs.SD]. Université de Lorraine, 2014. English. ⟨NNT : 2014LORR0161⟩
Accès au texte intégral et bibtex

2013

Journal articles

titre: Melody harmonisation with interpolated probabilistic models
auteur: Stanislaw Raczynski, Satoru Fukayama, Emmanuel Vincent
article: Journal of New Music Research, 2013, 42 (3), pp.223-235. ⟨10.1080/09298215.2013.822000⟩
Accès au bibtex

titre: Acoustic-visual synthesis technique using bimodal unit-selection
auteur: Slim Ouni, Vincent Colotte, Utpala Musti, Asterios Toutios, Brigitte Wrobel-Dautcourt, Marie-Odile Berger, Caroline Lavecchia
article: EURASIP Journal on Audio, Speech, and Music Processing, 2013, 2013:16, ⟨10.1186/1687-4722-2013-16⟩
Accès au texte intégral et bibtex

titre: An episodic memory-based solution for the acoustic-to-articulatory inversion problem
auteur: Sébastien Demange, Slim Ouni
article: Journal of the Acoustical Society of America, 2013, 133 (5), pp.2921-2930. ⟨10.1121/1.4798665⟩
Accès au texte intégral et bibtex

titre: Dynamic Bayesian networks for symbolic polyphonic pitch modeling
auteur: Stanislaw Raczynski, Emmanuel Vincent, Shigeki Sagayama
article: IEEE Transactions on Audio, Speech and Language Processing, 2013, 21 (9), pp.1830-1840. ⟨10.1109/TASL.2013.2258012⟩
Accès au texte intégral et bibtex

titre: The PASCAL CHiME Speech Separation and Recognition Challenge
auteur: Jon Barker, Emmanuel Vincent, Ning Ma, Heidi Christensen, Phil Green
article: Computer Speech and Language, 2013, 27 (3), pp.621-633. ⟨10.1016/j.csl.2012.10.004⟩
Accès au texte intégral et bibtex

titre: Uncertainty-based learning of acoustic models from noisy data
auteur: Alexey Ozerov, Mathieu Lagrange, Emmanuel Vincent
article: Computer Speech and Language, 2013, 27 (3), pp.874-894. ⟨10.1016/j.csl.2012.07.002⟩
Accès au texte intégral et bibtex

titre: Special Issue on Speech Separation and Recognition in Multisource Environments
auteur: Jon Barker, Emmanuel Vincent
article: Computer Speech and Language, 2013, 27 (3), pp.619-620. ⟨10.1016/j.csl.2012.09.005⟩
Accès au bibtex

titre: Consistent Wiener filtering for audio source separation
auteur: Jonathan Le Roux, Emmanuel Vincent
article: IEEE Signal Processing Letters, 2013, 20 (3), pp.217-220. ⟨10.1109/LSP.2012.2225617⟩
Accès au texte intégral et bibtex

titre: Gestion d’erreurs pour la fiabilisation des retours automatiques en apprentissage de la prosodie d’une langue seconde
auteur: Anne Bonneau, Dominique Fohr, Irina Illina, Denis Jouvet, Odile Mella, Larbi Mesbahi, Luiza Orosanu
article: Revue TAL : traitement automatique des langues, 2013, 53 (3)
Accès au bibtex

titre: Extracting Comparable Articles from Wikipedia and Measuring their Comparabilities
auteur: Motaz Saad, David Langlois, Kamel Smaïli
article: Procedia – Social and Behavioral Sciences, 2013, 95, pp.40-47. ⟨10.1016/j.sbspro.2013.10.620⟩
Accès au texte intégral et bibtex

titre: Early predictors of future reading skills: A follow-up of French-speaking children from the beginning of kindergarten to the end of the second grade (age 5 to 8)
auteur: Agnès Piquard-Kipffer, Liliane Sprenger-Charolles
article: L’Année psychologique, 2013, 113 (4), pp.491-521. ⟨10.4074/S0003503313014012⟩
Accès au bibtex

titre: Spatial location priors for Gaussian model based reverberant audio source separation
auteur: Ngoc Duong, Emmanuel Vincent, Rémi Gribonval
article: EURASIP Journal on Advances in Signal Processing, 2013, 2013 (1), pp.149. ⟨10.1186/1687-6180-2013-149⟩
Accès au texte intégral et bibtex

titre: An overview of the CATE algorithms for real-time pitch determination
auteur: Fadoua Bahja, Joseph Di Martino, El Hassan Ibn Elhaj, Driss Aboutajdine
article: Signal, Image and Video Processing, 2013, ⟨10.1007/s11760-013-0488-4⟩
Accès au bibtex

Conference papers

titre: Rauque ‘n’ Roll : La raucité, entre symptôme pathologique & expression artistique
auteur: Melissa Barkat-Defradas, Camille Fauth, Fabrice Hirsch, Benoît Amy de La Bretèque, Jérémi Sauvage, Christelle Dodane
article: 5° Journées de Phonétique Clinique, Dec 2013, Liège, Belgium
Accès au bibtex

titre: The Second ‘CHiME’ Speech Separation and Recognition Challenge: An overview of challenge systems and outcomes
auteur: Emmanuel Vincent, Jon Barker, Shinji Watanabe, Jonathan Le Roux, Francesco Nesta, Marco Matassoni
article: 2013 IEEE Automatic Speech Recognition and Understanding Workshop, Dec 2013, Olomouc, Czech Republic
Accès au texte intégral et bibtex

titre: Efficient constrained parametrization of GMM with class-based mixture weights for Automatic Speech Recognition
auteur: Arseniy Gorin, Denis Jouvet
article: LTC’13 – 6th Language & Technology Conference: Human Language Technologies as a Challenge for Computer Science and Linguistics, Dec 2013, Poznań, Poland
Accès au bibtex

titre: Exploring temporal context in diachronic text documents for automatic OOV proper name retrieval
auteur: Imane Nkairi, Irina Illina, Georges Linarès, Dominique Fohr
article: Language & Technology Conference, Dec 2013, Poznań, Poland. pp.540-544
Accès au bibtex

titre: What we can learn from ASR errors about low-resourced languages: a case-study of Luxembourgish and Austrian
auteur: Martine Adda-Decker, Barbara Schuppler, Lori Lamel, Juan Andrés Morales Cordovilla, Gilles Adda
article: Errors by Humans and Machines in Multimedia, Multimodal, Multilingual Data Processing (ERRARE 2013), Nov 2013, Ermenonville, France
Accès au bibtex

titre: Etude de l’acceptabilité d’un robot moqueur
auteur: Carole Adam, Wafa Johal Benkaouar, Ilef Benfarhat, Céline Jost, Humbert Fiorino, Sylvie Pesty, Dominique Duhaut
article: III 2013 – 2ième conférence sur l’Intercompréhension de l’Intraspécifique à l’Interspécifique, Workshop, Sep 2013, Lorient, France
Accès au bibtex

titre: Non-negative Tensor Factorization for Single-Channel EEG Artifact Rejection
auteur: Cécilia Damon, Antoine Liutkus, Alexandre Gramfort, Slim Essid
article: MLSP, Sep 2013, Southampton, United Kingdom. ⟨10.1109/MLSP.2013.6661983⟩
Accès au texte intégral et bibtex

titre: General algorithms for estimating spectrogram and transfer functions of target signal for blind suppression of diffuse noise
auteur: Nobutaka Ito, Emmanuel Vincent, Nobutaka Ono, Shigeki Sagayama
article: 2013 IEEE International Workshop on Machine Learning for Signal Processing, Sep 2013, Southampton, United Kingdom
Accès au texte intégral et bibtex

titre: An experimental comparison of source separation and beamforming techniques for microphone array signal enhancement
auteur: Joachim Thiemann, Emmanuel Vincent
article: MLSP – 23rd IEEE International Workshop on Machine Learning for Signal Processing – 2013, Sep 2013, Southampton, United Kingdom
Accès au texte intégral et bibtex

titre: Introducing a simple fusion framework for audio source separation
auteur: Xabier Jaureguiberry, Gael Richard, Pierre Leveau, Romain Hennequin, Emmanuel Vincent
article: 2013 IEEE International Workshop on Machine Learning for Signal Processing, Sep 2013, Southampton, United Kingdom. pp.6
Accès au texte intégral et bibtex

titre: Statistics Based Features for Unvoiced Sound Classification
auteur: Sunit Sivasankaran, Kmm Prabhu
article: MLSP 2013 – IEEE International Workshop on Machine Learning for Signal Processing, Sep 2013, Southampton, United Kingdom. ⟨10.1109/MLSP.2013.6661986⟩
Accès au bibtex

titre: A fast EM algorithm for Gaussian model-based source separation
auteur: Joachim Thiemann, Emmanuel Vincent
article: EUSIPCO – 21st European Signal Processing Conference – 2013, Sep 2013, Marrakech, Morocco
Accès au texte intégral et bibtex

titre: Automatic Feature Selection for Acoustic-Visual Concatenative Speech Synthesis: Towards a Perceptual Objective Measure
auteur: Utpala Musti, Vincent Colotte, Slim Ouni, Caroline Lavecchia, Brigitte Wrobel-Dautcourt, Marie-Odile Berger
article: AVSP – Audio Visual Speech Processing, Sep 2013, Annecy, France
Accès au bibtex

titre: Analysis and Combination of Forward and Backward based Decoders for Improved Speech Transcription
auteur: Denis Jouvet, Dominique Fohr
article: TSD – 16th International Conference on Text, Speech and Dialogue – 2013, Sep 2013, Pilsen, Czech Republic. pp.84-91
Accès au bibtex

titre: Automatic Detection of the Prosodic Structures of Speech Utterances
auteur: Katarina Bartkova, Denis Jouvet
article: SPECOM – 15th International Conference on Speech and Computer – 2013, Sep 2013, Pilsen, Czech Republic. pp.1-8
Accès au bibtex

titre: A Machine Learning Based Approach for Vocabulary Selection for Speech Transcription
auteur: Denis Jouvet, David Langlois
article: TSD – 16th International Conference on Text, Speech and Dialogue – 2013, Sep 2013, Pilsen, Czech Republic. pp.60-67
Accès au bibtex

titre: Comparison and Analysis of Several Phonetic Decoding Approaches
auteur: Luiza Orosanu, Denis Jouvet
article: TSD – 16th International Conference on Text, Speech and Dialogue – 2013, Sep 2013, Pilsen, Czech Republic. pp.161-168
Accès au texte intégral et bibtex

titre: Speech animation using electromagnetic articulography as motion capture data
auteur: Ingmar Steiner, Korin Richmond, Slim Ouni
article: AVSP – 12th International Conference on Auditory-Visual Speech Processing – 2013, Aug 2013, Annecy, France. pp.55-60
Accès au texte intégral et bibtex

titre: Mixing faces and voices: a study of the influence of faces and voices on audiovisual intelligibility
auteur: Jérémy Miranda, Slim Ouni
article: AVSP – 12th International Conference on Auditory-Visual Speech Processing – 2013, Aug 2013, Annecy, France
Accès au texte intégral et bibtex

titre: Articulatory copy synthesis from cine X-ray films
auteur: Yves Laprie, Matthieu Loosvelt, Shinji Maeda, Rudolph Sock, Fabrice Hirsch
article: InterSpeech – 14th Annual Conference of the International Speech Communication Association – 2013, Aug 2013, Lyon, France
Accès au texte intégral et bibtex

titre: Diacritics Restoration for Arabic Dialects
auteur: Salima Harrat, Mourad Abbas, Karima Meftouh, Kamel Smaïli
article: INTERSPEECH 2013 – 14th Annual Conference of the International Speech Communication Association, ISCA, Aug 2013, Lyon, France
Accès au texte intégral et bibtex

titre: Vowel and prosodic factor dependent variations of vocal-tract length
auteur: Shinji Maeda, Yves Laprie
article: InterSpeech – 14th Annual Conference of the International Speech Communication Association – 2013, Aug 2013, Lyon, France
Accès au texte intégral et bibtex

titre: Comparison of approaches for an efficient phonetic decoding
auteur: Luiza Orosanu, Denis Jouvet
article: InterSpeech – 14th Annual Conference of the International Speech Communication Association – 2013, Aug 2013, Lyon, France
Accès au texte intégral et bibtex

titre: Combination of Random Indexing based Language Model and N-gram Language Model for Speech Recognition
auteur: Dominique Fohr, Odile Mella
article: INTERSPEECH – 14th Annual Conference of the International Speech Communication Association – 2013, Aug 2013, Lyon, France
Accès au bibtex

titre: Combining Forward-based and Backward-based Decoders for Improved Speech Recognition Performance
auteur: Denis Jouvet, Dominique Fohr
article: InterSpeech – 14th Annual Conference of the International Speech Communication Association – 2013, Aug 2013, Lyon, France
Accès au bibtex

titre: Comparing Multilingual Comparable Articles Based On Opinions
auteur: Motaz Saad, David Langlois, Kamel Smaïli
article: Proceedings of the 6th Workshop on Building and Using Comparable Corpora, Association for Computational Linguistics ACL, Aug 2013, Sofia, Bulgaria. pp.105-111
Accès au texte intégral et bibtex

titre: LORIA System for the WMT13 Quality Estimation Shared Task
auteur: David Langlois, Kamel Smaïli
article: ACL 2013 – Eighth Workshop on Statistical Machine Translation, Aug 2013, Sofia, Bulgaria. pp.380-385
Accès au texte intégral et bibtex

titre: Articulatory Data Acquisition and Processing
auteur: Yves Laprie, Slim Ouni
article: Colloque Corpus et Outils en Linguistique Langue et Parole : Statuts, Usages et Mésusages, Jul 2013, Strasbourg, France
Accès au bibtex

titre: An overview of informed audio source separation
auteur: Antoine Liutkus, Jean-Louis Durrieu, Laurent Daudet, Gael Richard
article: WIAMIS 2013 – The 14th International Workshop on Image and Audio Analysis for Multimedia Interactive Services, Jul 2013, Paris, France. pp.1-4, ⟨10.1109/WIAMIS.2013.6616139⟩
Accès au texte intégral et bibtex

titre: Effects of audio coding on ICA performance: an experimental study
auteur: Matthieu Puigt, Emmanuel Vincent, Yannick Deville, Anthony Griffin, Athanasios Mouchtaris
article: 11th IEEE Int. Workshop on Electronics, Control, Measurement, Signals and their application to Mechatronics, Jun 2013, Toulouse, France
Accès au texte intégral et bibtex

titre: A new Automatic Formant Tracking approach based on scalogram maxima detection using complex wavelets
auteur: Imen Jemaa, Kais Ouni, Yves Laprie, Slim Ouni, Jean-Paul Haton
article: CEIT – International Conference on Control, Engineering & Information Technology – 2013, Jun 2013, Sousse, Tunisia
Accès au texte intégral et bibtex

titre: Acoustic-to-articulatory inversion by analysis-by-synthesis using cepstral coefficients
auteur: Julie Busset, Yves Laprie
article: ICA – 21st International Congress on Acoustics – 2013, Jun 2013, Montréal, Canada
Accès au texte intégral et bibtex

titre: The Diverse Environments Multi-channel Acoustic Noise Database (DEMAND): A database of multichannel environmental noise recordings
auteur: Joachim Thiemann, Nobutaka Ito, Emmanuel Vincent
article: 21st International Congress on Acoustics, Acoustical Society of America, Jun 2013, Montreal, Canada. ⟨10.5281/zenodo.1227120⟩
Accès au texte intégral et bibtex

titre: Using full-rank spatial covariance models for noise-robust ASR
auteur: Dung T. Tran, Emmanuel Vincent, Denis Jouvet, Kamil Adiloglu
article: CHiME – 2nd International Workshop on Machine Listening in Multisource Environments – 2013, Jun 2013, Vancouver, Canada. pp.31-32
Accès au texte intégral et bibtex

titre: Effects of audio latency in a disc jockey interface
auteur: Laurent S. R. Simon, Arthur Vimond, Emmanuel Vincent
article: 21st International Congress on Acoustics, Jun 2013, Montreal, Canada
Accès au texte intégral et bibtex

titre: The second ‘CHiME’ Speech Separation and Recognition Challenge: Datasets, tasks and baselines
auteur: Emmanuel Vincent, Jon Barker, Shinji Watanabe, Jonathan Le Roux, Francesco Nesta, Marco Matassoni
article: ICASSP – 38th International Conference on Acoustics, Speech, and Signal Processing – 2013, May 2013, Vancouver, Canada. pp.126-130
Accès au texte intégral et bibtex

titre: A fundamental pitfall in blind deconvolution with sparse and shift-invariant priors
auteur: Alexis Benichoux, Emmanuel Vincent, Rémi Gribonval
article: ICASSP – 38th International Conference on Acoustics, Speech, and Signal Processing – 2013, May 2013, Vancouver, Canada
Accès au texte intégral et bibtex

titre: Particle swarm optimization for support vector clustering: Separating hyper-plane of unlabeled data
auteur: Souad Chaabouni, Salma Jamoussi, Yassine Benayed
article: 5th International Conference on Modeling, Simulation and Applied Optimization (ICMSAO), Apr 2013, Hammamet, Tunisia. ⟨10.1109/ICMSAO.2013.6552696⟩
Accès au texte intégral et bibtex

titre: Fouille d’images animées : cinéradiographies d’un locuteur
auteur: Julie Busset, Martine Cadot
article: FOSTA 2013, atelier de EGC 2013, Jan 2013, Toulouse, France. pp.1-12
Accès au bibtex

titre: Troubles du traitement temporel du langage oral : vers des outils en recherche clinique
auteur: Anne Bonneau
article: 23ème congrès de la Société Française de Neurologie Pédiatrique (SFNP), Jan 2013, Nancy, France
Accès au bibtex

titre: Non-negative matrix factorization for single-channel EEG artifact rejection
auteur: Cécilia Damon, Antoine Liutkus, Alexandre Gramfort, Slim Essid
article: ICASSP, 2013, Vancouver, Canada. ⟨10.1109/ICASSP.2013.6637836⟩
Accès au texte intégral et bibtex

Habilitation à diriger des recherches

titre: Multimodal Speech: from articulatory speech to audiovisual speech
auteur: Slim Ouni
article: Machine Learning [cs.LG]. Université de Lorraine, 2013
Accès au texte intégral et bibtex

Other publications

titre: Incertitudes en traitement de la parole et de l’audio
auteur: Emmanuel Vincent
article: 2013
Accès au bibtex

titre: Leveraging online sound exposure and big data
auteur: Emmanuel Vincent
article: 2013
Accès au bibtex

titre: Overview of the 2nd ‘CHiME’ Speech Separation and Recognition Challenge
auteur: Emmanuel Vincent, Jon Barker, Shinji Watanabe, Jonathan Le Roux, Francesco Nesta, Marco Matassoni
article: 2013
Accès au bibtex

titre: Source separation
auteur: Emmanuel Vincent
article: 2013
Accès au bibtex

titre: Source classification
auteur: Emmanuel Vincent
article: 2013
Accès au bibtex

titre: Introduction to sound scene analysis – Source localization
auteur: Emmanuel Vincent
article: 2013
Accès au bibtex

Patents

titre: Sound source separation method
auteur: Gerald Kergourlay, Johann Citérin, Eric Nguyen, Lionel Le Scolan, Joachim Thiemann, Emmanuel Vincent, Nancy Bertin, Frédéric Bimbot
article: United Kingdom, Patent n° : 1313218.8. 2013
Accès au bibtex

titre: Method and apparatus for sound source separation based on a binary activation model
auteur: Gerald Kergourlay, Joachim Thiemann, Emmanuel Vincent, Nancy Bertin, Frédéric Bimbot
article: United Kingdom, Patent n° : 1304774.1. 2013
Accès au bibtex

Poster communications

titre: MODIS: an audio motif discovery software
auteur: Laurence Catanese, Nathan Souviraà-Labastie, Bingqing Qu, Sébastien Campion, Guillaume Gravier, Emmanuel Vincent, Frédéric Bimbot
article: Show & Tell – Interspeech 2013, Aug 2013, Lyon, France
Accès au texte intégral et bibtex

titre: Photocurrent Spectroscopy of a Core-Shell GaAs/AlGaAs Nanowire Heterostructure
auteur: Xing Dai, Sen Zhang, Zilong Wang, Giorgio Adamo, Hai Liu, Yizhong Huang, Christophe Couteau, Cesare Soci
article: Asian Conference Spectroscopy 2013, 2013, Singapore, Singapore
Accès au bibtex

Proceedings

titre: The 12th International Conference on Auditory-Visual Speech Processing
auteur: Slim Ouni, Frédéric Berthommier, Alexandra Jesse
article: AVSP 2013 – 12th International Conference on Auditory-Visual Speech Processing, pp.247, 2013, 2308-975X
Accès au bibtex

Reports

titre: Médiation Scientifique : une facette de nos métiers de la recherche
auteur: Antoine Rousseau, Aurélie Darnaud, Brice Goglin, Céline Acharian, Christine Leininger, Christophe Godin, Clarisse Holik, Claude Kirchner, Diane Rives, Elodie Darquie, Erwan Kerrien, Fabrice Neyret, Florent Masseglia, Florian Dufour, Gérard Berry, Gilles Dowek, Hélène Robak, Hélène Xypas, Irina Illina, Isabelle Gnaedig, Joanna Jongwane, Jocelyne Ehrel, Laurent Viennot, Laure Guion, Lisette Calderan, Lola Kovacic, Marie Collin, Marie-Agnès Enard, Marie-Hélène Comte, Martin Quinson, Martine Olivi, Mathieu Giraud, Mathilde Dorémus, Mia Ogouchi, Muriel Droin, Nathalie Lacaux, Nicolas P. Rougier, Nicolas Roussel, Pascal Guitton, Pierre Peterlongo, Rose-Marie Cornus, Simon Vandermeersch, Sophie Maheo, Sylvain Lefebvre, Sylvie Boldo, Thierry Viéville, Véronique Poirel, Aline Chabreuil, Arnaud Fischer, Claude Farge, Claude Vadel, Isabelle Astic, Jean-Pierre Dumont, Loic Féjoz, Patrick Rambert, Pierre Paradinas, Sophie de Quatrebarbes, Stéphane Laurent
article: [Interne] Inria. 2013, pp.34
Accès au texte intégral et bibtex

Theses

titre: Détection du fondamental de la parole en temps réel : application aux voix pathologiques
auteur: Fadoua Bahja
article: Traitement du signal et de l’image [eess.SP]. Université Mohammed V-Agdal UFR Informatique et Télécommunications Laboratoire LRIT Unité associée au CNRST, URAC 29, Faculté des sciences, 2013. Français
Accès au texte intégral et bibtex

titre: Acoustic-to-articulatory mapping from cepstral coefficients
auteur: Julie Busset
article: Traitement du signal et de l’image [eess.SP]. Université de Lorraine, 2013. Français
Accès au texte intégral et bibtex

titre: Acoustic-Visual Speech Synthesis by Bimodal Unit Selection
auteur: Utpala Musti
article: Machine Learning [cs.LG]. Université de Lorraine, 2013. English. ⟨NNT : 2013LORR0003⟩
Accès au texte intégral et bibtex

titre: Formant tracking via a multiresolution analysis
auteur: Imen Jemaa
article: Interface homme-machine [cs.HC]. Université de Lorraine; Faculté des Sciences de Tunis, 2013. Français
Accès au texte intégral et bibtex

2012

Journal articles

titre: Continuations intra- et interphrastiques du français : premiers résultats expérimentaux
auteur: Mathilde Dargnat, Vincent Colotte, Katarina Bartkova, Anne Bonneau
article: SHS Web of Conferences, 2012, 3e Congrès Mondial de Linguistique Française, 1, pp.1471-1485. ⟨10.1051/shsconf/20120100142⟩
Accès au bibtex

titre: Multilingual Recognition of Non-Native Speech using Acoustic Model Transformation and Pronunciation Modeling
auteur: Ghazi Bouselmi, Dominique Fohr, Irina Illina
article: International Journal of Speech Technology, 2012, 15 (2), pp.203-213
Accès au bibtex

titre: Je peux voir les mots que tu dis !
auteur: Agnès Piquard-Kipffer, Christian Blonz
article: Interstices, 2012
Accès au bibtex

titre: Prédire dès l’âge de 5 ans le niveau de lecture de fin de cycle 2. Suivi de 85 enfants de langue maternelle française de 4 à 8 ans.
auteur: Agnès Piquard-Kipffer
article: L’information grammaticale, 2012, 133, pp.20-26
Accès au bibtex

titre: The Magnetic Resonance Imaging subset of the mngu0 articulatory corpus
auteur: Ingmar Steiner, Korin Richmond, Ian Marshall, Calum Gray
article: Journal of the Acoustical Society of America, 2012, 131 (2), pp.106-111. ⟨10.1121/1.3675459⟩
Accès au texte intégral et bibtex

Conference papers

titre: Combining criteria for the detection of incorrect entries of non-native speech in the context of foreign language learning
auteur: Luiza Orosanu, Denis Jouvet, Dominique Fohr, Irina Illina, Anne Bonneau
article: SLT 2012 – 4th IEEE Workshop on Spoken Language Technology, Dec 2012, Miami, United States
Accès au texte intégral et bibtex

titre: Class-based speech recognition using a maximum dissimilarity criterion and a tolerance classification margin
auteur: Arseniy Gorin, Denis Jouvet
article: SLT 2012 – 4th IEEE Workshop on Spoken Language Technology, Dec 2012, Miami, United States
Accès au bibtex

titre: Démêler les actions des articulateurs en jeu lors de la production de parole avec le logiciel C.H.I.C. : Analyse de séquences de radiographies de la tête.
auteur: Julie Busset, Martine Cadot
article: 6th International Conference Implicative Statistic Analysis – A.S.I. 6 – 2012, Nov 2012, Caen, France. pp.291-305
Accès au bibtex

titre: Using multimodal speech production data to evaluate articulatory animation for audiovisual speech synthesis
auteur: Ingmar Steiner, Korin Richmond, Slim Ouni
article: 3rd International Symposium on Facial Analysis and Animation – FAA 2012, Sep 2012, Vienna, Austria
Accès au texte intégral et bibtex

titre: ViSAC : Acoustic-Visual Speech Synthesis: The system and its evaluation
auteur: Utpala Musti, Caroline Lavecchia, Vincent Colotte, Slim Ouni, Brigitte Wrobel-Dautcourt, Marie-Odile Berger
article: FAA: The ACM 3rd International Symposium on Facial Analysis and Animation, Sep 2012, Vienna, Austria
Accès au bibtex

titre: A new method for learning Phrase Based Machine Translation with Multivariate Mutual Information
auteur: Cyrine Nasri, Kamel Smaïli, Chiraz Latiri, Yahya Slimani
article: The 8th International Conference on Natural Language Processing and Knowledge Engineering – NLP-KE’12, Sep 2012, HuangShan, China
Accès au texte intégral et bibtex

titre: VisArtico: a visualization tool for articulatory data
auteur: Slim Ouni, Loïc Mangeonjean, Ingmar Steiner
article: 13th Annual Conference of the International Speech Communication Association – InterSpeech 2012, Sep 2012, Portland, OR, United States
Accès au texte intégral et bibtex

titre: Application of an Ultrasound-Based Silent Speech Interface to the French Language for Normal and Post-Laryngectomy Speakers
auteur: Jun Cai, Bruce Denby, Pierre Roussel, Gérard Dreyfus, Lise Crevier-Buchman
article: Interspeech, Sep 2012, Portland, United States
Accès au bibtex

titre: LORIA System for the WMT12 Quality Estimation Shared Task
auteur: David Langlois, Sylvain Raybaud, Kamel Smaïli
article: NAACL 2012 – The Seventh Workshop on Statistical Machine Translation, Jun 2012, Montréal, Canada. pp.114-119
Accès au texte intégral et bibtex

titre: Détection de transcriptions incorrectes de parole non-native dans le cadre de l’apprentissage de langues étrangères
auteur: Luiza Orosanu, Denis Jouvet, Dominique Fohr, Irina Illina, Anne Bonneau
article: JEP-TALN-RECITAL 2012, Jun 2012, Grenoble, France
Accès au bibtex

titre: Génération des prononciations de noms propres à l’aide des champs aléatoires conditionnels
auteur: Irina Illina, Dominique Fohr, Denis Jouvet
article: JEP-TALN-RECITAL 2012, Jun 2012, Grenoble, France
Accès au bibtex

titre: Étude comparée de la précision de mesure des systèmes d’articulographie électromagnétique 3D : Wave et AG500
auteur: Christophe Savariaux, Pierre Badin, Slim Ouni, Brigitte Wrobel-Dautcourt
article: JEP-TALN-RECITAL 2012 – conférence conjointe 29e Journées d’Études sur la Parole, 19e Traitement Automatique des Langues Naturelles, 14e Rencontre des Étudiants Chercheurs en Informatique pour le Traitement Automatique des Langues, Jun 2012, Grenoble, France. pp.513-520
Accès au bibtex

titre: Speech clarity and coarticulatory effects in standard and dialectal Arabic
auteur: Mohamed Embarki, Slim Ouni, Fathi Salam
article: Journées d’Études sur la Parole, Association Francophone pour la Communication Parlée, Jun 2012, Grenoble, France
Accès au texte intégral et bibtex

titre: Exploitation d’une marge de tolérance de classification pour améliorer l’apprentissage de modèles acoustiques de classes en reconnaissance de la parole
auteur: Denis Jouvet, Arseniy Gorin, Nicolas Vinuesa
article: JEP-TALN-RECITAL 2012, Jun 2012, Grenoble, France. pp.763-770
Accès au bibtex

titre: VisArtico : visualiser les données articulatoires obtenues par un articulographe
auteur: Slim Ouni, Loïc Mangeonjean
article: Actes de la conférence conjointe JEP-TALN-RECITAL 2012, Jun 2012, Grenoble, France. pp.129-135
Accès au bibtex

titre: Je peux voir les mots que tu dis ! Histoire d’un projet
auteur: Agnès Piquard-Kipffer, Christian Blonz
article: 13ème édition du Festival du film de chercheur CNRS 2012, Jun 2012, Nancy, France
Accès au bibtex

titre: Productions of “continuation contours” by French speakers in L1 (French) and L2 (English)
auteur: Katarina Bartkova, Anne Bonneau, Vincent Colotte, Mathilde Dargnat
article: Speech Prosody, May 2012, Shanghai, China. pp.426-429
Accès au bibtex

titre: CoALT: A Software for Comparing Automatic Labelling Tools
auteur: Dominique Fohr, Odile Mella
article: Language Resources and Evaluation LREC 2012, May 2012, Istanbul, Turkey. pp.325-328
Accès au bibtex

titre: On the Use of Wavelets and Cepstrum Excitation for Pitch Determination in Real-Time
auteur: Fadoua Bahja, El Hassan Ibn Elhaj, Joseph Di Martino
article: 3rd International Conference on Multimedia Computing and Systems – ICMCS’12, May 2012, Tangier, Morocco
Accès au texte intégral et bibtex

titre: Real Time Context-Independent Phone Recognition Using a Simplified Statistical Training Algorithm
auteur: Othman Lachhab, Joseph Di Martino, El Hassan Ibn Elhaj, Ahmed Hammouch
article: 3rd International Conference on Multimedia Computing and Systems – ICMCS’12, May 2012, Tangier, Morocco
Accès au texte intégral et bibtex

titre: A Study of a Non-Resourced Language: The Case of one of the Algerian Dialects
auteur: Karima Meftouh, Najette Bouchemal, Kamel Smaïli
article: The third International Workshop on Spoken Languages Technologies for Under-resourced Languages – SLTU’12, May 2012, Cape Town, South Africa. pp.1-7
Accès au texte intégral et bibtex

titre: Evaluating grapheme-to-phoneme converters in automatic speech recognition context
auteur: Denis Jouvet, Dominique Fohr, Irina Illina
article: ICASSP – 2012 – IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Mar 2012, Kyoto, Japan. pp.4821-4824, ⟨10.1109/ICASSP.2012.6288998⟩
Accès au texte intégral et bibtex

titre: Classification margin for improved class-based speech recognition performance
auteur: Denis Jouvet, Nicolas Vinuesa
article: ICASSP – 2012 – IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Mar 2012, Kyoto, Japan. pp.4285-4288, ⟨10.1109/ICASSP.2012.6288866⟩
Accès au bibtex

titre: Artimate: an articulatory animation framework for audiovisual speech synthesis
auteur: Ingmar Steiner, Slim Ouni
article: Workshop on Innovation and Applications in Speech Technology, UCD, TCD, Mar 2012, Dublin, Ireland
Accès au texte intégral et bibtex

titre: Prediction of Cepstral Excitation Pulses for Voice Conversion
auteur: Fadoua Bahja, Joseph Di Martino, El Hassan Ibn Elhaj, Driss Aboutajdine
article: 5th International Conference on Information Systems and Economic Intelligence – SIIE’2012, Feb 2012, Djerba, Tunisia
Accès au texte intégral et bibtex

titre: Real-Time Signal Reconstruction from Short-Time Fourier Transform Magnitude Spectra Using FPGAs
auteur: Mouhcine Chami, Joseph Di Martino, Laurent Pierron, El Hassan Ibn Elhaj
article: 5th International Conference on Information Systems and Economic Intelligence – SIIE’2012, Feb 2012, Djerba, Tunisia
Accès au texte intégral et bibtex

titre: Continuations intra- et interphrastiques du français : premiers résultats expérimentaux
auteur: Mathilde Dargnat, Vincent Colotte, Katarina Bartkova, Anne Bonneau
article: CMLF 2012, 2012, Lyon, France. pp.1471-1485, ⟨10.1051/shsconf/20120100142⟩
Accès au texte intégral et bibtex

Other publications

titre: Développement et apprentissages : enjeux, démarches, perspectives
auteur: Agnès Piquard-Kipffer
article: 2012
Accès au bibtex

titre: Je peux voir les mots que tu dis !
auteur: Agnès Piquard-Kipffer, Christian Blonz
article: 2012, pp.4
Accès au bibtex

Books

titre: Special Issue on Latent Variable Analysis and Signal Separation
auteur: Vincent Vigneron, Vicente Zarzoso, Rémi Gribonval, Emmanuel Vincent
article: Signal Processing, 2012, 92
Accès au bibtex

Reports

titre: System & Contrast : a Polymorphous Model of the Inner Organization of Structural Segments within Music Pieces (Original Extensive Version)
auteur: Frédéric Bimbot, Emmanuel Deruty, Gabriel Sargent, Emmanuel Vincent
article: [Research Report] IRISA PI-1999, 2012, pp.40
Accès au texte intégral et bibtex

titre: Spatial location priors for Gaussian model based reverberant audio source separation
auteur: Ngoc Q. K. Duong, Emmanuel Vincent, Rémi Gribonval
article: [Research Report] RR-8057, INRIA. 2012
Accès au texte intégral et bibtex

Theses

titre: Language models exploiting the structural similarity between sequences for automatic speech recognition
auteur: Christian Gillot
article: Intelligence artificielle [cs.AI]. Université de Lorraine, 2012. Français
Accès au texte intégral et bibtex

titre: On the use of confidence measures in machine translation : evaluation, post edition and application to speech translation
auteur: Sylvain Raybaud
article: Informatique et langage [cs.CL]. Université de Lorraine, 2012. Français. ⟨NNT : 2012LORR0260⟩
Accès au texte intégral et bibtex

2011

Journal articles

titre: “This sentence is wrong.” Detecting errors in machine-translated sentences.
auteur: Sylvain Raybaud, David Langlois, Kamel Smaïli
article: Machine Translation, 2011, 25 (1), pp.1-34. ⟨10.1007/s10590-011-9094-9⟩
Accès au texte intégral et bibtex

titre: Combinaisons d’automates et de boules de mots pour la classification de séquences
auteur: Frédéric Tantini, Alain Terlutte, Fabien Torre
article: Revue des Sciences et Technologies de l’Information – Série RIA : Revue d’Intelligence Artificielle, 2011, Apprentissage artificiel, 25 (3), pp.411-434. ⟨10.3166/ria.25.411-434⟩
Accès au texte intégral et bibtex

titre: Estimating the control parameters of an articulatory model from electromagnetic articulograph data
auteur: Asterios Toutios, Slim Ouni, Yves Laprie
article: Journal of the Acoustical Society of America, 2011, Speech Production, 129 (5), pp.3245-3257. ⟨10.1121/1.3569714⟩
Accès au bibtex

titre: Evaluation of Topic Identification Methods on Arabic Corpora
auteur: Mourad Abbas, Kamel Smaïli, Daoud Berkani
article: Journal of Digital Information Management, 2011, 9 (5), pp.8 double column
Accès au bibtex

titre: Frame-Synchronous and Local Confidence Measures for Automatic Speech Recognition
auteur: Joseph Razik, Odile Mella, Dominique Fohr, Jean-Paul Haton
article: International Journal of Pattern Recognition and Artificial Intelligence, 2011, 25 (2), pp.1-26. ⟨10.1142/S0218001411008543⟩
Accès au bibtex

titre: Phrase-based Machine Translation based on Text Mining and Statistical Language Modeling Techniques
auteur: Chiraz Latiri, Kamel Smaïli, Caroline Lavecchia, Cyrine Nasri, David Langlois
article: International Journal of Computational Linguistics and Applications, 2011, 2 (1-2), pp.16
Accès au texte intégral et bibtex

Conference papers

titre: Création d’une collection de livres numériques pour enfants présentant des difficultés de langage (enfants sourds, enfants avec troubles spécifiques ou retards de langage)
auteur: Agnès Piquard-Kipffer, Hélène Adam-Piquard, Sylvie Nussbaum
article: L’apprentissage de la lecture : convergences, innovations, perspectives., Université de Cergy-Pontoise & IUFM, Dec 2011, Gennevilliers, France
Accès au bibtex

titre: Impact of Pronunciation Variant Frequency on Automatic Non-Native Speech Segmentation
auteur: Denis Jouvet, Larbi Mesbahi, Anne Bonneau, Dominique Fohr, Irina Illina, Yves Laprie
article: 5th Language & Technology Conference – LTC’11, Nov 2011, Poznań, Poland. pp.145-148
Accès au bibtex

titre: Clustering repeated Out-Of-Vocabulary word tokens in order to model them for broadcast news transcription
auteur: Frederik Stouten, Irina Illina, Dominique Fohr
article: The XIVth International Conference Speech and Computer – SPECOM’2011, Sep 2011, Kazan, Russia. pp.73-80
Accès au bibtex

titre: Building a Pronunciation Lexicon for a Speech Transcription System from Wiktionary Pronunciations only
auteur: Denis Jouvet, Dominique Fohr, Irina Illina
article: XIV International Conference “Speech and Computer” (SPECOM’2011), Sep 2011, Kazan, Russia
Accès au bibtex

titre: Multiple Pronunciation Generation using Grapheme-to-Phoneme Conversion based on Conditional Random Fields
auteur: Irina Illina, Dominique Fohr, Denis Jouvet
article: XIV International Conference “Speech and Computer” (SPECOM’2011), Sep 2011, Kazan, Russia
Accès au bibtex

titre: Intra- and Inter-clausal Continuation Slopes in French: First Results
auteur: Mathilde Dargnat, Anne Bonneau, Vincent Colotte, Katarina Bartkova
article: Experimental and Theoretical Advances in Prosody 2, Sep 2011, Montréal, Canada
Accès au bibtex

titre: Broadcast news speech-to-text translation experiments
auteur: Sylvain Raybaud, David Langlois, Kamel Smaïli
article: The Thirteenth Machine Translation Summit, Sep 2011, Xiamen, China. pp.378-381
Accès au texte intégral et bibtex

titre: “Non-conclusive” Slopes in French: First Results
auteur: Mathilde Dargnat, Anne Bonneau, Katarina Bartkova, Vincent Colotte
article: Interface Discours et prosodie 2011, University of Salford, Sep 2011, Manchester, United Kingdom
Accès au bibtex

titre: Introducing Visual Target Cost within an Acoustic-Visual Unit-Selection Speech Synthesizer
auteur: Utpala Musti, Vincent Colotte, Asterios Toutios, Slim Ouni
article: International Conference on Auditory-Visual Speech Processing – AVSP2011, Aug 2011, Volterra, Italy
Accès au bibtex

titre: Construction and evaluation of an articulatory model of the vocal tract
auteur: Yves Laprie, Julie Busset
article: 19th European Signal Processing Conference – EUSIPCO‐2011, Aug 2011, Barcelona, Spain
Accès au texte intégral et bibtex

titre: Weight Optimization for Bimodal Unit-Selection Talking Head Synthesis
auteur: Asterios Toutios, Utpala Musti, Slim Ouni, Vincent Colotte
article: 12th Annual Conference of the International Speech Communication Association – Interspeech 2011, Aug 2011, Florence, Italy
Accès au bibtex

titre: Predicting Tongue Positions from Acoustics and Facial Features
auteur: Asterios Toutios, Slim Ouni
article: 12th Annual Conference of the International Speech Communication Association – Interspeech 2011, Aug 2011, Florence, Italy
Accès au texte intégral et bibtex

titre: Continuous episodic memory based speech recognition using articulatory dynamics
auteur: Sébastien Demange, Slim Ouni
article: 12th Annual Conference of the International Speech Communication Association – Interspeech 2011, Aug 2011, Florence, Italy
Accès au texte intégral et bibtex

titre: Tongue Gestures Awareness and Pronunciation Training
auteur: Slim Ouni
article: 12th Annual Conference of the International Speech Communication Association – Interspeech 2011, Aug 2011, Florence, Italy
Accès au bibtex

titre: Grapheme-to-Phoneme Conversion using Conditional Random Fields
auteur: Irina Illina, Dominique Fohr, Denis Jouvet
article: 12th Annual Conference of the International Speech Communication Association – Interspeech 2011, International Speech Communication Association (ISCA) et The Italian Regional SIG – AISV (Italian Speech Communication Association), Aug 2011, Florence, Italy
Accès au bibtex

titre: The JSafran platform for semi-automatic speech processing
auteur: Christophe Cerisara, Claire Gardent
article: 12th Annual Conference of the International Speech Communication Association – Interspeech 2011, Aug 2011, Florence, Italy. pp.4
Accès au bibtex

titre: Recognition and Real Time Performance of a Lightweight Ultrasound Based Silent Speech Interface Employing a Language Model
auteur: Jun Cai, Bruce Denby, Pierre Roussel, Gérard Dreyfus, Lise Crevier-Buchman
article: Interspeech, Aug 2011, Florence, Italy
Accès au bibtex

titre: About handling boundary uncertainty in a speaking rate dependent modeling approach
auteur: Denis Jouvet, Dominique Fohr, Irina Illina
article: 12th Annual Conference of the International Speech Communication Association – Interspeech 2011, International Speech Communication Association (ISCA) et The Italian Regional SIG – AISV (Italian Speech Communication Association), Aug 2011, Florence, Italy
Accès au bibtex

titre: Commas recovery with syntactic features in French and in Czech
auteur: Christophe Cerisara, Pavel Kral, Claire Gardent
article: 12th Annual Conference of the International Speech Communication Association – Interspeech 2011, Aug 2011, Florence, Italy. pp.4
Accès au bibtex

titre: Similarity language model
auteur: Christian Gillot, Christophe Cerisara
article: 12th Annual Conference of the International Speech Communication Association – Interspeech 2011, Aug 2011, Florence, Italy. pp.4
Accès au bibtex

titre: Reliability of non-native speech automatic segmentation for prosodic feedback
auteur: Larbi Mesbahi, Denis Jouvet, Anne Bonneau, Dominique Fohr, Irina Illina, Yves Laprie
article: Workshop on Speech and Language Technology in Education – SLaTE 2011, ISCA, Aug 2011, Venice, Italy
Accès au bibtex

titre: Speech clarity and coarticulation in Modern standard Arabic and Dialectal Arabic
auteur: Mohamed Embarki, Slim Ouni, Fathi Salam
article: 17th International Congress of Phonetic Sciences – ICPhS XVII, Aug 2011, Hong Kong, China. pp.635-638
Accès au bibtex

titre: Open source voice creation toolkit for the MARY TTS Platform
auteur: Marc Schröder, Marcela Charfuelan, Sathish Pammi, Ingmar Steiner
article: 12th Annual Conference of the International Speech Communication Association – Interspeech 2011, Aug 2011, Florence, Italy. pp.3253-3256
Accès au texte intégral et bibtex

titre: Acquisition de données articulatoires par un articulographe
auteur: Slim Ouni
article: Typologie des rhétiques : manifestations phonétiques et enjeux phonologiques, Jun 2011, Paris, France
Accès au bibtex

titre: Vers la détection des dislocations à gauche dans les transcriptions automatiques du Français parlé / Towards automatic recognition of left dislocation in transcriptions of Spoken French
auteur: Corinna Anderson, Christophe Cerisara, Claire Gardent
article: Traitement Automatique des Langues Naturelles – TALN’2011, Jun 2011, Montpellier, France. pp.6
Accès au texte intégral et bibtex

titre: An X-ray database, tools and procedures for the study of speech production
auteur: Rudolph Sock, Fabrice Hirsch, Yves Laprie, Pascal Perrier, Béatrice Vaxelaire, Gilbert Brock, Fayssal Bouarourou, Camille Fauth, Véronique Ferbach-Hecker, Liang Ma, Julie Busset, Jean Sturm
article: ISSP 2011 – 9th International Seminar on Speech Production, Jun 2011, Montréal, Canada. pp.41-48
Accès au texte intégral et bibtex

titre: Efficiency of five labial correlates for /i/ and /y/ in adverse contexts
auteur: Anne Bonneau, Brigitte Wrobel-Dautcourt
article: The ninth International Seminar on Speech Production – ISSP’11, Jun 2011, Montreal, Canada
Accès au bibtex

titre: A curvilinear tongue articulatory model
auteur: Yves Laprie, Julie Busset
article: International Seminar on Speech Production 2011 – ISSP’11, Jun 2011, Montréal, Canada
Accès au texte intégral et bibtex

titre: Adaptation of cepstral coefficients for acoustic-to-articulatory inversion
auteur: Julie Busset, Yves Laprie
article: International Seminar on Speech Production 2011 – ISSP’11, Jun 2011, Montréal, Canada
Accès au texte intégral et bibtex

titre: Investigating articulatory differences between upright and supine posture using 3D EMA
auteur: Ingmar Steiner, Slim Ouni
article: 9th International Seminar on Speech Production – ISSP’11, UQAM, Jun 2011, Montreal, Canada
Accès au bibtex

titre: Towards an articulatory tongue model using 3D EMA
auteur: Ingmar Steiner, Slim Ouni
article: 9th International Seminar on Speech Production – ISSP’11, UQAM, Jun 2011, Montreal, Canada. pp.147-154
Accès au texte intégral et bibtex

titre: Acoustic-to-articulatory inversion using an episodic memory
auteur: Sébastien Demange, Slim Ouni
article: International Conference on Acoustics, Speech and Signal Processing (ICASSP), IEEE – Signal Processing Society, May 2011, Prague, Czech Republic
Accès au bibtex

Book sections

titre: Automatic Feedback for L2 Prosody Learning
auteur: Anne Bonneau, Vincent Colotte
article: Ivo Ipsic. Speech and Language Technologies, Intech, pp.55-70, 2011, 978-953-307-322-4
Accès au texte intégral et bibtex

titre: Acoustic and EMA study of pharyngealization : Coarticulatory effects as index of stylistic and regional distinction
auteur: Mohamed Embarki, Slim Ouni, Mohamed Yeou, Christian Guilleminot, Sallal Al Maqtari
article: M. Hassan and B. Heselwood. Instrumental Studies in Arabic Phonetics, Benjamins, pp.1-56, 2011, Current Issues in Linguistic Theory
Accès au bibtex

titre: Progress in animation of an EMA-controlled tongue model for acoustic-visual speech synthesis
auteur: Ingmar Steiner, Slim Ouni
article: Bernd J. Kröger and Peter Birkholz. Elektronische Sprachsignalverarbeitung 2011, TUDpress, pp.245-252, 2011, 978-3-942710-37-4
Accès au texte intégral et bibtex

Books

titre: Information Systems and Economic Intelligence : 4th. International Conference SIIE 2011. (Proceedings). February 17-19, 2011 (Marrakech, Morocco)
auteur: Sahbi Sidhom, Jean-Paul Haton, Malek Ghenima
article: Jean-Paul Haton and Sahbi Sidhom and Malek Ghenima. IGA Morocco, 1, pp.529, 2011, IGA Morocco
Accès au bibtex

Reports

titre: Parcours des enfants présentant des Troubles Spécifiques du Langage (TSL) en situation de handicap. Région Lorraine Enfance de 4 à 20 ans
auteur: Tamara Léonova, Agnès Piquard-Kipffer
article: [Rapport de recherche] ARS. 2011, pp.111
Accès au bibtex

Videos

titre: Je peux voir les mots que tu dis !
auteur: Christian Blonz, Denis Lelarge, Agnès Piquard-Kipffer
article: 2011
Accès au texte intégral et bibtex

2010

Journal articles

titre: Idée reçue : Télécharger des fichiers, ça alourdit mon ordinateur
auteur: Jean-Paul Haton
article: Interstices, 2010
Accès au bibtex

titre: Modeling Arabic Language using statistical methods
auteur: Karima Meftouh, Med Tayeb Laskri, Kamel Smaïli
article: Arabian Journal for Science and Engineering, 2010, Theme issue on Arabic Computing, 35 (2C), pp.69-82
Accès au texte intégral et bibtex

titre: Mining monolingual and bilingual corpora
auteur: Chiraz Latiri, Kamel Smaïli, Caroline Lavecchia, David Langlois
article: Intelligent Data Analysis, 2010, 14 (6), pp.663-682
Accès au texte intégral et bibtex

titre: TR-Classifier and kNN Evaluation for Topic Identification tasks
auteur: Mourad Abbas, Kamel Smaïli, Daoud Berkani
article: International Journal on Information and Communication Technologies, 2010, 3 (3), pp.10
Accès au texte intégral et bibtex

titre: Microscopic origins of the ferromagnetic exchange coupling in oxoverdazyl-based Cu(II) complex
auteur: Jean-Baptiste Rota, Carmen J. Calzado, Cyrille Train, Vincent Robert
article: The Journal of Chemical Physics, 2010, 132 (15), pp.154702. ⟨10.1063/1.3378023⟩
Accès au bibtex

titre: A wavelet-based parameterization for speech/music discrimination
auteur: Emmanuel Didiot, Irina Illina, Dominique Fohr, Odile Mella
article: Computer Speech and Language, 2010, 24 (2), pp.341-357. ⟨10.1016/j.csl.2009.05.003⟩
Accès au bibtex

titre: Visualization of hypopharyngeal cavities and vocal tract acoustic modeling
auteur: Kiyoshi Honda, Tatsuya Kitamura, Hironori Takemoto, Seiji Adachi, Parham Mokhtari, Sayoko Takano, Yukiko Nota, Hiroyuki Hirata, Ichiro Fujimoto, Yasuhiro Shimada, Shinobu Masaki, Satoru Fujita, Jianwu Dang
article: Computer Methods in Biomechanics and Biomedical Engineering, 2010, 13 (4), pp.443-453. ⟨10.1080/10255842.2010.490528⟩
Accès au bibtex

titre: Recherche par le contenu dans des documents audiovisuels multilingues
auteur: Georges Quénot, Tien-Ping Tan, Viet-Bac Le, Stéphane Ayache, Laurent Besacier, Philippe Mulhem
article: Document numérique – Revue des sciences et technologies de l’information. Série Document numérique, 2010, 13 (1), pp.229-246
Accès au bibtex

titre: Dialogue act recognition approaches
auteur: Pavel Kral, Christophe Cerisara
article: Computing and Informatics, 2010, 29 (2), pp.227–250
Accès au bibtex

titre: Evaluation of Automatic Formant Tracking Method Using Fourier Ridges
auteur: Imen Jemaa, Kais Ouni, Yves Laprie
article: Cognitive Computation, 2010, 2, pp.170-179
Accès au bibtex

Conference papers

titre: Building and Exploiting a Dependency Treebank for French Radio Broadcast
auteur: Christophe Cerisara, Claire Gardent, Corinna Anderson
article: TLT9 – The Ninth International Workshop on Treebanks and Linguistic Theories, Dec 2010, Tartu, Estonia
Accès au bibtex

titre: Semi-Automatic Propbanking for French
auteur: Claire Gardent, Christophe Cerisara
article: TLT9 – The Ninth International Workshop on Treebanks and Linguistic Theories, Dec 2010, Tartu, Estonia
Accès au bibtex

titre: An improvement of the eCATE algorithm for F0 detection
auteur: Fadoua Bahja, Joseph Di Martino, El Hassan Ibn Elhaj, Driss Aboutajdine
article: 10th International Symposium on Communications and Information Technologies – ISCIT 2010, Oct 2010, Tokyo, Japan. pp.24-28, ⟨10.1109/ISCIT.2010.5664919⟩
Accès au bibtex

titre: On the Use of an Iterative Estimation of Continuous Probabilistic Transforms for Voice Conversion
auteur: A. Werghi, Joseph Di Martino, S. Ben Jebara
article: 5th International Symposium on I/V Communications over fixed and Mobile Networks – ISIVC 2010, Sep 2010, Rabat, Morocco. pp.1-4, ⟨10.1109/ISVC.2010.5656149⟩
Accès au bibtex

titre: Real-Time Pitch Tracking using the eCate Algorithm
auteur: Fadoua Bahja, Joseph Di Martino, El Hassan Ibn Elhaj
article: 5th International Symposium on I/V Communications over fixed and Mobile Networks – ISIVC 2010, Sep 2010, Rabat, Morocco. pp.1-4, ⟨10.1109/ISVC.2010.5656254⟩
Accès au bibtex

titre: Towards a True Acoustic-Visual Speech Synthesis
auteur: Asterios Toutios, Utpala Musti, Slim Ouni, Vincent Colotte, Brigitte Wrobel-Dautcourt, Marie-Odile Berger
article: 9th International Conference on Auditory-Visual Speech Processing – AVSP2010, Sep 2010, Hakone, Kanagawa, Japan. pp.POS1-8
Accès au texte intégral et bibtex

titre: Setup for Acoustic-Visual Speech Synthesis by Concatenating Bimodal Units
auteur: Asterios Toutios, Utpala Musti, Slim Ouni, Vincent Colotte, Brigitte Wrobel-Dautcourt, Marie-Odile Berger
article: Interspeech 2010, ISCA, Sep 2010, Makuhari, Chiba, Japan. pp.486-489
Accès au texte intégral et bibtex

titre: HMM-based Automatic Visual Speech Segmentation Using Facial Data
auteur: Utpala Musti, Asterios Toutios, Slim Ouni, Vincent Colotte, Brigitte Wrobel-Dautcourt, Marie-Odile Berger
article: Interspeech 2010, ISCA, Sep 2010, Makuhari, Chiba, Japan. pp.1401-1404
Accès au texte intégral et bibtex

titre: Memory-Based Active Learning for French Broadcast News
auteur: Frédéric Tantini, Christophe Cerisara, Claire Gardent
article: INTERSPEECH 2010, Sep 2010, Tokyo, Japan. pp.1377-1380
Accès au bibtex

titre: Similar N-Gram Language Model
auteur: Christian Gillot, Christophe Cerisara, David Langlois, Jean-Paul Haton
article: INTERSPEECH 2010, Sep 2010, Tokyo, Japan. pp.1824-1827
Accès au bibtex

titre: Detailed pronunciation variant modeling for speech transcription
auteur: Denis Jouvet, Dominique Fohr, Irina Illina
article: INTERSPEECH, ISCA, Sep 2010, Makuhari, Japan
Accès au bibtex

titre: Sequences Classification by Least General Generalisations
auteur: Frédéric Tantini, Alain Terlutte, Fabien Torre
article: 10th International Colloquium on Grammatical Inference, Sep 2010, Valencia, Spain. pp.189-202, ⟨10.1007/978-3-642-15488-1_16⟩
Accès au texte intégral et bibtex

titre: Automatic adaptation of a vocal tract model
auteur: Blaise Potard, Yves Laprie
article: Proceedings of the 18th European Signal Processing Conference – EUSIPCO-2010, Aug 2010, Aalborg, Denmark
Accès au texte intégral et bibtex

titre: Tongue Control and Implication in Pronunciation Training
auteur: Slim Ouni
article: Natural Language Processing and Language Learning Workshop (NaTAL’10), Jun 2010, Nancy, France
Accès au bibtex

titre: Evaluation d’une nouvelle méthode de suivi de formants sur un corpus Arabe
auteur: Imen Jemaa, Oussama Rekhis, Kais Ouni, Yves Laprie
article: XXVIIIèmes Journées d’Etude sur la Parole – JEP’10, May 2010, Mons, Belgium
Accès au texte intégral et bibtex

titre: Regroupement des occurrences des mots hors-vocabulaire répétés en vue de leur modélisation pour la transcription d’émissions radio
auteur: Frederik Stouten, Irina Illina, Dominique Fohr
article: 28ème Journées d’étude sur la parole – JEP’10, Université de Mons, May 2010, Mons, Belgium
Accès au bibtex

titre: Combinaisons de boules de mots pour la classification de séquences
auteur: Frédéric Tantini, Alain Terlutte, Fabien Torre
article: CAp – Conférence Francophone sur l’Apprentissage Automatique – 2010, May 2010, Clermont-Ferrand, France. pp.161-176
Accès au texte intégral et bibtex

titre: Cleaning statistical language models
auteur: Reda Jourani, David Langlois, Kamel Smaïli, Khalid Daoudi, Driss Aboutajdine
article: 3rd International Conference on Information Systems and Economic Intelligence (SIIE’2010), Feb 2010, Sousse, Tunisia
Accès au bibtex

titre: Utilisation de graphes sémantiques pour l’extraction et la traduction des idées essentielles d’un texte en langue étrangère
auteur: Romain André-Lovichi, Kamel Smaïli, David Langlois
article: 10ième Conférence Internationale Francophone sur l’Extraction et la Gestion des Connaissances – EGC 2010, Jan 2010, Hammamet, Tunisia. pp.687-688
Accès au bibtex

Book sections

titre: Inversion acoustique articulatoire
auteur: Yves Laprie
article: SFA (Société Française d’Acoustique). Le livre blanc de l’acoustique en France en 2010, SFA (Société Française d’Acoustique), pp.91, 2010, 978-2-919340-00-2
Accès au bibtex

Habilitation à diriger des recherches

titre: Quelques contributions en reconnaissance automatique de la parole robuste
auteur: Christophe Cerisara
article: Interface homme-machine [cs.HC]. Université Henri Poincaré – Nancy I, 2010
Accès au texte intégral et bibtex

Other publications

titre: Création de livres numériques pour enfants présentant des troubles du langage
auteur: Agnès Piquard-Kipffer, Denis Lelarge, Laurent Pierron, Fabian Monnay
article: 2010
Accès au texte intégral et bibtex

titre: EVALEC, Batterie informatisée d’évaluation diagnostique des troubles spécifiques d’apprentissage de la lecture.
auteur: Liliane Sprenger-Charolles, Pascale Colé, Agnès Piquard-Kipffer, Gilles Leloup
article: 2010
Accès au bibtex

Books

titre: Information Systems and Economic Intelligence: Proceedings of the 3d international conference SIIE 2010 : February 18-20, 2010 (in Sousse, Tunisia)
auteur: Sahbi Sidhom, Kamel Smaïli, Malek Ghenima
article: Kamel Smaili and Sahbi Sidhom and Malek Ghenima. IHE Tunis, 1, pp.400, 2010, Kamel SMAILI (France), Sahbi SIDHOM (France), Malek GHENIMA (Tunisia), 978-9973-868-24-4
Accès au bibtex

Patents

titre: Synthétiseur numérique audio amélioré
auteur: Joseph Di Martino, Laurent Pierron
article: France, Patent n° : 10/02674. 2010
Accès au bibtex

Theses

titre: Inter-lingual Triggers for Statistical Machine Translation
auteur: Caroline Lavecchia
article: Informatique [cs]. Université Nancy II, 2010. Français.
Accès au texte intégral et bibtex

2009

Journal articles

titre: Efficient likelihood evaluation and dynamic Gaussian selection for HMM-based speech recognition
auteur: Jun Cai, Ghazi Bouselmi, Yves Laprie, Jean-Paul Haton
article: Computer Speech and Language, 2009, 23 (2), pp.147-256
Accès au bibtex

titre: Automatic discovery of topics and acoustic morphemes from speech
auteur: Christophe Cerisara
article: Computer Speech and Language, 2009, 23 (2), pp.220-239. ⟨10.1016/j.csl.2008.06.004⟩
Accès au bibtex

titre: Reliability and prevalence of an atypical development of phonological skills in French-speaking dyslexics
auteur: Liliane Sprenger-Charolles, Pascale Colé, Agnès Kipffer-Piquard, Florence Pinton, Catherine Billard
article: Reading and Writing : an interdisciplinary journal, 2009, 22 (7), pp.811-842. ⟨10.1007/s11145-008-9117-y⟩
Accès au texte intégral et bibtex

titre: Stabilité dans le temps des déficits en et hors lecture chez des adolescents dyslexiques (données longitudinales)
auteur: Liliane Sprenger-Charolles, Caroline Bogliotti, Agnès Piquard-Kipffer, Gilles Leloup
article: A.N.A.E. Approche neuropsychologique des apprentissages chez l’enfant, 2009, 21 (103), pp.243-253
Accès au texte intégral et bibtex

titre: Arabic Statistical N-gram Models
auteur: Karima Meftouh, Kamel Smaïli, Mohamed Tayeb Laskri
article: International Review on Computers and Software (IRECOS), 2009, 4 (1)
Accès au bibtex

titre: Missing data mask estimation with frequency and temporal dependencies
auteur: Sébastien Demange, Christophe Cerisara, Jean-Paul Haton
article: Computer Speech and Language, 2009, 23 (1), pp.25-41. ⟨10.1016/j.csl.2008.02.002⟩
Accès au bibtex

Conference papers

titre: Detection of OOV words by combining acoustic confidence measures with linguistic features
auteur: Frederik Stouten, Dominique Fohr, Irina Illina
article: The eleventh biannual IEEE workshop on Automatic Speech Recognition and Understanding (ASRU), Dec 2009, Merano, Italy. pp.1-4
Accès au bibtex

titre: Utilisation d’une grille polaire adaptative pour la construction d’un modèle articulatoire de la langue
auteur: Julie Busset
article: Rencontres Jeunes Chercheurs en Parole – RJCP 2009, Nov 2009, Avignon, France. pp.72
Accès au texte intégral et bibtex

titre: Analyse syntaxique du français parlé
auteur: Christophe Cerisara, Claire Gardent
article: Journée ATALA, Oct 2009, Paris, France
Accès au texte intégral et bibtex

titre: Speaker normalization for template based speech recognition
auteur: Sébastien Demange, Dirk van Compernolle
article: 10th Annual Conference of the International Speech Communication Association – Interspeech 2009, Sep 2009, Brighton, United Kingdom. pp.560–563
Accès au bibtex

titre: An Evaluation of Formant Tracking methods on an Arabic Database
auteur: Imen Jemaa, Oussama Rekhis, Kais Ouni, Yves Laprie
article: 10th Annual Conference of the International Speech Communication Association – INTERSPEECH 2009, Sep 2009, Brighton, United Kingdom
Accès au texte intégral et bibtex

titre: Efficient Combination of Confidence Measures for Machine Translation
auteur: Sylvain Raybaud, David Langlois, Kamel Smaïli
article: 10th Annual Conference of the International Speech Communication Association – INTERSPEECH 2009, Sep 2009, Brighton, United Kingdom
Accès au texte intégral et bibtex

titre: A robust variational method for the acoustic-to-articulatory problem
auteur: Blaise Potard, Yves Laprie
article: 10th Annual Conference of the International Speech Communication Association – INTERSPEECH 2009, Sep 2009, Brighton, United Kingdom
Accès au texte intégral et bibtex

titre: Articulatory Modeling Based on Semi-polar Coordinates and Guided PCA Technique
auteur: Jun Cai, Yves Laprie, Julie Busset, Fabrice Hirsch
article: 10th Annual Conference of the International Speech Communication Association – INTERSPEECH 2009, Sep 2009, Brighton, United Kingdom
Accès au texte intégral et bibtex

titre: JTrans, an open-source software for semi-automatic text-to-speech alignment
auteur: Christophe Cerisara, Odile Mella, Dominique Fohr
article: Proceedings of the 10th Annual Conference of the International Speech Communication Association – Interspeech 2009, Sep 2009, Brighton, United Kingdom
Accès au texte intégral et bibtex

titre: Contextual effects on protrusion and lip opening for /i,y/
auteur: Anne Bonneau, Julie Busset, Brigitte Wrobel-Dautcourt
article: 10th Annual Conference of the International Speech Communication Association – Interspeech 2009, ISCA, Sep 2009, Brighton, United Kingdom
Accès au texte intégral et bibtex

titre: HEAR: An hybrid episodic-abstract speech recognizer
auteur: Sébastien Demange, Dirk van Compernolle
article: 10th Annual Conference of the International Speech Communication Association – Interspeech 2009, Sep 2009, Brighton, United Kingdom. pp.3067–3070
Accès au bibtex

titre: Word- and sentence-level confidence measures for machine translation
auteur: Sylvain Raybaud, Caroline Lavecchia, David Langlois, Kamel Smaïli
article: 13th Annual Meeting of the European Association for Machine Translation – EAMT 09, May 2009, Barcelona, Spain
Accès au texte intégral et bibtex

titre: Comparing TR-Classifier and KNN by using Reduced Sizes of Vocabularies
auteur: Mourad Abbas, Kamel Smaïli, D Berkani
article: 3rd International Conference on Arabic Language Processing, May 2009, Rabat, Morocco
Accès au texte intégral et bibtex

titre: Registration of Multimodal Data for Estimating the Parameters of an Articulatory Model
auteur: Michael Aron, Asterios Toutios, Marie-Odile Berger, Erwan Kerrien, Brigitte Wrobel-Dautcourt, Yves Laprie
article: IEEE International Conference on Acoustics, Speech, and Signal Processing – ICASSP 2009, Apr 2009, Taipei, Taiwan. pp.4489-4492, ⟨10.1109/ICASSP.2009.4960627⟩
Accès au texte intégral et bibtex

titre: Studying pharyngealisation using an articulograph
auteur: Slim Ouni, Yves Laprie
article: International Workshop on Pharyngeals and Pharyngealisation, Mar 2009, Newcastle, United Kingdom
Accès au bibtex

titre: Multi-Category Support Vector Machines for Identifying Arabic Topics
auteur: Mourad Abbas, Kamel Smaïli, Daoud Berkani
article: 10th International Conference on Intelligent Text Processing and Computational Linguistics – CICLing 2009, Mar 2009, Mexico, Mexico
Accès au texte intégral et bibtex

titre: Massive Pruning for Building an Operational Set of Association Rules: Metarules for Eliminating Conflicting and Redundant Rules.
auteur: Martine Cadot, Alain Lelu
article: International Conference on Information, Process, and Knowledge Management – eKnow09, Feb 2009, Cancun, Mexico. pp.90-98
Accès au bibtex

titre: Comparative study of Arabic and French statistical language models
auteur: Karima Meftouh, Kamel Smaïli, Med Tayeb Laskri
article: ICAART’09 – International Conference on Agents and Artificial Intelligence, INSTICC, Jan 2009, Porto, Portugal
Accès au texte intégral et bibtex

titre: New Confidence Measures for Statistical Machine Translation
auteur: Sylvain Raybaud, Caroline Lavecchia, David Langlois, Kamel Smaïli
article: International Conference On Agents and Artificial Intelligence – ICAART 09, Jan 2009, Porto, Portugal
Accès au texte intégral et bibtex

titre: Recherche par le contenu dans des documents audiovisuels multilingues
auteur: Georges Quénot, Tien-Ping Tan, Viet-Bac Le, Stéphane Ayache, Laurent Besacier, Philippe Mulhem
article: Actes de la conférence CORIA, 2009, Giens, France. pp.67-82
Accès au bibtex

titre: Content-Based Search in Multilingual Audiovisual Documents using the International Phonetic Alphabet
auteur: Georges Quénot, Tien-Ping Tan, Viet-Bac Le, Stéphane Ayache, Laurent Besacier, Philippe Mulhem
article: 7th International Workshop on Content-Based Multimedia Indexing (CBMI 2009), 2009, Chania, Crete. 3-5 June 2009
Accès au bibtex

Book sections

titre: Acquisition multimodale de données articulatoires
auteur: Michael Aron, Marie-Odile Berger, Erwan Kerrien, Yves Laprie
article: Alain Marchal et Christian Cavé. L’imagerie médicale pour l’étude de la parole, Hermes Science Publications, pp.175-196, 2009, Traité Cognition et Traitement de l’Information, IC2, 978-2-7462-2235-9
Accès au bibtex

Other publications

titre: Voice Disguise and Reversibility
auteur: Patrick Perrot, Joseph Razik, Gérard Chollet
article: 2009
Accès au bibtex

titre: Voice Forgery Based on Audio Web Data
auteur: Patrick Perrot, Joseph Razik, Gérard Chollet
article: 2009
Accès au bibtex

2008

Journal articles

titre: À propos de l’intelligence artificielle
auteur: Jean-Paul Haton, Joanna Jongwane
article: Interstices, 2008
Accès au bibtex

titre: Incorporation of phonetic constraints in acoustic-to-articulatory inversion
auteur: Blaise Potard, Yves Laprie, Slim Ouni
article: Journal of the Acoustical Society of America, 2008, 123 (4), pp.2310-2323. ⟨10.1121/1.2885747⟩
Accès au texte intégral et bibtex

titre: Selective acoustic cues for French voiceless stop consonants
auteur: Anne Bonneau, Yves Laprie
article: Journal of the Acoustical Society of America, 2008, 123 (6), pp.4482-4497
Accès au bibtex

Conference papers

titre: Comparison between two predicting methods of labial coarticulation
auteur: Vincent Robert, Jacques Feldmar, Yves Laprie
article: The eighth International Seminar on Speech Production – ISSP’08, Dec 2008, Strasbourg, France
Accès au texte intégral et bibtex

titre: Improving the Sampling of the Null Space of the Acoustic-to-Articulatory Mapping
auteur: Blaise Potard, Yves Laprie
article: The eighth International Seminar on Speech Production – ISSP’08, Dec 2008, Strasbourg, France
Accès au texte intégral et bibtex

titre: Protocol for a Model-based Evaluation of a Dynamic Acoustic-to-Articulatory Inversion Method using Electromagnetic Articulography
auteur: Asterios Toutios, Slim Ouni, Yves Laprie
article: The eighth International Seminar on Speech Production – ISSP’08, INRIA, Dec 2008, Strasbourg, France
Accès au texte intégral et bibtex

titre: Inversion of fricative consonants in vocalic context based on a hypercuboid articulatory table
auteur: Farid Feiz, Yves Laprie
article: 8th International Seminar on Speech Production (ISSP-2008), Dec 2008, Strasbourg, France
Accès au bibtex

titre: Comprehension Improvement using Local Confidence Measure: Towards Automatic Transcription for Classroom
auteur: Joseph Razik, Odile Mella, Dominique Fohr, Jean-Paul Haton
article: Workshop on Child, Computer and Interaction WOCCI08, ICMI’08 post-conference workshop, Oct 2008, Chania, Greece. pp.5
Accès au bibtex

titre: Evaluation of dialogue act recognition approaches
auteur: Pavel Kral, Tomas Pavelka, Christophe Cerisara
article: IEEE International Workshop on Machine Learning for Signal Processing, Oct 2008, Cancun, Mexico
Accès au bibtex

titre: Conflict Ontology Enrichment Based on Triggers
auteur: Chahnaz Zakaria, Olivier Curé, Kamel Smaïli
article: The 2nd International workshop on Ontologies and Information Systems for the Semantic Web, ACM 17th Conference on Information and Knowledge Management, Oct 2008, Napa Valley, California, United States
Accès au texte intégral et bibtex

titre: Pronunciation Training: The Role of Eye and Ear
auteur: Dominic Massaro, Stephanie Bigler, Trevor Chen, Marcus Perlman, Slim Ouni
article: Interspeech 2008, Sep 2008, Brisbane, Australia. pp.FriSe3.O4-2
Accès au bibtex

titre: Discovering Phrases in Machine Translation by Simulated Annealing
auteur: Caroline Lavecchia, David Langlois, Kamel Smaïli
article: INTERSPEECH 2008 – 9th Annual Conference of the International Speech Communication Association, Sep 2008, Brisbane, Australia. pp.2354-2357
Accès au texte intégral et bibtex

titre: Frame-Synchronous and Local Confidence Measures for on-the-fly Automatic Speech Recognition.
auteur: Joseph Razik, Odile Mella, Dominique Fohr, Jean-Paul Haton
article: InterSpeech, Sep 2008, Brisbane, Australia
Accès au bibtex

titre: Aspects of Pharyngealized Phonemes in Arabic Using Articulography
auteur: Slim Ouni
article: Interspeech, Sep 2008, Brisbane, Australia. pp.FriSe3.P4-6
Accès au bibtex

titre: Foreign accent identification based on prosodic parameters
auteur: Marina Piat, Dominique Fohr, Irina Illina
article: INTERSPEECH, Sep 2008, Brisbane, Australia
Accès au bibtex

titre: Multi-Accent and Accent-Independent Non-Native Speech Recognition
auteur: Ghazi Bouselmi, Dominique Fohr, Irina Illina
article: INTERSPEECH, Sep 2008, Brisbane, Australia
Accès au bibtex

titre: How can acoustic-to-articulatory maps be constrained?
auteur: Yves Laprie, Petros Maragos, Jean Schoentgen
article: 16th European Signal Processing Conference – EUSIPCO 2008, Aug 2008, Lausanne, Switzerland
Accès au texte intégral et bibtex

titre: Acoustic-to-articulatory inversion of fricatives
auteur: Farid Feiz, Blaise Potard, Yves Laprie
article: Meeting of the Acoustical Society of America (Acoustics’08), Jul 2008, Paris, France
Accès au bibtex

titre: Transcribing Southern Min Speech Corpora with a Web-Based Language Learning System
auteur: Jun Cai, Jacques Feldmar, Yves Laprie, Dominique Fohr, Jean-Paul Haton
article: International Conference on Audio, Language and Image Processing – ICALIP 2008, Jul 2008, Shanghai, China
Accès au texte intégral et bibtex

titre: Exploiting confidence measures for missing data speech recognition
auteur: Christophe Cerisara
article: Proceedings on Acoustics’08, Jul 2008, Paris, France
Accès au texte intégral et bibtex

titre: Inversion des fricatives par codebook hypercuboïque
auteur: Farid Feiz, Blaise Potard, Yves Laprie
article: Journées d’Études de la Parole – JEP 2008, Jun 2008, Avignon, France. pp.1671
Accès au bibtex

titre: Une alternative aux modèles de traduction statistique d’IBM : Les triggers inter-langues
auteur: Caroline Lavecchia, Kamel Smaïli, David Langlois
article: 15eme conférence sur le Traitement Automatique des Langues Naturelles – TALN’08, Jun 2008, Avignon, France
Accès au texte intégral et bibtex

titre: Identification de l’origine des locuteurs non natifs en utilisant des paramètres prosodiques
auteur: Marina Piat, Dominique Fohr, Irina Illina
article: XXVIIèmes Journées d’Étude sur la Parole – JEP’08, Jun 2008, Avignon, France
Accès au bibtex

titre: Phrase-Based Machine Translation based on Simulated Annealing
auteur: Caroline Lavecchia, David Langlois, Kamel Smaïli
article: Sixth international conference on Language Resources and Evaluation – LREC 2008, May 2008, Marrakech, Morocco
Accès au texte intégral et bibtex

titre: Dynamic Gaussian Selection Technique for Speeding Up HMM-Based Continuous Speech Recognition
auteur: Jun Cai, Ghazi Bouselmi, Dominique Fohr, Yves Laprie
article: ICASSP, Apr 2008, Las Vegas, United States
Accès au bibtex

titre: Arabic statistical language modeling
auteur: Karima Meftouh, Kamel Smaïli, Mohamed-Tayeb Laskri
article: 9es Journées internationales d’Analyse statistique des Données Textuelles – JADT 2008, Mar 2008, Lyon, France. pp.837-844
Accès au texte intégral et bibtex

titre: Mesures de confiance locales et trame-synchrones
auteur: Joseph Razik, Odile Mella, Dominique Fohr, Jean-Paul Haton
article: XXVIIèmes Journées d’Etude sur la Parole – JEP 2008, 2008, Avignon, France
Accès au bibtex

titre: Transcription automatique pour malentendants : amélioration à l’aide de mesures de confiance locales
auteur: Joseph Razik, Odile Mella, Dominique Fohr, Jean-Paul Haton
article: XXVIIèmes Journées d’Etude sur la Parole – JEP 2008, 2008, Avignon, France
Accès au bibtex

Patents

titre: Method and device for speech synthesis
auteur: Vincent Colotte, Richard Beaufort
article: European Union, Patent No. EP 1589524. 2008
Accès au bibtex

Reports

titre: Automatic Transcription for the Hard of Hearing: Comprehension Improvement by Introducing Local Confidence Measures
auteur: Joseph Razik, Odile Mella, Dominique Fohr, Jean-Paul Haton
article: [Internal report] 2008, pp.4
Accès au bibtex

Theses

titre: Contributions à la reconnaissance automatique de la parole non-native
auteur: Ghazi Bouselmi
article: Human-Computer Interaction [cs.HC]. Université Henri Poincaré – Nancy I, 2008. French.
Accès au texte intégral et bibtex

titre: Modelling of labial coarticulation: implementation for a talking head
auteur: Vincent Robert
article: Other [cs.OH]. Université Henri Poincaré – Nancy I, 2008. French.
Accès au texte intégral et bibtex

titre: Acoustic-to-articulatory inversion with constraints
auteur: Blaise Potard
article: Human-Computer Interaction [cs.HC]. Université Henri Poincaré – Nancy 1, 2008. French. ⟨NNT : 2008NAN10085⟩
Accès au texte intégral et bibtex

2007

Journal articles

titre: Lexical Structure for Dialogue Act Recognition
auteur: Pavel Kral, Christophe Cerisara, Jana Kleckova
article: Journal of Multimedia, 2007, 2 (3), pp.1-8
Accès au bibtex

titre: Visual Contribution to Speech Perception: Measuring the Intelligibility of Animated Talking Heads
auteur: Slim Ouni, Michael V. Cohen, Hope Ishak, Dominic Massaro
article: EURASIP Journal on Audio, Speech, and Music Processing, 2007, 2007, pp.ID 47891. ⟨10.1155/2007/47891⟩
Accès au bibtex

titre: On noise masking for automatic missing data speech recognition: a survey and discussion
auteur: Christophe Cerisara, Sébastien Demange, Jean-Paul Haton
article: Computer Speech and Language, 2007, 21 (3), pp.443-457
Accès au bibtex

titre: Anxiety in Mice: A Principal Component Analysis Study
auteur: Yan Clément, Chantal Joubert, Caroline Kopp, Eve M. Lepicard, Patrice Venault, René Misslin, Martine Cadot, Georges Chapouthier
article: Neural plasticity, 2007, ⟨10.1155/2007/35457⟩
Accès au bibtex

Conference papers

titre: Building a bilingual dictionary from movie subtitles based on inter-lingual triggers
auteur: Caroline Lavecchia, Kamel Smaïli, David Langlois
article: Translating and the Computer, Nov 2007, London, United Kingdom
Accès au texte intégral et bibtex

titre: Modélisation connexionniste du traitement de l’accès lexical et mise en relation avec des données électro-encéphalographiques
auteur: Laurent Bougrain, Jacques Feldmar, Enrique Sidhoum
article: Colloque de l’Association pour la Recherche Cognitive – ARCo’07 : Cognition – Complexité – Collectif, ARCo – INRIA – EKOS, Nov 2007, Nancy, France. pp.Poster
Accès au bibtex

titre: Text-Independent Foreign Accent Classification Using Statistical Methods
auteur: Dominique Fohr, Irina Illina
article: International Conference on Signal Processing and Communications, Nov 2007, Dubai, United Arab Emirates. pp.4
Accès au bibtex

titre: Importance of Prosody for Dialogue Acts Recognition
auteur: Pavel Kral, Christophe Cerisara, Jana Kleckova
article: XIIth International Conference “Speech and Computer” – SPECOM’07, Oct 2007, Moscow, Russia
Accès au bibtex

titre: Constitution d’un corpus de la langue Arabe à partir du Web
auteur: K. Meftouh, Kamel Smaïli, Med Tayeb Laskri
article: Colloque International sur le Traitement Automatique de la Langue Arabe – CITALA’07, Oct 2007, Rabat, Morocco
Accès au bibtex

titre: Arabic Pharyngeals in Visual Speech
auteur: Slim Ouni, Kais Ouni
article: International Conference on Auditory-Visual Speech Processing 2007 (AVSP), Aug 2007, Hilvarenbeek, Netherlands. pp.212-215
Accès au bibtex

titre: Acquisition and synchronization of multimodal articulatory data
auteur: Michael Aron, Nicolas Ferveur, Erwan Kerrien, Marie-Odile Berger, Yves Laprie
article: 8th Annual Conference of the International Speech Communication Association – Interspeech’07, Aug 2007, Antwerp, Belgium. pp.1398-1401
Accès au texte intégral et bibtex

titre: Compact representations of the articulatory-to-acoustic mapping
auteur: Blaise Potard, Yves Laprie
article: INTERSPEECH 2007, Aug 2007, Antwerp, Belgium. pp.2481-2483
Accès au texte intégral et bibtex

titre: Speaker Diarization using Normalized Cross Likelihood Ratio
auteur: Viet-Bac Le, Odile Mella, Dominique Fohr
article: INTERSPEECH 2007, Aug 2007, Antwerp, Belgium. pp.4
Accès au bibtex

titre: Combined Acoustic and Pronunciation Modelling for Non-Native Speech Recognition
auteur: Ghazi Bouselmi, Dominique Fohr, Irina Illina
article: InterSpeech 2007, Aug 2007, Antwerp, Belgium
Accès au texte intégral et bibtex

titre: Aspects of Visual Speech in Arabic
auteur: Slim Ouni, Kais Ouni
article: Interspeech 2007, Aug 2007, Antwerp, Belgium. pp.WeB.P1a-7
Accès au bibtex

titre: Accurate marginalization range for missing data recognition
auteur: Sébastien Demange, Christophe Cerisara, Jean-Paul Haton
article: INTERSPEECH 2007, Aug 2007, Antwerp, Belgium
Accès au bibtex

titre: Using inter-lingual triggers for Machine translation
auteur: Caroline Lavecchia, Kamel Smaïli, David Langlois, Jean-Paul Haton
article: 8th Annual Conference of the International Speech Communication Association – INTERSPEECH 2007, Aug 2007, Antwerp, Belgium. pp.2829-2832
Accès au texte intégral et bibtex

titre: A phonetic concatenative approach of labial coarticulation
auteur: Vincent Robert, Yves Laprie, Anne Bonneau
article: INTERSPEECH 2007, ISCA, Aug 2007, Antwerp, Belgium. pp.1402-1405
Accès au texte intégral et bibtex

titre: Construction of perception stimuli with copy synthesis
auteur: Yves Laprie, Anne Bonneau
article: 16th International Congress of Phonetic Sciences – ICPhS 2007, Aug 2007, Saarbrücken, Germany
Accès au texte intégral et bibtex

titre: Tools devoted to the acquisition of the prosody of a foreign language
auteur: Guillaume Henry, Anne Bonneau, Vincent Colotte
article: International Congress of Phonetic Sciences – ICPhS 2007, Aug 2007, Saarbrücken, Germany. pp.1593-1596
Accès au bibtex

titre: Building Parallel Corpora from Movies
auteur: Caroline Lavecchia, Kamel Smaïli, David Langlois
article: The 4th International Workshop on Natural Language Processing and Cognitive Science – NLPCS 2007, Jun 2007, Funchal, Madeira, Portugal
Accès au texte intégral et bibtex

titre: Fusion de capteurs électromagnétiques et d’échographies pour le suivi de la langue
auteur: Michael Aron, Erwan Kerrien, Marie-Odile Berger, Yves Laprie
article: Onzième congrès francophone des jeunes chercheurs en vision par ordinateur – ORASIS’07, Jun 2007, Obernai, France
Accès au texte intégral et bibtex

titre: Random simulations of a datatable for efficiently mining reliable and non-redundant itemsets
auteur: Martine Cadot, Pascal Cuxac, Alain Lelu
article: 12th International Conference on Applied Stochastic Models and Data Analysis – ASMDA 2007, May 2007, Chania, Greece
Accès au texte intégral et bibtex

titre: Détection de la langue maternelle de locuteurs non natifs fondée sur l’extraction de séquences discriminantes de phonèmes
auteur: Ghazi Bouselmi, Dominique Fohr, Irina Illina, Jean-Paul Haton
article: Traitement et Analyse de l’Information Méthodes et Applications, May 2007, Hammamet, Tunisia. pp.6
Accès au bibtex

titre: Amélioration des Performances des Systèmes Automatiques de Reconnaissance de la Parole pour la Parole Non Native
auteur: Ghazi Bouselmi, Dominique Fohr, Irina Illina, Jean-Paul Haton
article: Traitement et Analyse de l’Information : Méthodes et Applications – TAIMA’07, Jean-Paul Haton and Faouzi Ghorbel, May 2007, Hammamet, Tunisia
Accès au texte intégral et bibtex

titre: Confidence measures for semi-automatic labeling of dialog acts
auteur: Pavel Kral, Christophe Cerisara, Jana Kleckova
article: IEEE International Conference on Acoustics, Speech and Signal Processing – ICASSP 2007, Apr 2007, Honolulu, United States
Accès au bibtex

titre: Natural language processing for usage based indexing of web resources
auteur: Anne Boyer, Armelle Brun
article: 29th European Conference on Information Retrieval – ECIR’07, Fondazione Ugo Bordoni; BCS-IRSG; ACM SIGIR, Apr 2007, Rome, Italy. pp.517-524, ⟨10.1007/978-3-540-71496-5_46⟩
Accès au texte intégral et bibtex

titre: L’adaptation des doses d’insuline chez les sujets diabétiques de type 2.
auteur: Dardari Dured, Sylvia Franc, Jacques Feldmar
article: Diabétologie et Endocrinologie, Mar 2007, Marseille, France
Accès au bibtex

titre: Usage based indexing of web resources with natural language processing
auteur: Armelle Brun, Anne Boyer
article: 3rd International Conference on Web Information Systems and Technologies – Webist 07, INSTICC – Institute for Systems and Technologies of Information, Control and Communication ; Open University of Catalonia, Mar 2007, Barcelona, Spain
Accès au texte intégral et bibtex

titre: Improving language models by using distant information
auteur: Armelle Brun, David Langlois, Kamel Smaïli
article: International Symposium on Signal Processing and its Applications – ISSPA 2007, Feb 2007, Sharjah, United Arab Emirates
Accès au texte intégral et bibtex

titre: Frame-Synchronous And Local Confidence Measures For On-The-Fly Keyword Spotting
auteur: Joseph Razik, Odile Mella, Dominique Fohr, Jean-Paul Haton
article: International Symposium on Signal Processing and its Applications – ISSPA 2007, Feb 2007, Sharjah, United Arab Emirates. pp.1-4
Accès au bibtex

titre: Discriminative Phoneme Sequences Extraction for Non-Native Speaker’s Origin Classification
auteur: Ghazi Bouselmi, Dominique Fohr, Irina Illina, Jean-Paul Haton
article: International Symposium on Signal Processing and its Applications – ISSPA 2007, Feb 2007, Sharjah, United Arab Emirates
Accès au texte intégral et bibtex

titre: Towards a statistical grammar of usage for document retrieval in digital libraries
auteur: Anne Boyer, Armelle Brun
article: 9th International Symposium on Signal Processing and its Applications – ISSPA’07, Feb 2007, Sharjah, United Arab Emirates. pp.1-4, ⟨10.1109/ISSPA.2007.4555494⟩
Accès au bibtex

titre: Simuler et épurer pour extraire les motifs sûrs et non redondants
auteur: Martine Cadot, Alain Lelu
article: 7èmes Journées Francophones “Extraction et Gestion des Connaissances” – EGC 2007 – Troisième Atelier Qualité des Données et des Connaissances – QDC, Stéphane Lallich, Philippe Lenca et Fabrice Guillet, Jan 2007, Namur, Belgium. pp.15-24
Accès au texte intégral et bibtex

Book sections

titre: Prédiction phonétique de la coarticulation labiale
auteur: Vincent Robert, Anne Bonneau, Brigitte Wrobel-Dautcourt, Yves Laprie
article: B. Vaxelaire, R. Sock, G. Kleiber et F. Marsac. Perturbations et réajustements : langue et langage, Publications de l’Université Marc Bloch (Strasbourg), pp.155-167, 2007, 978-2-35410-001-8
Accès au texte intégral et bibtex

titre: Selecting Representative Speakers for a Speech Database on the Basis of Heterogeneous Similarity Criteria
auteur: Frédéric Bimbot, Olivier Boëffard, Delphine Charlet, Dominique Fohr, Sacha Krstulovic, Odile Mella
article: Christian Müller. Speaker Classification II, 4441, Springer Berlin / Heidelberg, pp.276-292, 2007, Lecture Notes in Computer Science, 978-3-540-74121-3. ⟨10.1007/978-3-540-74122-0_21⟩
Accès au bibtex

titre: Nouvelles formes d’interaction homme-machine pour l’informatique diffuse
auteur: Christophe Cerisara, Yvon Haradji
article: Marc Dupuis. Informatique diffuse, 31 (31), OFTA, 2007, ARAGO
Accès au bibtex

titre: Inversion acoustique-articulatoire en utilisant des contraintes phonétiques
auteur: Blaise Potard, Yves Laprie
article: B. Vaxelaire, R. Sock, G. Kleiber et F. Marsac. Perturbations et réajustements : langue et langage, Publications de l’Université Marc Bloch (Strasbourg), 2007, 978-2-35410-001-8
Accès au texte intégral et bibtex

Books

titre: Actes des ateliers TAIMA’07
auteur: Faouzi Ghorbel, Stéphane Derrode, Jean-Paul Haton
article: 978-9973-61-802-3, pp.576, 2007
Accès au bibtex

Reports

titre: Evaluation of a talking head for helping HOH people in the classroom
auteur: Lorène Mourot, Marie Rovel, Jacques Feldmar
article: [Research report] 2007
Accès au bibtex

Theses

titre: Speech/music segmentation for automatic transcription of continuous speech
auteur: Emmanuel Didiot
article: Acoustics [physics.class-ph]. Université Henri Poincaré – Nancy 1, 2007. French.
Accès au texte intégral et bibtex

titre: Automatic Recognition of Dialogue Acts
auteur: Pavel Kral
article: Modeling and Simulation. Université Henri Poincaré – Nancy 1, 2007. English. ⟨NNT : 2007NAN10114⟩
Accès au texte intégral et bibtex

titre: Contributions to automatic speech recognition with missing data
auteur: Sébastien Demange
article: Acoustics [physics.class-ph]. Université Henri Poincaré – Nancy 1, 2007. French.
Accès au texte intégral et bibtex

titre: Local and frame-synchronous confidence measures for automatic speech recognition
auteur: Joseph Razik
article: Human-Computer Interaction [cs.HC]. Université Henri Poincaré – Nancy I, 2007. French.
Accès au texte intégral et bibtex

2006

Journal articles

titre: Optimizing the coverage of a speech database through a selection of representative speaker recordings
auteur: Sacha Krstulovic, Frédéric Bimbot, Olivier Boëffard, Delphine Charlet, Dominique Fohr, Odile Mella
article: Speech Communication, 2006, 48 (10), pp.1319-1348. ⟨10.1016/j.specom.2006.07.002⟩
Accès au bibtex

Conference papers

titre: Evaluation of phonetic constraints used in acoustic-to-articulatory inversion
auteur: Yves Laprie, Blaise Potard, Anne Bonneau
article: 7th International Seminar on Speech Production – ISSP 2006, Dec 2006, São Paulo, Brazil
Accès au bibtex

titre: Adapting visual data to a linear articulatory model
auteur: Yves Laprie, Blaise Potard
article: 7th International Seminar on Speech Production – ISSP 2006, Dec 2006, São Paulo, Brazil
Accès au texte intégral et bibtex

titre: Coupling electromagnetic sensors and ultrasound images for tongue tracking: acquisition setup and preliminary results
auteur: Michael Aron, Erwan Kerrien, Marie-Odile Berger, Yves Laprie
article: 7th International Seminar on Speech Production – ISSP’06, Dec 2006, Ubatuba, Brazil
Accès au texte intégral et bibtex

titre: Making learners aware of the prosody of a foreign language
auteur: Guillaume Henry, Anne Bonneau, Vincent Colotte
article: Nov 2006, 5 p
Accès au bibtex

titre: L’étude Labiao, une aide à l’intégration des étudiants sourds, un partenariat scientifique et pédagogique
auteur: Jacques Feldmar, Jack Sagot, Philippe Suignard
article: Colloque inaugural de l’INS HEA, Oct 2006, Suresnes, France
Accès au bibtex

titre: How to handle gender and number agreement in statistical language models?
auteur: Caroline Lavecchia, Kamel Smaïli, Jean-Paul Haton
article: Ninth International Conference on Spoken Language Processing – INTERSPEECH 2006, Sep 2006, Pittsburgh, Pennsylvania, United States
Accès au texte intégral et bibtex

titre: Multilingual Non-Native Speech Recognition using Phonetic Confusion-Based Acoustic Model Modification and Graphemic Constraints
auteur: Ghazi Bouselmi, Dominique Fohr, Irina Illina, Jean-Paul Haton
article: The Ninth International Conference on Spoken Language Processing – ICSLP 2006, Sep 2006, Pittsburgh, Pennsylvania, United States
Accès au texte intégral et bibtex

titre: Missing data mask models with global frequency and temporal constraints
auteur: Sébastien Demange, Christophe Cerisara, Jean-Paul Haton
article: Ninth International Conference on Spoken Language Processing – Interspeech 2006 – ICSLP, Sep 2006, Pittsburgh, Pennsylvania, United States
Accès au texte intégral et bibtex

titre: Reconnaissance de parole non native fondée sur l’utilisation de confusion phonétique et de contraintes graphèmiques
auteur: Ghazi Bouselmi, Dominique Fohr, Jean-Paul Haton, Irina Illina
article: XXVIes Journées d’Etude sur la Parole – JEP’06, Jun 2006, Saint-Malo, France
Accès au texte intégral et bibtex

titre: Adjonction de contraintes visuelles pour l’inversion acoustique-articulatoire
auteur: Yves Laprie, Blaise Potard
article: Journées d’Études sur la Parole – JEP 2006, Jun 2006, Dinard, France
Accès au texte intégral et bibtex

titre: Towards Speaker and Environmental Robustness in ASR: The HIWIRE Project
auteur: Alexandros Potamianos, Ghazi Bouselmi, Dimitrios Dimitriadis, Dominique Fohr, Roberto Gemello, Irina Illina, Franco Mana, Petros Maragos, M. Matassoni, Vassilis Pitsikalis, J. Ramirez, E. Sanchez-Soto, J. Segura, P. Svaizer
article: SRIV’06 ITRW on Speech Recognition and Intrinsic Variation, May 2006, Toulouse, France
Accès au texte intégral et bibtex

titre: Mask Estimation For Missing Data Recognition Using Background Noise Sniffing
auteur: Sébastien Demange, Christophe Cerisara, Jean-Paul Haton
article: IEEE International Conference on Acoustics, Speech, and Signal Processing – ICASSP 2006, May 2006, Toulouse, France
Accès au texte intégral et bibtex

titre: Fully Automated Non-Native Speech Recognition Using Confusion-Based Acoustic Model Integration And Graphemic Constraints
auteur: Ghazi Bouselmi, Dominique Fohr, Irina Illina, Jean-Paul Haton
article: IEEE International Conference on Acoustics, Speech, and Signal Processing – ICASSP 2006, May 2006, Toulouse, France
Accès au texte intégral et bibtex

titre: Document stream clustering : experimenting an incremental algorithm and AR-based tools for highlighting dynamic trends
auteur: Alain Lelu, Martine Cadot, Pascal Cuxac
article: International Workshop on Webometrics, Informetrics and Scientometrics & Seventh COLLNET Meeting, LORIA, May 2006, Nancy, France. pp.345-352
Accès au texte intégral et bibtex

titre: Linguistic features modeling based on Partial New Cache
auteur: Kamel Smaïli, Caroline Lavecchia, Jean-Paul Haton
article: International Conference on Language Resources and Evaluation – LREC 2006, May 2006, Magazzini del Cotone Conference Center, Genoa, Italy
Accès au texte intégral et bibtex

titre: Automatic Speech Recognition and Intrinsic Speech Variation
auteur: M Benzeguiba, Renato de Mori, O Deroo, Simon Dupont, T Erbes, Denis Jouvet, L Fissore, P Laface, A Mertins, C Ris, R Rose, V Tyagi, C Wellekens
article: IEEE International Conference on Acoustics, Speech and Signal Processing, May 2006, Toulouse, France. ⟨10.1109/ICASSP.2006.1661452⟩
Accès au bibtex

titre: Exploration et utilisation d’informations distantes dans les modèles de langage statistiques
auteur: Armelle Brun, David Langlois, Kamel Smaïli
article: 13ème Conférence sur le Traitement Automatique des Langues Naturelles – TALN’2006, Apr 2006, Leuven, Belgium. pp.425-434
Accès au bibtex

titre: Aide à l’interprétation des règles d’association composées
auteur: Martine Cadot, Pascal Cuxac, Claire François
article: Extraction et Gestion des Connaissances (EGC 2006), Jan 2006, Lille, France. pp.31-37
Accès au texte intégral et bibtex

titre: Règles d’association avec une prémisse composée : Mesure du gain d’information
auteur: Martine Cadot, Pascal Cuxac, Claire François
article: Extraction et Gestion des Connaissances (EGC 2006), Jan 2006, Lille, France. pp.599-600
Accès au texte intégral et bibtex

titre: Détection et correction automatique des déviations dans la réalisation de l’accent lexical anglais par des apprenants français
auteur: Guillaume Henry, Anne Bonneau, Vincent Colotte
article: 2006, pp.41–44
Accès au bibtex

titre: Automatic Dialog Acts Recognition based on Words Clusters
auteur: Pavel Kral, Jana Kleckova, Christophe Cerisara
article: 2006, 6 p
Accès au texte intégral et bibtex

titre: Sentence structure for dialog act recognition in Czech
auteur: Pavel Kral, Christophe Cerisara, Jana Kleckova, Tomas Pavelka
article: 2006
Accès au texte intégral et bibtex

titre: Automatic dialog acts recognition based on sentence structure
auteur: Pavel Kral, Christophe Cerisara, Jana Kleckova
article: 2006, pp.61-64
Accès au texte intégral et bibtex

titre: Mesures de confiance trame-synchrone
auteur: Joseph Razik, Odile Mella, Dominique Fohr, Jean-Paul Haton
article: 2006, pp.135–138
Accès au bibtex

titre: A Wavelet-Based Parameterization for Speech/Music Segmentation
auteur: Emmanuel Didiot, Dominique Fohr, Jean-Paul Haton, Irina Illina, Odile Mella
article: 2006, pp.653
Accès au bibtex

titre: Speech/music discrimination based on wavelets for broadcast programs
auteur: Emmanuel Didiot, Irina Illina, Odile Mella, Dominique Fohr, Jean-Paul Haton
article: 2006, pp.151
Accès au bibtex

titre: Une nouvelle approche fondée sur les ondelettes pour la discrimination parole/musique
auteur: Emmanuel Didiot, Irina Illina, Odile Mella, Dominique Fohr, Jean-Paul Haton
article: 2006, pp.209
Accès au bibtex

Books

titre: Reconnaissance Automatique de la Parole Du signal à son interprétation
auteur: Jean-Paul Haton, Christophe Cerisara, Dominique Fohr, Yves Laprie, Kamel Smaïli
article: DUNOD, pp.392, 2006, UniverSciences (Paris) – ISSN 1635-625X, 2-10-005842-8
Accès au bibtex

titre: Prédiction de la réussite ou de l’échec spécifiques en lecture au cycle 2. Suivi d’une population “à risque” et d’une population contrôle de la moyenne section de maternelle à la deuxième année de scolarisation primaire.
auteur: Agnès Kipffer-Piquard
article: ANRT – Lille, pp.277, 2006, Thèse à la carte
Accès au bibtex

Theses

titre: Extraction of Complex Relations in Humanistic : Statistics, Itemsets and Association Rules
auteur: Martine Cadot
article: Human-Computer Interaction [cs.HC]. Université de Franche-Comté, 2006. French.
Accès au texte intégral et bibtex

2005

Journal articles

titre: Modeling the articulatory space using a hypercube codebook for acoustic-to-articulatory inversion
auteur: Slim Ouni, Yves Laprie
article: Journal of the Acoustical Society of America, 2005, 118 (1), pp.444–460. ⟨10.1121/1.1921448⟩
Accès au bibtex

titre: Training Baldi to be multilingual: A case study for an Arabic Badr
auteur: Slim Ouni, Michael V. Cohen, Dominic Massaro
article: Speech Communication, 2005, 45, pp.115–137. ⟨10.1016/j.specom.2004.11.008⟩
Accès au bibtex

Conference papers

titre: Modeling and Flexible exploitation of Audio Documents
auteur: Mohamed Mbarki, Chantal Soulé-Dupuy, Nathalie Vallès-Parlangeau
article: 1st International Conference on Signal-Image Technology and Internet-Based Systems (SITIS 2005), Nov 2005, Yaoundé, Cameroon. pp.216-223
Accès au bibtex

titre: A Randomization Test for extracting Robust Association Rules
auteur: Martine Cadot
article: 3rd world conference on Computational Statistics & Data Analysis – CSDA 2005, Oct 2005, Limassol, Cyprus
Accès au texte intégral et bibtex

titre: Comparison of Topic Identification methods for Arabic Language
auteur: Mourad Abbas, Kamel Smaïli
article: International Conference on Recent Advances in Natural Language Processing – RANLP 2005, Sep 2005, Borovets, Bulgaria
Accès au texte intégral et bibtex

titre: Fully Automated Non-Native Speech Recognition Using Confusion-Based Acoustic Model Integration
auteur: Ghazi Bouselmi, Dominique Fohr, Irina Illina, Jean-Paul Haton
article: Interspeech’2005 – Eurospeech – 9th European Conference on Speech Communication and Technology, Sep 2005, Lisbon, Portugal. pp.1369-1372
Accès au texte intégral et bibtex

titre: Strategies of labial coarticulation
auteur: Vincent Robert, Brigitte Wrobel-Dautcourt, Yves Laprie, Anne Bonneau
article: Proceedings of the 9th European Conference on Speech Communication and Technology – Interspeech – Eurospeech 2005, Sep 2005, Lisbon, Portugal. pp.1021-1024
Accès au texte intégral et bibtex

titre: A low-cost stereovision based system for acquisition of visible articulatory data
auteur: Brigitte Wrobel-Dautcourt, Marie-Odile Berger, Blaise Potard, Yves Laprie, Slim Ouni
article: 5th Conference on Auditory-Visual Speech Processing – AVSP’2005, Jul 2005, Vancouver Island (BC), Canada
Accès au texte intégral et bibtex

titre: Inter Speaker variability of labial coarticulation with the view of developing a formal coarticulation model for French
auteur: Vincent Robert, Brigitte Wrobel-Dautcourt, Yves Laprie, Anne Bonneau
article: 5th Conference on Auditory-Visual Speech Processing – AVSP 2005, Jul 2005, Vancouver Island, Canada
Accès au texte intégral et bibtex

titre: Vers une exploitation flexible de documents multimédia
auteur: Mohamed Mbarki, Chantal Soulé-Dupuy, Nathalie Vallès-Parlangeau
article: 23ème Congrès d’INFormatique des ORganisations et Systèmes d’Information et de Décision (INFORSID 2005), May 2005, Grenoble, France. pp.95-112
Accès au bibtex

titre: Rethinking Language Models within the Framework of Dynamic Bayesian Networks
auteur: Murat Deviren, Khalid Daoudi, Kamel Smaïli
article: 18th Conference of the Canadian Society for Computational Studies of Intelligence, Canadian AI 2005, May 2005, Victoria, Canada. pp.432-437
Accès au texte intégral et bibtex

titre: Analyse comparative de classifications : apport des règles d’association floues
auteur: Pascal Cuxac, Martine Cadot, Claire François
article: 5èmes journées d’Extraction et Gestion des connaissances (EGC), CRIP5-SIP – Université René Descartes Paris 5, Jan 2005, Paris, France. pp.519-530
Accès au texte intégral et bibtex

titre: ANTS le système de transcription automatique du LORIA
auteur: Armelle Brun, Christophe Cerisara, Dominique Fohr, Irina Illina, David Langlois, Odile Mella
article: Workshop ESTER, 2005, Avignon, France
Accès au bibtex

titre: Neologos: an optimized database for the development of new speech processing algorithms
auteur: Delphine Charlet, Sacha Krstulovic, Frédéric Bimbot, Olivier Boëffard, Dominique Fohr, Odile Mella, Filip Korkmazsky, Djamel Mostefa, Khalid Choukri, Arnaud Vallée
article: 9th European Conference on Speech Communication and Technology – EUROSPEECH/INTERSPEECH 2005, 2005, France. pp.1549–1552
Accès au bibtex

titre: The predictive self-organizing map : application to speech features extraction
auteur: Bruno Gas, Mohamed Chetouani, Jean-Louis Zarader, Farid Feiz
article: 5th Workshop on Self-Organizing Maps – WSOM’05, 2005, Paris, France
Accès au bibtex

titre: Using phonetic constraints in acoustic-to-articulatory inversion
auteur: Blaise Potard, Yves Laprie
article: 2005, pp.3217-3220
Accès au texte intégral et bibtex

titre: An elitist approach for extracting automatically well-realized speech sounds with high confidence
auteur: Jean-Baptiste Maj, Anne Bonneau, Dominique Fohr, Yves Laprie
article: 2005, pp.2925–2929
Accès au texte intégral et bibtex

titre: Local Word Confidence Measure Using Word Graph and N-Best List
auteur: Joseph Razik, Odile Mella, Dominique Fohr, Jean-Paul Haton
article: 2005, Lisbon, Portugal, pp.3369-3372
Accès au bibtex

titre: The MAP-SPACE denoising algorithm for noise robust speech recognition
auteur: Khalid Daoudi, Christophe Cerisara
article: IEEE Automatic Speech Recognition and Understanding Workshop – ASRU’2005, 2005, San Juan, Puerto Rico. pp.4
Accès au bibtex

titre: Can We Retrieve Vocal Tract Dynamics that Produced Speech? Toward a Speaker Articulatory Strategy Model
auteur: Slim Ouni
article: 2005, pp.1037-1040
Accès au bibtex

titre: Experiments on Speaker Profile Portability
auteur: Vincent Barreaud, Douglas O’Shaughnessy, Jean-Guy Dahan
article: 2005, pp.997-1000
Accès au bibtex

titre: Combination of classifiers for automatic recognition of dialog acts
auteur: Pavel Kral, Christophe Cerisara, Jana Kleckova
article: 2005, pp.825-828
Accès au bibtex

titre: Sentence modality recognition in French based on prosody
auteur: Pavel Kral, Jana Kleckova, Christophe Cerisara
article: 2005, pp.185–188
Accès au bibtex

titre: Visual Contribution to Speech Perception: Measuring the Intelligibility of Talking heads.
auteur: Slim Ouni, Michael V. Cohen, Dominic Massaro, Hope Ishak
article: 2005
Accès au bibtex

titre: Linguistic features weighting for a Text-To-Speech system without prosody model
auteur: Vincent Colotte, Richard Beaufort
article: 2005, pp.2549–2552
Accès au bibtex

titre: From speech to SQL queries : a speech understanding system
auteur: Salma Jamoussi, Kamel Smaïli, Jean-Paul Haton
article: The twentieth national Conference on Artificial Intelligence workshop on spoken language understanding, 2005, Pittsburgh, United States
Accès au texte intégral et bibtex

Book sections

titre: Association Rules and Statistics
auteur: Martine Cadot, Jean-Baptiste Maj, Tarek Ziadé
article: John Wang, Montclair State University, USA. Encyclopedia of Data Warehousing and Mining, Idea Group Inc., pp.74–77, 2005, Vol. 1
Accès au bibtex

Books

titre: Actes des ateliers TAIMA’05
auteur: Faouzi Ghorbel, Stéphane Derrode, Jean-Paul Haton, Lamia Benyoussef
article: 9973-61-039-3, 2005
Accès au bibtex

Reports

titre: Cartes de Kohonen prédictives pour la reconnaissance de phonèmes
auteur: Farid Feiz
article: [Research report] 2005, pp.81
Accès au bibtex

2004

Journal articles

titre: FRET multiphoton spectral imaging microscopy of 7-ketocholesterol and Nile Red in U937 monocytic cells loaded with 7-ketocholesterol.
auteur: Edmond Kahn, Anne Vejux, Dominique Dumas, Thomas Montange, Frédérique Frouin, Vincent Robert, Jean-Marc Riedinger, Jean-François Stoltz, Philippe Gambert, Andrew Todd-Pokropek, Gérard Lizard
article: Analytical and Quantitative Cytology and Histology, 2004, 26 (6), pp.304-13
Accès au bibtex

titre: A Data Cleaning Solution by Perl Scripts for the KDD Cup 2003 Task 2
auteur: Martine Cadot, Joseph Di Martino
article: SIGKDD explorations : newsletter of the Special Interest Group (SIG) on Knowledge Discovery & Data Mining, 2004, 5 (2), pp.158-159
Accès au bibtex

titre: Alpha-Jacobian environmental adaptation
auteur: Christophe Cerisara, Luca Rigazio, Jean-Claude Junqua
article: Speech Communication, 2004, 42 (1), pp.25-41
Accès au bibtex

Conference papers

titre: Inversion acoustique-articulatoire en utilisant des contraintes phonétiques
auteur: Yves Laprie, Blaise Potard
article: Colloque COGNIEST “Perturbations et réajustements : langue et langage”, Dec 2004, Haguenau, France
Accès au texte intégral et bibtex

titre: Analysis of Importance of the prosodic Features for Automatic Sentence Modality Recognition in French in real Conditions
auteur: Pavel Kral, Jana Kleckova, Christophe Cerisara
article: WSEAS International Conference on Electronics, Control and Signal Processing – ICECS’04, Nov 2004, Crete, Greece, pp.1820-1824
Accès au bibtex

titre: Using Phonetic Constraints to improve inversion
auteur: Blaise Potard, Yves Laprie
article: Meeting of the Acoustical Society of America, Nov 2004, San Diego, United States
Accès au bibtex

titre: A concurrent curve strategy for formant tracking
auteur: Yves Laprie
article: Interspeech 2004 – International Conference on Spoken Language Processing, Oct 2004, Jeju, South Korea. 4 p
Accès au texte intégral et bibtex

titre: A complete understanding speech system based on semantic concepts
auteur: Salma Jamoussi, Kamel Smaïli, Dominique Fohr, Jean-Paul Haton
article: 4th International Conference on Language Resources and Evaluation – LREC’04, May 2004, Lisbon, Portugal. 4 p
Accès au bibtex

titre: ANTS : le système de transcription automatique du Loria
auteur: Armelle Brun, Christophe Cerisara, Dominique Fohr, Irina Illina, David Langlois, Odile Mella, Kamel Smaïli
article: JEP 2004 – 25èmes Journées d’Etude sur la Parole, Apr 2004, Fès, Maroc. pp.4
Accès au texte intégral et bibtex

titre: An Effective Lip Tracking Algorithm for Acoustic-to-Articulatory Inversion
auteur: Jingying Chen, Marie-Odile Berger, Yves Laprie
article: 5th International Workshop on Image Analysis for Multimedia – WIAMIS’2004, Apr 2004, Lisbon, Portugal, 3 p
Accès au texte intégral et bibtex

titre: Fiabilité de la référence humaine dans la détection de thème
auteur: Armelle Brun, Kamel Smaïli
article: Traitement Automatique des Langues Naturelles – TALN’2004, Apr 2004, Fès, Maroc. 10 p
Accès au texte intégral et bibtex

titre: Expériences d’inversion basées sur un modèle articulatoire
auteur: Yves Laprie, Blaise Potard, Slim Ouni
article: Journées d’Études sur la Parole – JEP’04, Apr 2004, Fès/Maroc
Accès au texte intégral et bibtex

titre: A computer-assisted learning of English prosody for French students
auteur: Anne Bonneau, Matthieu Camus, Yves Laprie, Vincent Colotte
article: Integrating Speech in Learning (InSTIL 2004), 2004, Venise, Italie, 4 p
Accès au bibtex

titre: Une nouvelle approche de modélisation du langage par des réseaux Bayésiens dynamiques
auteur: Murat Deviren, Khalid Daoudi, Kamel Smaïli
article: XXVes Journées d’Etudes sur la Parole – JEP-TALN-RECITAL 2004, 2004, Fès, Maroc
Accès au texte intégral et bibtex

titre: Synthèse vocale par sélection linguistiquement orientée d’unités non-uniformes : LiONS
auteur: Vincent Colotte, Richard Beaufort
article: Journées d’Etudes sur la Parole – JEP’04, 2004, Fès, Maroc, 4 p
Accès au bibtex

titre: Statistical Feature Language Model
auteur: Kamel Smaïli, Salma Jamoussi, David Langlois, Jean-Paul Haton
article: 8th International Conference on Spoken Language Processing – ICSLP’ 2004, 2004, Jeju, South Korea. 4 p
Accès au texte intégral et bibtex

titre: Compensation en milieu variant abruptement
auteur: Vincent Barreaud, Irina Illina, Dominique Fohr, Vincent Colotte
article: Journées d’Etudes sur la Parole – JEP’04, 2004, Fès, Maroc, 4 p
Accès au bibtex

titre: Experiments on the accuracy of phone models and liaison processing in a French broadcast news transcription system
auteur: Dominique Fohr, Odile Mella, Irina Illina, Christophe Cerisara
article: 8th International Conference on Spoken Language Processing – ICSLP’2004, 2004, Jeju, Corée du Sud
Accès au bibtex

titre: Using Linear Interpolation to Improve Histogram Equalization for Speech Recognition
auteur: Filipp Korkmazsky, Dominique Fohr, Irina Illina
article: 8th International Conference on Spoken Language Processing – ICSLP’2004, 2004, Jeju, Corée du Sud, 4 p
Accès au bibtex

titre: Experiments on Building Language Resources for Multi-Modal Dialogue Systems
auteur: Laurent Romary, Amalia Todirascu, David Langlois
article: International Conference on Language Resources and Evaluation – LREC’2004, 2004, Lisbonne, Portugal. 4 p
Accès au texte intégral et bibtex

titre: A Robust Lip Tracking System for the Acoustic to Articulatory Inversion
auteur: Jingying Chen, Yves Laprie, Marie-Odile Berger
article: 6th IASTED International Conference on Signal and Image Processing – SIP’2004, 2004, Honolulu, Hawaii, USA, 6 p
Accès au texte intégral et bibtex

titre: Un système de compréhension automatique de la parole pour l’interrogation orale d’une base de données de bourse
auteur: Salma Jamoussi, Kamel Smaïli, Dominique Fohr, Jean-Paul Haton
article: Journées d’Etudes sur la Parole – JEP’04, 2004, Fès, Maroc. 4 p
Accès au texte intégral et bibtex

titre: Exploiting models intrinsic robustness for noisy speech recognition
auteur: Christophe Cerisara, Dominique Fohr, Odile Mella, Irina Illina
article: 8th International Conference on Spoken Language Processing – ICSLP’2004, 2004, Jeju, Corée du Sud, 4 p
Accès au texte intégral et bibtex

titre: Development of new telephone speech databases for French: the NEOLOGOS Project
auteur: Elisabeth Pinto, Delphine Charlet, Hélène François, Djamel Mostefa, Olivier Boëffard, Dominique Fohr, Odile Mella, Frédéric Bimbot, Khalid Choukri, Yann Philip, Francis Charpentier
article: 4th International Conference on Language Resources and Evaluation – LREC’04, 2004, Lisbonne, Portugal. 4 p
Accès au bibtex

titre: Détection automatique de sons bien réalisés
auteur: Yves Laprie, Safaa Jarifi, Anne Bonneau, Dominique Fohr
article: Actes des XXVes Journées d’Étude sur la Parole – JEP’2004, 2004, Fès, Maroc, 4 p
Accès au texte intégral et bibtex

titre: Hidden Factor Dynamic Bayesian Networks for Speech Recognition
auteur: Filipp Korkmazsky, Murat Deviren, Dominique Fohr, Irina Illina
article: 8th International Conference on Spoken Language Processing – ICSLP’2004, 2004, Jeju, Corée du Sud, 4 p
Accès au bibtex

titre: Réduction d’un jeu de règles d’association par des méta-règles issues de la logique de “sens commun”
auteur: Martine Cadot, Joseph Di Martino, Amedeo Napoli
article: 4èmes journées d’Extraction et de Gestion des Connaissances – EGC’2004, G. Hébrail and L. Lebart and J.-M. Petit, 2004, Clermont-Ferrand, France. pp.353
Accès au bibtex

titre: Comparaison de différentes méthodes de classification pour la détection de mots clés en parole continue
auteur: Yassine Benayed, Dominique Fohr, Jean-Paul Haton, Gérard Chollet
article: 7ème Colloque Africain sur la Recherche en Informatique – CARI’04, 2004, Hammamet, Tunisie. 8 p
Accès au bibtex

titre: Une nouvelle architecture de compensation du bruit pour la reconnaissance robuste de la parole
auteur: Khalid Daoudi, Murat Deviren
article: XXVes Journées d’Etudes sur la Parole – JEP-TALN-RECITAL 2004, 2004, Fès, Maroc, 4 p
Accès au texte intégral et bibtex

titre: Segmentation Parole/Musique pour la transcription automatique
auteur: Joseph Razik, Dominique Fohr, Odile Mella, Nathalie Parlangeau-Vallès
article: Actes des XXVes Journées d’Etude sur la Parole – JEP’2004, 2004, Fès, Maroc. 4 p
Accès au texte intégral et bibtex

titre: The Automatic News Transcription System: ANTS some Real Time experiments
auteur: Irina Illina, Dominique Fohr, Odile Mella, Christophe Cerisara
article: 8th International Conference on Spoken Language Processing – ICSLP’ 2004, 2004, Jeju, Corée du Sud, 4 p
Accès au bibtex

titre: Language modeling using dynamic Bayesian networks
auteur: Murat Deviren, Khalid Daoudi, Kamel Smaïli
article: 4th International Conference on Language Resources and Evaluation – LREC 2004, 2004, Lisbonne, Portugal
Accès au texte intégral et bibtex

titre: Using confidence measure for keyword detection in continuous speech recognition
auteur: Yassine Benayed, Dominique Fohr, Jean-Paul Haton, Gérard Chollet
article: Conférence Internationale sur l’accès Intelligent aux Documents Multimédia sur l’Internet – Medinet’04, 2004, Tozeur, Tunisie. 10 p
Accès au bibtex

Book sections

titre: Continuous Speech Recognition Using Dynamic Bayesian Networks : A Fast Decoding Algorithm
auteur: Murat Deviren, Khalid Daoudi
article: Gamez, José and Moral, Serafin and Salmeron, Antonio. Advances in Bayesian Networks, Springer Physica Verlag, pp.289-307, 2004, Studies in Fuzziness and Soft Computing
Accès au bibtex

titre: Event coreference and discourse relations
auteur: Laurence Danlos, Bertrand Gaiffe
article: L. Kulda. Language, Music and Cognition, Kluwer Academic Publishers, 2004
Accès au texte intégral et bibtex

2003

Journal articles

titre: Statistical Language Modeling Based on Variable-Length Sequences
auteur: Imed Zitouni, Kamel Smaïli, Jean-Paul Haton
article: Computer Speech and Language, 2003, 17 (1), pp.27-41
Accès au bibtex

titre: Événements impossibles en modélisation stochastique du langage
auteur: David Langlois, Armelle Brun, Kamel Smaïli, Jean-Paul Haton
article: Revue TAL : traitement automatique des langues, 2003, 44 (1), pp.33-61
Accès au bibtex

titre: Dynamic Bayesian Networks for multi-band automatic speech recognition
auteur: Khalid Daoudi, Dominique Fohr, Christophe Antoine
article: Computer Speech and Language, 2003, 17 (2-3), pp.263-285. ⟨10.1016/S0885-2308(03)00011-1⟩
Accès au texte intégral et bibtex

Conference papers

titre: Improving the Performance of a Keyword Spotting System by Using Support Vector Machines
auteur: Yassine Benayed, Dominique Fohr, J.P. Haton, Gérard Chollet
article: IEEE Automatic Speech Recognition and Understanding Workshop – ASRU’2003, Dec 2003, St. Thomas, U.S. Virgin Islands. 5 p, ⟨10.1109/ASRU.2003.1318419⟩
Accès au bibtex

titre: Inversion experiments based on a descriptive articulatory model
auteur: Yves Laprie, Slim Ouni, Blaise Potard, Shinji Maeda
article: International Seminar on Speech Production, Dec 2003, Sydney, Australie
Accès au texte intégral et bibtex

titre: On-line compensation for non-stationary noise
auteur: Vincent Barreaud, Irina Illina, Dominique Fohr
article: Automatic Speech Recognition and Understanding Workshop – ASRU’2003, Nov 2003, St Thomas, US Virgin Islands, 4 p
Accès au bibtex

titre: Description d’un système de compréhension automatique de la parole
auteur: Salma Jamoussi, Kamel Smaïli, Jean-Paul Haton
article: Troisièmes Ateliers en Traitement et Analyse d’Images : Méthodes et Applications – TAIMA’03, Oct 2003, Hammamet, Tunisie. 6 p
Accès au texte intégral et bibtex

titre: Comparing the Order of a Polynomial Phase Model for the Synthesis of Quasi-Harmonic Audio Signals
auteur: Laurent Girin, Sylvain Marchand, Joseph Di Martino, Axel Röbel, Geoffroy Peeters
article: IEEE Workshop on Applications of Signal Processing to Audio and Acoustics – WASPAA’03, Oct 2003, New York, United States. pp.193-196
Accès au bibtex

titre: Towards Missing Data Recognition with Cepstral Features
auteur: Christophe Cerisara
article: 8th European Conference on Speech Communication and Technology – EUROSPEECH’03, Sep 2003, Geneva, Switzerland, 4 p
Accès au texte intégral et bibtex

titre: Efficient linear combination for distant n-gram models
auteur: David Langlois, Kamel Smaïli, Jean-Paul Haton
article: 8th European Conference on Speech Communication and Technology – Eurospeech’03, Sep 2003, Genève, Switzerland. pp.409-412
Accès au texte intégral et bibtex

titre: Robust speech recognition to non-stationary and unpredictable noise based on model-driven approaches
auteur: Christophe Cerisara, Irina Illina
article: 8th European Conference on Speech Communication and Technology – EUROSPEECH’03, Sep 2003, Geneva, Switzerland, 4 p
Accès au texte intégral et bibtex

titre: Understanding process for speech recognition
auteur: Salma Jamoussi, Kamel Smaïli, Jean-Paul Haton
article: Eighth European Conference on Speech Communication and Technology – EuroSpeech’03, Sep 2003, Genève, Suisse. 4 p
Accès au texte intégral et bibtex

titre: Understanding speech based on a Bayesian concept extraction method
auteur: Salma Jamoussi, Kamel Smaïli, Jean-Paul Haton
article: Sixth International Conference on Text Speech and Dialogue – TSD’03, Sep 2003, České Budějovice, Czech Republic. 8 p
Accès au texte intégral et bibtex

titre: Elitist identification of stops from formant transitions
auteur: Anne Bonneau, Yves Laprie
article: 15th International Congress of Phonetic Sciences – ICPhS’2003, Aug 2003, Barcelona, Spain, 4 p
Accès au bibtex

titre: On-Line Frame-Synchronous Noise Compensation
auteur: Vincent Barreaud, Irina Illina, Dominique Fohr
article: The 15th International Congress of Phonetic Sciences – ICPhS 2003, Aug 2003, Barcelone, Espagne, 4 p
Accès au bibtex

titre: Audio Indexing on the Web: a Preliminary Study of Some Audio Descriptors
auteur: Nathalie Vallès-Parlangeau, Jérôme Farinas, Dominique Fohr, Irina Illina, Ivan Magrin-Chagnolleau, Odile Mella, Julien Pinquier, Jean-Luc Rouas, Christine Sénac
article: 7th World Multiconference on Systemics, Cybernetics and Informatics (SCI 2003), Jul 2003, Orlando, Florida, United States. pp.1-4
Accès au texte intégral et bibtex

titre: Comparison of Two Speech/Music Segmentation Systems For Audio Indexing on the Web
auteur: Joseph Razik, Christine Sénac, Dominique Fohr, Odile Mella, Nathalie Vallès-Parlangeau
article: 7th World Multiconference on Systemics, Cybernetics and Informatics, SCI 2003, Jul 2003, Orlando, Florida, United States. pp.1-6
Accès au bibtex

titre: Speech signal resampling by arbitrary rate
auteur: Sen Zhang, Yves Laprie
article: 7th International Symposium on Signal Processing and its Application 2003 – ISSPA’2003, Jul 2003, Paris, France, 4 p
Accès au texte intégral et bibtex

titre: A New Keyword Spotting Approach Based on Reward Function
auteur: Yassine Benayed, Dominique Fohr, J.P. Haton, Gérard Chollet
article: Seventh International Symposium on Signal Processing and Its Applications – ISSPA’2003, Jul 2003, Paris, France. 4 p, ⟨10.1109/ISSPA.2003.1224726⟩
Accès au bibtex

titre: Vers la compréhension automatique de la parole : extraction des concepts par réseaux bayésiens
auteur: Salma Jamoussi, Kamel Smaïli, Jean-Paul Haton
article: Dixième Conférence en Traitement Automatique des Langues Naturelles – TALN’03, Jun 2003, Batz-sur-Mer, France, 10 p
Accès au bibtex

titre: Combining EigenVoices and Structural MLLR for Speaker Adaptation
auteur: Fabrice Lauri, Irina Illina, Dominique Fohr
article: IEEE International Conference on Acoustics, Speech and Signal Processing – ICASSP’03, Apr 2003, Hong Kong, China, 4 p
Accès au bibtex

titre: On-Line Frame-Synchronous Compensation of Non-Stationary noise
auteur: Vincent Barreaud, Irina Illina, Dominique Fohr
article: The 2003 IEEE International Conference on Acoustics, Speech and Signal Processing – ICASSP 2003, Apr 2003, Hong Kong, Chine, 4 p
Accès au bibtex

titre: Confidence Measures for Keyword Spotting using Support Vector Machines
auteur: Yassine Benayed, Dominique Fohr, J.P. Haton, Gérard Chollet
article: IEEE International Conference on Acoustics, Speech and Signal Processing – ICASSP’2003, Apr 2003, Hong Kong, Chine. 4 p, ⟨10.1109/ICASSP.2003.1198849⟩
Accès au bibtex

titre: A new supervised-predictive compensation scheme for noisy speech recognition
auteur: Khalid Daoudi, Murat Deviren
article: 8th European Conference on Speech Communication and Technology – Eurospeech 2003, 2003, Geneva, Switzerland, 4 p
Accès au bibtex

titre: Frequency and Wavelet Filtering for Robust Speech Recognition
auteur: Murat Deviren, Khalid Daoudi
article: Artificial Neural Networks and Neural Information Processing – Joint International Conference ICANN/ICONIP2003, 2003, Istanbul, Turquie, pp.452-462
Accès au bibtex

titre: Text-to-pinyin conversion based on contextual knowledge and D-tree for Mandarin
auteur: Sen Zhang, Yves Laprie
article: IEEE International Conference on Natural Language Processing and Knowledge Engineering 2003 – NLP-KE’2003, 2003, Beijing, China, 6 p
Accès au texte intégral et bibtex

titre: Nouvelle approche de la sélection de vocabulaire pour la détection de thème
auteur: Armelle Brun, Kamel Smaïli, Jean-Paul Haton
article: Traitement Automatique du Langage Naturel – TALN’2003, 2003, Batz-sur-Mer, France, 10 p
Accès au texte intégral et bibtex

titre: Structural State-Based Frame Synchronous Compensation
auteur: Vincent Barreaud, Irina Illina, Dominique Fohr, Filipp Korkmazsky
article: European Conference on Speech Communication and Technologies – Eurospeech’03, 2003, Genève, Suisse, 4 p
Accès au bibtex

titre: Internationalization of a Talking Head
auteur: Slim Ouni, Michael M. Cohen, Dominic Massaro, Karl Young, Alexandra Jesse
article: 2003
Accès au bibtex

titre: A study of the French Vowels Through The Main Constriction of the Vocal Tract Using an Acoustic-to-articulatory inversion method
auteur: Slim Ouni, Yves Laprie
article: 15th International Congress of Phonetic Sciences 2003 – ICPhS’2003, 2003, Barcelone, Espagne, 4 p
Accès au texte intégral et bibtex

2002

Journal articles

titre: Signal Representation and Segmentation based on Multifractal Stationarity
auteur: Khalid Daoudi, Jacques Lévy Véhel
article: Signal Processing, 2002, 82 (12), pp.2015-2024. ⟨10.1016/S0165-1684(02)00198-6⟩
Accès au texte intégral et bibtex

Conference papers

titre: Continuous Speech Recognition Using Dynamic Bayesian Networks: A Fast Decoding Algorithm
auteur: Murat Deviren, Khalid Daoudi
article: First European Workshop on Probabilistic Graphical Models – PGM’02, Nov 2002, Cuenca, Spain, 9 p
Accès au bibtex

titre: Continuous Speech Recognition using Structural Learning of Dynamic Bayesian Networks
auteur: Murat Deviren, Khalid Daoudi
article: XI European Signal Processing Conference – EUSIPCO 2002, Sep 2002, Toulouse, France, 4 p
Accès au bibtex

titre: Support Vector Machines for Keyword Spotting
auteur: Yassine Benayed, Dominique Fohr, Jean-Paul Haton, Gérard Chollet
article: International Workshop Speech and Computer – SPECOM’2002, Sep 2002, St-Petersburg, Russia. 4 p
Accès au bibtex

titre: Retrieving phrases by selecting the history: application to Automatic Speech Recognition
auteur: David Langlois, Kamel Smaïli, Jean-Paul Haton
article: 7th International Conference on Spoken Language Processing – ICSLP’2002, Sep 2002, Denver, USA, pp.721
Accès au bibtex

titre: Recognition and Rejection Performance in Wordspotting Systems Using Support Vector Machines
auteur: Yassine Benayed, Dominique Fohr, Jean-Paul Haton, Gérard Chollet
article: 2nd WSEAS International Conference on Signal, Speech and Image Processing – WSEAS ICOSSIP’2002, Sep 2002, Koukounaries, Skiathos Island, Greece. 6 p
Accès au bibtex

titre: Recognition and Rejection Performance in Wordspotting Systems Using Hidden Markov modeling techniques
auteur: Yassine Benayed, Dominique Fohr, Jean-Paul Haton, Gérard Chollet
article: International Workshop Speech and Computer – SPECOM’2002, Sep 2002, St-Petersburg, Russia. 4 p
Accès au bibtex

titre: A copy synthesis method to pilot the Klatt synthesiser
auteur: Yves Laprie, Anne Bonneau
article: International Conference on Speech and Language Processing, Sep 2002, Denver, USA, 4 p
Accès au texte intégral et bibtex

titre: Contribution to Topic Identification by Using Word Similarity
auteur: Armelle Brun, Kamel Smaïli, Jean-Paul Haton
article: 7th International Conference on Spoken Language Processing – ICSLP’2002, Sep 2002, Denver, Colorado, USA, 4 p
Accès au bibtex

titre: Dynamic Bayesian Networks for Automatic Speech Recognition
auteur: Murat Deviren
article: Eighteenth National Conference on Artificial Intelligence, AAAI 2002, SIGART/AAAI Doctoral Consortium, Jul 2002, Edmonton, Alberta, Canada, 1 p
Accès au bibtex

titre: Identification thématique hiérarchique : Application aux forums de discussions
auteur: Brigitte Bigi, Kamel Smaïli
article: 9ème conférence annuelle sur le Traitement Automatique des Langues Naturelles – TALN’02, Jun 2002, Nancy, France. pp.24-27
Accès au texte intégral et bibtex

titre: Réseaux Bayésiens Dynamiques pour la Reconnaissance Multi-Bandes de la Parole
auteur: Khalid Daoudi, Dominique Fohr, Christophe Antoine
article: XXIVe Journées d’Etudes sur la Parole – JEP’2002, Equipe Parole – LORIA, Jun 2002, Nancy, France, 4 p
Accès au texte intégral et bibtex

titre: Segmentation du bruit d’explosion des occlusives
auteur: Yves Laprie, Anne Bonneau
article: XXIVe Journées d’Etude sur la Parole – JEP’2002, Jun 2002, Nancy, France, 4 p
Accès au texte intégral et bibtex

titre: A platform for the diagnosis of auditory deficiency
auteur: Anne Bonneau, Parham Mokhtari
article: 4th International Workshop on Enterprise Networking and Computing in Health Care Industry – Healthcom 2002, Jun 2002, Nancy, France, 4 p
Accès au bibtex

titre: Détection de séquences par sélection de l’historique : application à la reconnaissance automatique de la parole
auteur: David Langlois, Kamel Smaïli, Jean-Paul Haton
article: XXIVe Journées d’Etudes sur la Parole – JEP’2002, Jun 2002, Nancy, France, pp.301
Accès au texte intégral et bibtex

titre: Apprentissage de structures de réseaux bayésiens dynamiques pour la reconnaissance de la parole
auteur: Murat Deviren, Khalid Daoudi
article: XXIVèmes Journées d’Études sur la Parole – JEP’2002, Jun 2002, Nancy, France, pp.293-296
Accès au bibtex

titre: Reconnaissance de la parole pour les locuteurs non natifs en présence de bruit
auteur: Dominique Fohr, Odile Mella, Irina Illina, Fabrice Lauri, Christophe Cerisara, Christophe Antoine
article: XXIVèmes Journées d’Etude sur la Parole – JEP’02, Jun 2002, Nancy, France, pp.297-301
Accès au bibtex

titre: WSIM : une méthode de détection de thème fondée sur la similarité entre mots
auteur: Armelle Brun, Kamel Smaïli, Jean-Paul Haton
article: Traitement Automatique des Langues Naturelles – TALN’2002, Jun 2002, Nancy, France, 10 p
Accès au bibtex

titre: Comparaison de SMLLR et de SMAP pour une adaptation au locuteur en utilisant des modèles acoustiques markoviens
auteur: Fabrice Lauri, Irina Illina, Dominique Fohr
article: XXIVe Journées d’Etude sur la Parole – JEP’02, Jun 2002, Nancy, France, pp.289-292
Accès au bibtex

titre: Automatic Speech Recognition: the New Millennium
auteur: Khalid Daoudi
article: International Conference on Industrial and Engineering Application of Artificial Intelligence and Expert Systems – IEA/AIE’2002, Jun 2002, Cairns, Australia, pp.253-263
Accès au texte intégral et bibtex

titre: Audio-Indexing : what has been accomplished and the road ahead
auteur: Ivan Magrin-Chagnolleau, Nathalie Vallès-Parlangeau
article: 6th International Joint Conference on Information Sciences (JCIS 2002), Mar 2002, Durham, North Carolina, United States. pp.911-914
Accès au texte intégral et bibtex

titre: Modifying speech to improve the perception of L2
auteur: Vincent Colotte, Yves Laprie, Anne Bonneau
article: Integrating speech technology in learning – INSTIL 2002, Mar 2002, Davis, Ca, USA, 1 p
Accès au bibtex

titre: Dynamic estimation of a noise over estimation factor for Jacobian-based adaptation
auteur: Christophe Cerisara, Jean-Claude Junqua, Luca Rigazio
article: IEEE International Conference on Acoustics, Speech, and Signal Processing – ICASSP 2002, 2002, Orlando, Florida, 4 p
Accès au texte intégral et bibtex

titre: Introduction de contraintes pour l’inversion acoustico-articulatoire utilisant une table hypercubique
auteur: Yves Laprie, Slim Ouni
article: XXIVèmes Journées d’Etude sur la Parole – JEP 2002, 2002, Nancy, France
Accès au texte intégral et bibtex

titre: Higher precision pitch marking for TD-PSOLA
auteur: Vincent Colotte, Yves Laprie
article: XI European Signal Processing Conference – EUSIPCO 2002, 2002, Toulouse, France
Accès au texte intégral et bibtex

titre: Statistical Adaptation of Acoustic Models to Noise Conditions for Robust Speech Recognition
auteur: Angel de La Torre, Dominique Fohr, Jean-Paul Haton
article: International Conference on Spoken Language Processing – ICSLP 2002, 2002, Denver, USA, pp.1437-1440
Accès au bibtex

titre: On The Use of High Order Derivatives for High Performance Alphabet Recognition
auteur: Joseph Di Martino
article: International Conference on Acoustics Speech and Signal Processing – ICASSP 2002, 2002, Orlando, Florida, United States. 4 p
Accès au texte intégral et bibtex

titre: Dynamic Topic Identification : Introduction of Trigger pairs in the Cache Model
auteur: Brigitte Bigi, Salma Jamoussi, Kamel Smaïli
article: International Workshop Speech and Computer 2002 – SPECOM’2002, 2002, St-Petersburg, Russia, 4 p
Accès au bibtex

titre: Amélioration de la précision de la resynthèse avec TD-PSOLA
auteur: Vincent Colotte, Yves Laprie
article: XXIVème Journées d’Etude sur la Parole – JEP 2002, 2002, Nancy, France
Accès au texte intégral et bibtex

titre: Fast Channel and Noise Compensation in the Spectral Domain
auteur: Christophe Cerisara, Dominique Fohr
article: XI European Signal Processing Conference – EUSIPCO 2002, 2002, Toulouse, France, 4 p
Accès au texte intégral et bibtex

titre: Introduction of constraints in an acoustic-to-articulatory inversion
auteur: Yves Laprie, Slim Ouni
article: 7th International Conference on Spoken Language Processing – ICSLP 2002, 2002, Denver, USA
Accès au texte intégral et bibtex

titre: Un Algorithme de Réduction de la Réverbération de Signaux Issus du Vocoder de Phase
auteur: Joseph Di Martino, Yves Laprie
article: XXIVe Journées d’Etude sur la Parole – JEP 2002, 2002, Nancy, France. 4 p
Accès au texte intégral et bibtex

titre: Tree-Structured Maximum a Posteriori Adaptation for a Segment-Based Speech Recognition System
auteur: Irina Illina
article: 7th International Conference on Spoken Language Processing – ICSLP’02, 2002, Denver, Colorado, USA, 4 p
Accès au bibtex

titre: Neural Network and Information Theory In Automatic Speech Understanding
auteur: Salma Jamoussi, Kamel Smaïli, Jean-Paul Haton
article: SPECOM 2002 – International Workshop Speech and Computer, 2002, St-Petersburg, Russia. pp.1-4
Accès au texte intégral et bibtex

Book sections

titre: Méthodes robustes pour la reconnaissance automatique de la parole
auteur: Jean-Paul Haton
article: Pierre Escudier, Jean-Luc Schwartz. La parole, des modèles cognitifs aux machines communicantes, J. Mariani, 2002
Accès au bibtex

Other publications

titre: The WinSnoori user’s manual version 1.32
auteur: Yves Laprie
article: 2002
Accès au texte intégral et bibtex

Reports

titre: Projet RAIVES (Recherche Automatique d’Informations Verbales Et Sonores) vers l’extraction et la structuration de données radiophoniques sur Internet
auteur: Nathalie Vallès-Parlangeau, Ivan Magrin-Chagnolleau, Dominique Fohr, Irina Illina, Odile Mella, Kamel Smaïli, Christine Sénac, Jérôme Farinas, Julien Pinquier, Jean-Luc Rouas, Régine André-Obrecht, François Pellegrino, David Janiszek
article: [Contrat] A02-R-553 || parlangeau-valles02a, IRIT – Institut de recherche en informatique de Toulouse; LORIA (Université de Lorraine, CNRS, INRIA). 2002
Accès au texte intégral et bibtex

titre: Comparaison d’un réseau de neurones artificiel et d’une méthode statistique pour la classification sémantique
auteur: Salma Jamoussi, Kamel Smaïli, Jean-Paul Haton
article: [Interne] A02-R-034 || jamoussi02a, 2002, 10 p
Accès au bibtex

titre: Le prototype SAALSA : Automatisation de la post-synchronisation
auteur: Dominique Fohr, Odile Mella
article: [Contrat] A02-R-487 || fohr02b, 2002, 16 p
Accès au bibtex

titre: Détection de mots clés dans un flux de parole par les modèles de Markov cachés
auteur: Yassine Benayed, Dominique Fohr, Jean-Paul Haton, Gérard Chollet
article: [Interne] A02-R-033 || benayed02e, INRIA. 2002, 10 p
Accès au bibtex

2001

Journal articles

titre: Multi-band automatic speech recognition
auteur: Christophe Cerisara, Dominique Fohr
article: Computer Speech and Language, 2001, 15 (2), pp.151-174
Accès au bibtex

titre: An alternative scheme for perplexity estimation and its assessment for the evaluation of language models
auteur: Frédéric Bimbot, Marc El Bèze, Stéphane Igounet, Michèle Jardino, Kamel Smaïli, Imed Zitouni
article: Computer Speech and Language, 2001, 15 (1), pp.1-13. ⟨10.1006/csla.2000.0150⟩
Accès au bibtex

titre: Modélisation du langage pour les systèmes de reconnaissance de la parole : Application à MAUD
auteur: Imed Zitouni
article: In Cognito – Cahiers Romans de Sciences Cognitives, 2001, 20, pp.43-44
Accès au bibtex

Conference papers

titre: Continuous Multi-Band Speech Recognition using Bayesian Networks
auteur: Khalid Daoudi, Dominique Fohr, Christophe Antoine
article: IEEE Automatic Speech Recognition and Understanding Workshop – ASRU’2001, IEEE, Dec 2001, Trento, Italy, 4 p
Accès au texte intégral et bibtex

titre: A Hierarchical Approach for Topic Identification
auteur: Brigitte Bigi, Armelle Brun, Kamel Smaïli, Jean-Paul Haton
article: Proceedings of the International Workshop Speech and Computer – SPECOM’01, Nov 2001, Moscow, Russia. 4 p
Accès au texte intégral et bibtex

titre: Structural Learning of Dynamic Bayesian Networks in Speech Recognition
auteur: Murat Deviren, Khalid Daoudi
article: 7th European Conference on Speech Communication and Technology – EUROSPEECH’2001, Sep 2001, Aalborg, Denmark, 4 p
Accès au bibtex

titre: Modeling dependency between regression classes in MLLR using multiscale autoregressive models
auteur: Christophe Cerisara, Khalid Daoudi
article: ISCA Workshop on Adaptation methods for speech recognition, Aug 2001, Sophia-Antipolis, France, 4 p
Accès au bibtex

titre: Implantation d’une carte associative pour l’orientation d’un robot autonome à l’aide d’une image vidéo
auteur: Salma Jamoussi, Frédéric Alexandre
article: Traitement et Analyse d’Images Méthodes et Applications – TAIMA’01, Jun 2001, Hammamet, Tunisie, 6 p
Accès au bibtex

titre: Language-specific knowledge and the perception of tonal contrasts in Italian and English
auteur: Mariapaola D’Imperio
article: 141st Meeting of the Acoustical Society of America, Jun 2001, Chicago, Illinois, USA, pp.2475
Accès au bibtex

titre: A Bayesian network for time-frequency speech modeling and recognition
auteur: Khalid Daoudi, Dominique Fohr, Christophe Antoine
article: International Conference on Artificial Intelligence and Soft Computing, May 2001, Cancun, Mexico, 5 p
Accès au bibtex

titre: Environmental adaptation based on first order approximation
auteur: Christophe Cerisara, Luca Rigazio, Robert Boman, Jean-Claude Junqua
article: International Conference on Acoustics, Speech, and Signal Processing – ICASSP 2001, May 2001, Salt Lake City, USA, 4 p
Accès au texte intégral et bibtex

titre: Studying articulatory effects through hypercube sampling of the articulatory space
auteur: Slim Ouni, Yves Laprie
article: 17th International Congress on Acoustics, 2001, Rome, Italy, 2 p
Accès au bibtex

titre: Document Structuring à la SDRT
auteur: Laurence Danlos, Bertrand Gaiffe, Laurent Roussarie
article: Proceedings of the European Workshop on Generation, ACL, Toulouse, 2001, Toulouse, France
Accès au texte intégral et bibtex

titre: Language-specific knowledge and syllable structure effects in the perception of tonal contrast
auteur: Mariapaola D’Imperio
article: Invited Talk, 2001, Max Planck Institute for Psycholinguistics
Accès au bibtex

titre: Improving Statistical Language Models by Removing Impossible Events
auteur: Armelle Brun, David Langlois, Kamel Smaïli, Jean-Paul Haton
article: Proceedings of the International Workshop “Speech and Computer” – SPECOM 2001, 2001, Moscow, Russia, 4 p
Accès au bibtex

titre: A comparison of different methods for noise adaptation in a HMM-based speech recognition system
auteur: Christophe Cerisara, Dominique Fohr, Irina Illina, Fabrice Lauri, Odile Mella
article: International Congress on Acoustics, 2001, Rome, Italy, 2 p
Accès au bibtex

titre: Efficient Language Models Combination: Application to Phrase Finding
auteur: David Langlois, Kamel Smaïli, Jean-Paul Haton
article: Proceedings of the International Workshop “Speech and Computer” – SPECOM 2001, 2001, Moscow, Russia, 4 p
Accès au bibtex

titre: A New Method Based on Context for Combining Statistical Language Models
auteur: David Langlois, Kamel Smaïli, Jean-Paul Haton
article: Third International Conference on Modeling and Using Context – CONTEXT 01, 2001, Dundee, Scotland, pp.235-247
Accès au bibtex

titre: Statistical Language Model based on a Hierarchical Approach : MCnv
auteur: Imed Zitouni, Kamel Smaïli, Jean-Paul Haton
article: 7th European Conference on Speech Communication and Technology – EUROSPEECH 2001, 2001, Aalborg, Denmark, pp.29
Accès au bibtex

titre: Dynamic Topic Identification: Towards Combination of Methods
auteur: Brigitte Bigi, Armelle Brun, Jean-Paul Haton, Kamel Smaïli, Imed Zitouni
article: Recent Advances in Natural Language Processing – RANLP’2001, Galia Angelova, Kalima Bontcheva, Ruslan Mitkov, Nicolas Nicolov, Nikolai Nikolov, 2001, Tzigov Chark, Bulgaria, pp.255-257
Accès au bibtex

titre: The Role of Perception in Defining Tonal Targets and their Alignment
auteur: Mariapaola D’Imperio
article: Invited Talk, Martine Grice, 2001, Department of Linguistics, University of Saarbruecken, Germany
Accès au bibtex

titre: Burst segmentation and evaluation of acoustic cues
auteur: Yves Laprie, Anne Bonneau
article: 7th European Conference on Speech Communication and Technology – EUROSPEECH’2001, 2001, Aalborg, Denmark, 4 p
Accès au bibtex

titre: Exploring the Null Space of the Acoustic-to-Articulatory Inversion Using a Hypercube Codebook
auteur: Slim Ouni, Yves Laprie
article: 7th European Conference on Speech Communication and Technology – EUROSPEECH’2001, 2001, Aalborg, Denmark, pp.277-280
Accès au bibtex

titre: Perceptual experiments on enhanced and slowed down speech sentences for second language acquisition
auteur: Vincent Colotte, Yves Laprie, Anne Bonneau
article: European Conference on Speech Communication and Technology, 2001, Aalborg, Denmark, 4 p
Accès au bibtex

titre: Signal transformation strategies to improve speech intelligibility for second language acquisition
auteur: Vincent Colotte, Yves Laprie, Anne Bonneau
article: 17th International Congress on Acoustics, 2001, Rome, Italy, 2 p
Accès au bibtex

titre: A comparative study of Topic Identification on Newspaper and E-mail
auteur: Brigitte Bigi, Armelle Brun, Jean-Paul Haton, Kamel Smaïli, Imed Zitouni
article: Proceedings of the 8th International Symposium on String Processing and Information Retrieval – SPIRE’01, 2001, Laguna de San Rafael, Chile, pp.238-241
Accès au texte intégral et bibtex

titre: Tonal alignment, scaling and slope in Italian question and statement tunes
auteur: Mariapaola D’Imperio
article: 7th European Conference on Speech Communication and Technology – EUROSPEECH ’01, 2001, Aalborg, Denmark, 4 p
Accès au bibtex

titre: Structural Maximum a Posteriori Adaptation for Mixture Stochastic Trajectory Framework
auteur: Irina Illina, Djamel Mostefa
article: International Workshop on Adaptation Methods for Automatic Speech Recognition, Eurecom, 2001, Sophia Antipolis, France, 4 p
Accès au bibtex

titre: Environment-adaptive algorithms for robust speech recognition
auteur: Jean-Claude Junqua, Christophe Cerisara, Luca Rigazio, David Kryze
article: International Workshop on Hands-Free Speech Communication – HSC 2001, 2001, Kyoto, Japan. pp.4
Accès au bibtex

titre: Word recognition for all: application to speech training
auteur: Marie-Christine Haton, Jean-Paul Haton
article: Universal Access in Human-Computer Interaction, 2001, New Orleans, Louisiana, USA, pp.329-333
Accès au bibtex

titre: On the comparison of front-ends for robust speech recognition in car environments
auteur: Angel de La Torre, Dominique Fohr, Jean-Paul Haton
article: ISCA ITR Workshop: adaptation methods for speech recognition, 2001, Sophia-Antipolis, France, pp.105-108
Accès au bibtex

titre: Adaptation MLLR pour des HMMs
auteur: Fabrice Lauri, Irina Illina, Dominique Fohr
article: Quatrièmes Rencontres Jeunes Chercheurs en Parole – RJC’2001, 2001, Mons, Belgium, pp.90-93
Accès au texte intégral et bibtex

titre: Suppression of Phasiness for Time-Scale Modifications of Speech Signals Based on a Shape Invariance Property
auteur: Joseph Di Martino, Yves Laprie
article: International Conference on Acoustics, Speech, and Signal Processing – ICASSP 2001, IEEE, 2001, Salt Lake City, United States. pp.853-856
Accès au bibtex

titre: A face-to-muscle inversion of a biomechanical face model for audiovisual and motor control research
auteur: Michel Pitermann, Kevin G. Munhall
article: 7th European Conference on Speech Communication and Technology – EUROSPEECH’2001, ISCA, 2001, Aalborg, Denmark, 4 p
Accès au bibtex

Book sections

titre: Towards a Strategy for ToBI labelling varieties of Italian
auteur: Martine Grice, Mariapaola D’Imperio, Michelina Savino, Cinzia Avesani
article: Jun, Sun-Ah. Prosodic Typology and Transcription: A Unified Approach, Oxford University Press, 23 p, 2001
Accès au bibtex

2000

Journal articles

titre: Identification of vocalic features from French stop bursts
auteur: Anne Bonneau
article: Journal of Phonetics, 2000, 28, pp.495-502
Accès au bibtex

Conference papers

titre: Discarding Impossible Events from Statistical Language Models
auteur: Armelle Brun, David Langlois, Kamel Smaïli, Jean-Paul Haton
article: International Conference on Spoken Language Processing, Oct 2000, Beijing, China, 4 p
Accès au texte intégral et bibtex

titre: A new approach for multi-band speech recognition based on probabilistic graphical models
auteur: Khalid Daoudi, Dominique Fohr, Christophe Antoine
article: ICSLP, Oct 2000, Beijing, China, 4 p
Accès au texte intégral et bibtex

titre: Variable-Length Class Sequences Based on a Hierarchical Approach: MCnv
auteur: Imed Zitouni, Kamel Smaïli, Jean-Paul Haton
article: SCI 2000 – 4th World Multiconference on Systemics, Cybernetics & Informatics, Jul 2000, Orlando, United States. pp.6
Accès au texte intégral et bibtex

titre: A tool for the synchronization of speech and mouth shapes: LIPS
auteur: Odile Mella, Dominique Fohr, Laurent Martin, Andreas Carlen
article: Sixth International Conference on Spoken Language Processing – ICSLP 2000, 2000, Beijing, China, 4 p
Accès au bibtex

titre: Un diagnostic phonétique pour les déficiences auditives
auteur: Anne Bonneau, Parham Mokhtari
article: Journées d’étude sur la parole, Institut de la Communication Parlée (I.C.P.), 2000, Aussois, Savoie, France, 4 p
Accès au bibtex

titre: Topic Identification Challenge Based on Short Word History
auteur: Armelle Brun, Kamel Smaïli, Jean-Paul Haton
article: Traitement Automatique du Langage Naturel – TALN’00, 2000, Lausanne, Switzerland, pp.383-392
Accès au bibtex

titre: Compensation of Noise Effects for Robust Speech Recognition in Car Environments
auteur: Angel de La Torre, Dominique Fohr, Jean-Paul Haton
article: ICSLP’2000, 2000, Beijing, China, 4 p
Accès au bibtex

titre: Modification sélective du débit de parole
auteur: Vincent Colotte, Yves Laprie
article: Reconnaissance des Formes et Intelligence Artificielle – RFIA’2000, 2000, Paris, France, pp.141-148
Accès au bibtex

titre: Detecting relevant acoustic events for piloting improvement of intelligibility
auteur: Vincent Colotte, Yves Laprie
article: European Signal Processing Conference, 2000, Tampere, Finland, 4 p
Accès au texte intégral et bibtex

titre: Improving acoustic-to-articulatory inversion by using hypercube codebooks
auteur: Slim Ouni, Yves Laprie
article: International Conference on Spoken Language Processing – ICSLP2000, 2000, Beijing, China, pp.178-181
Accès au texte intégral et bibtex

titre: Experiment Analysis in Newspaper Topic Detection
auteur: Armelle Brun, Kamel Smaïli, Jean-Paul Haton
article: SPIRE 2000 – String Processing & Information Retrieval, 2000, A Coruña, Spain. pp.55-64
Accès au texte intégral et bibtex

titre: Vers une meilleure modélisation du langage : la prise en compte des séquences dans les modèles statistiques
auteur: Imed Zitouni, Kamel Smaïli
article: XXIIIèmes Journées d’Etude sur la Parole – JEP’2000, 2000, Aussois, France, 4 p
Accès au texte intégral et bibtex

titre: Dealing with distant relationships in natural language modelling for automatic speech recognition
auteur: David Langlois, Kamel Smaïli, Jean-Paul Haton
article: 4th World Multiconference on Systemics, Cybernetics & Informatics – SCI’2000, International Institute of Informatics & Systemics, 2000, Orlando, USA, pp.400-405
Accès au texte intégral et bibtex

titre: Amélioration automatique de l’intelligibilité de la parole
auteur: Vincent Colotte, Yves Laprie
article: Journées d’Etudes de la Parole, Institut de la Communication Parlée, 2000, Aussois, France, pp.105-108
Accès au texte intégral et bibtex

titre: Utilisation d’un dictionnaire hypercubique pour l’inversion acoustico-articulatoire
auteur: Slim Ouni, Yves Laprie
article: 23èmes Journées d’Etudes sur la Parole, 2000, Aussois, France, pp.409-412
Accès au texte intégral et bibtex

titre: The automatic speech recognition engine ESPERE: experiments on telephone speech
auteur: Dominique Fohr, Odile Mella, Christophe Antoine
article: ICSLP, 2000, Beijing, China, 4 p
Accès au bibtex

titre: Towards phonetic tools for speech training
auteur: Anne Bonneau, Yves Laprie, Vincent Colotte
article: Integrating Speech Technology In (language) Learning – InSTIL2000, 2000, Dundee, Scotland, 4 p
Accès au bibtex

titre: Automatic enhancement of speech intelligibility
auteur: Vincent Colotte, Yves Laprie
article: IEEE International Conference on Acoustics, Speech, & Signal Processing – ICASSP’2000, 2000, Istanbul, Turkey
Accès au bibtex

titre: Perceived tone “targets” and pitch accent identification in Italian
auteur: Mariapaola D’Imperio, Jacques Terken, Michel Pitermann
article: 8th International Conference on Speech Science & Technology – SST’2000, 2000, Canberra, Australia, pp.206-211
Accès au bibtex

titre: Transformation of Jacobian matrices for noisy speech recognition
auteur: Christophe Cerisara, Luca Rigazio, Robert Boman, Jean-Claude Junqua
article: ICSLP’2000, 2000, 4 p
Accès au bibtex

titre: Asynchrony in Multi-Band Speech Recognition
auteur: Christophe Cerisara, Dominique Fohr, Jean-Paul Haton
article: IEEE International Conference on Acoustics, Speech, & Signal Processing – ICASSP’2000, 2000, Istanbul, Turkey, 4 p
Accès au texte intégral et bibtex

titre: Beyond the Conventional Statistical Language Models: The Variable-Length Sequences Approach
auteur: Imed Zitouni, Kamel Smaïli, Jean-Paul Haton
article: International Conference on Spoken Language Processing, 2000, Beijing, China. pp.4
Accès au texte intégral et bibtex

Book sections

titre: MAUD : Un prototype de machine à dicter vocale
auteur: Dominique Fohr, Jean-Paul Haton, Jean-François Mari, Kamel Smaïli, Imed Zitouni
article: Ressources et évaluation en ingénierie des langues, De Boeck, pp.315-328, 2000, universités francophones
Accès au texte intégral et bibtex

Reports

titre: Report about the user experience with ISAEUS
auteur: Marie-Christine Haton
article: [Contract] A00-R-242 || haton00a, 2000, 75 p
Accès au bibtex

titre: Exploitation plan of the French ISAEUS system
auteur: Marie-Christine Haton
article: [Contract] A00-R-243 || haton00b, 2000, 17 p
Accès au bibtex

titre: Le système ISAEUS, manuel utilisateur
auteur: Marie-Christine Haton, Jean-Paul Haton
article: [Contract] A00-R-244 || haton00c, 2000, 55 p
Accès au bibtex

1999

Journal articles

titre: A Minimum Cross-Entropy Approach to Hidden Markov Model Adaptation
auteur: Mohamed Afify, Yifan Gong, Jean-Paul Haton
article: IEEE Signal Processing Letters, 1999, 6 (6), pp.132-134
Accès au bibtex

Conference papers

titre: Towards a Better Collaboration Between a n-class and a n-gram Language Model
auteur: Kamel Smaïli, Imed Zitouni, Jean-Paul Haton
article: SPECOM, Oct 1999, Moscow, Russia
Accès au bibtex

titre: Improvement of Multi-Band Speech Recognition
auteur: Jean-Paul Haton, Christophe Cerisara, Dominique Fohr
article: International Workshop Speech & Computer – SPECOM’99, Oct 1999, Moscow, Russia
Accès au bibtex

titre: Use of articulatory and spectral information for speech training
auteur: Marie-Christine Haton, Jean-Paul Haton
article: The XIVth International Congress of Phonetic Sciences, Aug 1999, San Francisco, USA, 5 p
Accès au bibtex

titre: Snorri, a software for speech sciences
auteur: Yves Laprie
article: ESCA/SOCRATES Workshop on Method & Tool Innovations for Speech Science Education MATISSE, Apr 1999, London, UK, pp.89-92
Accès au texte intégral et bibtex

titre: Hypertext atlas of speech sounds
auteur: Anne Bonneau, Yves Laprie, Jacqueline Vaissière
article: Method & Tool Innovations for Speech Science Education, Workshop of the European Speech Communication Association, Apr 1999, pp.65-68
Accès au bibtex

titre: Représentation des connaissances à l’aide d’une qualification floue et de frames dans un environnement manipulant des normes
auteur: Virginie Govaere
article: IIIeme Colloque Jeunes Chercheurs en Sciences Cognitives, Apr 1999, Soulac, France, pp.110-115
Accès au texte intégral et bibtex

titre: A phonetically-guided diagnosis of auditory deficiency based on synthetic speech stimuli
auteur: Anne Bonneau, Parham Mokhtari
article: 6th European Conference on Speech Communication & Technology – EUROSPEECH’99, Technical University of Budapest & The Scientific Society for Telecommunications, 1999, Budapest, Hungary, pp.559-562
Accès au texte intégral et bibtex

titre: Design of hypercube codebooks for the acoustic-to-articulatory inversion respecting the non-linearities of the articulatory-to-acoustic mapping
auteur: Slim Ouni, Yves Laprie
article: 6th European Conference on Speech Communication & Technology – EUROSPEECH’99, 1999, Budapest, Hungary, pp.141-144
Accès au bibtex

titre: A New Based Distance Language Model for a Dictation Machine: application to MAUD
auteur: David Langlois, Kamel Smaïli
article: 6th European Conference on Speech Communication & Technology – EUROSPEECH’99, 1999, Budapest, Hungary, pp.1779-1782
Accès au bibtex

titre: Towards a Global Optimization Scheme for Multi-Band Speech Recognition
auteur: Christophe Cerisara, Jean-Paul Haton, Dominique Fohr
article: 6th European Conference on Speech Communication & Technology – EUROSPEECH’99, 1999, Budapest, Hungary, 4 p
Accès au texte intégral et bibtex

titre: Robust behavior of multi-band paradigm
auteur: Christophe Cerisara, Dominique Fohr, Jean-Paul Haton
article: Robust Methods for Speech Recognition in Adverse Conditions, Nokia, COST249 & IEEE, 1999, Tampere, Finland, 4 p
Accès au texte intégral et bibtex

titre: L’intelligence artificielle et ses applications
auteur: Jean-Paul Haton
article: Congrès de la société française d’anesthésie-réanimation – SFIMAR’99, 1999, Nantes, France
Accès au bibtex

titre: A Combination of Representation Styles for the Acquirement of Speech Abilities
auteur: Virginie Govaere
article: Artificial Intelligence in Education, S.P. Lajoie & M. Vivet, 1999, pp.371-378
Accès au texte intégral et bibtex

titre: An Efficient F0 Determination Algorithm Based on the Implicit Calculation of the Autocorrelation of the Temporal Excitation Signal
auteur: Joseph Di Martino, Yves Laprie
article: 6th European Conference on Speech Communication & Technology – EUROSPEECH’99, 1999, Budapest, Hungary. 4 p
Accès au texte intégral et bibtex

titre: Speech training for deaf and hearing-impaired people
auteur: Ramon Garcia Gomez, Marie-Christine Haton, Jean-Paul Haton, Christophe Antoine, Pierre Alinat
article: European Speech Communication Association, 1999, Budapest, Hungary, 5 p
Accès au bibtex

titre: Statistical Models for Robust Speech Recognition
auteur: Jean-Paul Haton
article: International Symposium on Pattern Recognition, 1999, Brussels, Belgium
Accès au bibtex

titre: Evaluation of a Segmentation System based on Multi-Level Lattices
auteur: Jean-Luc Husson
article: 6th European Conference on Speech Communication & Technology – EUROSPEECH’99, 1999, Budapest, Hungary, 4 p
Accès au bibtex

titre: Variable-Length Sequence Language Model for Large Vocabulary Continuous Dictation Machine
auteur: Imed Zitouni, Jean-François Mari, Kamel Smaïli, Jean-Paul Haton
article: 6th European Conference on Speech Communication and Technology – EUROSPEECH’99, 1999, Budapest, Hungary
Accès au texte intégral et bibtex

titre: Automatic and manual clustering for large vocabulary speech recognition: a comparative study
auteur: Kamel Smaïli, Armelle Brun, Imed Zitouni, Jean-Paul Haton
article: 6th European Conference on Speech Communication & Technology – EUROSPEECH’99, 1999, Budapest, Hungary, 4 p
Accès au bibtex

titre: Physique et intelligence artificielle
auteur: Jean-Paul Haton
article: Congrès de la Société Française de Physique, 1999, Clermont-Ferrand, France
Accès au bibtex

Book sections

titre: Dealing With Loss of Synchronism in Multi-Band Continuous Speech Recognition Systems
auteur: Christophe Cerisara
article: Computational Models of Speech Pattern Processing, 14 p, 1999
Accès au bibtex

1998

Conference papers

titre: Variable-length class sequences based on a hierarchical approach: MCnv
auteur: Imed Zitouni, Kamel Smaïli, Jean-Paul Haton
article: SPECOM 1998 – 3rd International Workshop on Speech and Computer, Oct 1998, Saint Petersburg, Russia
Accès au bibtex

titre: A first evaluation campaign for language models
auteur: M Jardino, F Bimbot, S Igounet, Kamel Smaïli, I Zitouni, Marc El Bèze
article: First International Conference on Language Resources and Evaluation, May 1998, Granada, Spain
Accès au texte intégral et bibtex

titre: Cooperation of frequency and time-domain methods for pitch tracking
auteur: Jean-Luc Husson, Yves Laprie
article: SPECOM’98, 1998, St. Petersburg, Russia, pp.293-298
Accès au bibtex

titre: A Comparative Study Between Polyclass and Multiclass Language Models
auteur: I Zitouni, K Smaïli, S Deligne, F Bimbot
article: Proceedings of the Fifth International Conference on Spoken Language Processing, 1998, Sydney, Australia
Accès au texte intégral et bibtex

Reports

titre: Une méthode d’inversion acoustico-articulatoire
auteur: Slim Ouni
article: [Internship report] 98-R-407 || ouni98a, 1998, 50 p
Accès au bibtex

1997

Conference papers

titre: An Hybrid Language Model for a Continuous Dictation Prototype
auteur: Kamel Smaïli, Imed Zitouni, François Charpillet, Jean-Paul Haton
article: 5th European Conference on Speech Communication and Technology, Sep 1997, Rhodes, Greece
Accès au bibtex

titre: Speech synthesis using phase vocoder techniques
auteur: Joseph Di Martino
article: EUROSPEECH – Fifth European Conference on Speech Communication and Technology – 1997, Sep 1997, Rhodes, Greece
Accès au bibtex

titre: Towards an oral interface for data entry: The MAUD System
auteur: Dominique Fohr, Jean-Paul Haton, Jean-François Mari, Kamel Smaïli, Imed Zitouni
article: European Research Consortium for Informatics and Mathematics User Interfaces for All, 1997, Nancy, France
Accès au texte intégral et bibtex

Theses

titre: Traitement automatique de la parole en milieu bruité : étude de modèles connexionnistes statiques et dynamiques
auteur: Laurent Buniet
article: Human-Computer Interaction [cs.HC]. Université Henri Poincaré – Nancy 1, 1997. French.
Accès au texte intégral et bibtex

1996

Conference papers

titre: A new algorithm for Automatic Word Classification based on an Improved Simulated Annealing Technique
auteur: Kamel Smaïli, François Charpillet, Jean-Paul Haton
article: The 5th International Conference on the Cognitive Science of Natural Language Processing, 1996, Dublin, Ireland
Accès au texte intégral et bibtex

1994

Conference papers

titre: Extraction of formants of oral vowels and critical analysis for speaker characterization
auteur: Odile Mella
article: ESCA Workshop on Automatic Speaker Recognition, Identification and Verification, Apr 1994, Martigny, Switzerland. pp.193-196
Accès au texte intégral et bibtex

titre: Which model for future speech recognition systems: Hidden Markov models or finite-state automata?
auteur: Joseph Di Martino, Jean-François Mari, B. Mathieu, K. Perot, Kamel Smaïli
article: IEEE International Conference on Acoustics, Speech, and Signal Processing – ICASSP-94, 1994, Adelaide, Australia. pp.633-635
Accès au bibtex

1993

Conference papers

titre: A Level-Building Top-Down Parsing Algorithm for Context-Free Grammars in Continuous Speech Recognition
auteur: François Charpillet, Joseph Di Martino
article: EUROSPEECH – Third European Conference on Speech Communication and Technology – 1993, Sep 1993, Berlin, Germany. pp.1947-1949
Accès au bibtex

titre: Integration of phonological knowledge in a continuous speech recognition system
auteur: Roselyne Nguyen, Kamel Smaïli, Jean-Paul Haton, Guy Pérennou
article: European Conference on Speech Communication and Technology, Sep 1993, Berlin, Germany. pp.2191-2194
Accès au bibtex

titre: MAUD : Une interface vocale pour la saisie de textes lus
auteur: Kamel Smaïli, François Charpillet, Jean-Paul Haton
article: 2nd International conference Interface to real and virtual worlds, Mar 1993, Montpellier, France. pp.311-318
Accès au bibtex

Theses

titre: Contribution to the automatic identification of the speaker on acoustic and phonetic criteria
auteur: Odile Mella
article: Computation and Language [cs.CL]. Université de Nancy I, 1993. French. ⟨NNT : 1993NAN10411⟩
Accès au texte intégral et bibtex

1992

Conference papers

titre: Pertinence des trois premiers formants des voyelles orales dans la caractérisation du locuteur
auteur: Odile Mella
article: JEP 1992 – 19e Journées d’Etude sur la Parole, May 1992, Brussels, Belgium. pp.1-6
Accès au texte intégral et bibtex

titre: La composante lexicale de la machine à dicter MAUD
auteur: Kamel Smaïli, François Charpillet, Jean-Marie Pierrel, Jean-Paul Haton
article: Séminaire Lexique, Jan 1992, Toulouse, France. pp.71-82
Accès au texte intégral et bibtex

1991

Conference papers

titre: A continuous speech recognition approach for the design of a dictation machine
auteur: Kamel Smaïli, François Charpillet, Jean-Marie Pierrel, Jean-Paul Haton
article: European Conference on Speech Technology, 1991, Genova, Italy. pp.953-956
Accès au texte intégral et bibtex

1990

Journal articles

titre: Statistical methods in multi-speaker automatic speech recognition
auteur: Anne Boyer, Joseph Di Martino, P. Divoux, Jean-Paul Haton, Jean-François Mari, Kamel Smaïli
article: Applied Stochastic Models and Data Analysis, 1990, 6 (3), pp.143-155. ⟨10.1002/asm.3150060302⟩
Accès au bibtex

Conference papers

titre: Idées et concepts de réalisation d’une machine à dicter destinée aux grands vocabulaires
auteur: Kamel Smaïli, François Charpillet, Jean-Marie Pierrel, Jean-Paul Haton
article: XVIIIèmes journées d’études sur la parole, May 1990, Montréal, Canada. pp.337-341
Accès au bibtex

1988

Conference papers

titre: Statistical methods in multi-speaker automatic speech recognition
auteur: Anne Boyer, Joseph Di Martino, P. Divoux, Jean-Paul Haton, Jean-François Mari, Kamel Smaïli
article: ASMDA – 4th International Symposium on Applied stochastic models and data analysis – 1988, 1988, Nancy, France
Accès au bibtex

1987

Conference papers

titre: On multi-level machines for continuous speech recognition
auteur: Joseph Di Martino
article: IJCAI – Tenth International Joint Conference on Artificial Intelligence – 1987, Aug 1987, Milan, Italy. pp.836-839
Accès au bibtex

titre: Dynamic time warping and vector quantization in isolated and connected word recognition
auteur: Anne Boyer, Jean-Paul Haton, Joseph Di Martino
article: ECST – European Conference on Speech Technology – 1987, 1987, Edinburgh, Scotland, United Kingdom. pp.2436-2439
Accès au bibtex

1986

Conference papers

titre: Reconnaissance de la parole continue par programmation dynamique
auteur: Joseph Di Martino
article: JEP – Actes 15èmes Journées d’Etudes sur la Parole – 1986, May 1986, Aix-en-Provence, France
Accès au bibtex

titre: Reconnaissance de la parole multi-locuteur par programmation dynamique
auteur: Anne Boyer, Joseph Di Martino, Jean-Paul Haton
article: JEP – Actes 15èmes Journées d’Etudes sur la Parole – 1986, May 1986, Aix-en-Provence, France
Accès au bibtex

1985

Conference papers

titre: Un algorithme de reconnaissance de mots enchaînés avec contraintes syntaxiques
auteur: Anne Boyer, Joseph Di Martino, Jean-Paul Haton
article: JEP – Actes 14èmes Journées d’Etudes sur la Parole – 1985, Jun 1985, Paris, France
Accès au bibtex

Book sections

titre: Dynamic Time Warping Algorithms for Isolated and Connected Word Recognition
auteur: Joseph Di Martino
article: Renato De Mori and Ching Y. Suen. New Systems and Architectures for Automatic Speech Recognition and Synthesis, 16, Springer Verlag, pp.405-418, 1985, NATO ASI Series, 978-3-642-824494. ⟨10.1007/978-3-642-82447-0_15⟩
Accès au bibtex