Mostafa Sadeghi – RobotLearn

Since August 2018 I have been a postdoctoral researcher in the PERCEPTION team at Inria Grenoble Rhône-Alpes.

My research interests lie at the intersection of machine learning and signal processing. In particular, I am interested in using unsupervised probabilistic generative models, such as variational autoencoders (VAEs), to solve inverse problems. One problem I am currently working on is audio-visual speech enhancement using VAEs.

You can visit my web page at this address.

Contact

Inria Grenoble Rhône-Alpes
655, avenue de l’Europe
38330 Montbonnot Saint-Martin
France
Email: mostafa dot sadeghi at inria dot fr

Publications

HAL publications of Mostafa Sadeghi

2025

Journal articles

Title: Objective and subjective evaluation of speech enhancement methods in the UDASE task of the 7th CHiME challenge
Authors: Simon Leglaive, Matthieu Fraticelli, Hend ElGhazaly, Léonie Borne, Mostafa Sadeghi, Scott Wisdom, Manuel Pariente, John R. Hershey, Daniel Pressnitzer, Jon P. Barker
Reference: Computer Speech and Language, 2025, 89, ⟨10.1016/j.csl.2024.101685⟩

Title: Posterior Transition Modeling for Unsupervised Diffusion-Based Speech Enhancement
Authors: Mostafa Sadeghi, Jean-Eudes Ayilo, Romain Serizel, Xavier Alameda-Pineda
Reference: IEEE Signal Processing Letters, In press, pp.1-5. ⟨10.1109/LSP.2025.3583967⟩

Conference papers

Title: Data-independent Beamforming for End-to-end Multichannel Multi-speaker ASR
Authors: Can Cui, Paul Magron, Mostafa Sadeghi, Emmanuel Vincent
Reference: IEEE 27th International Workshop on Multimedia Signal Processing (MMSP 2025), IEEE, Sep 2025, Beijing, China

Title: Towards Skeletal and Signer Noise Reduction in Sign Language Production via Quaternion-Based Pose Encoding and Contrastive Learning
Authors: Guilhem Fauré, Mostafa Sadeghi, Sam Bigeard, Slim Ouni
Reference: SLTAT 2025: 9th Workshop on Sign Language Translation and Avatar Technologies, Sep 2025, Berlin, Germany. ⟨10.1145/3742886.3756728⟩

Title: End-to-end Joint Punctuated and Normalized ASR with a Limited Amount of Punctuated Training Data
Authors: Can Cui, Imran Ahamad Sheikh, Mostafa Sadeghi, Emmanuel Vincent
Reference: European Signal Processing Conference (EUSIPCO 2025), Sep 2025, Palermo, Italy

Title: Joint Beamforming and Speaker-Attributed ASR for Real Distant-Microphone Meeting Transcription
Authors: Can Cui, Imran Ahamad Sheikh, Mostafa Sadeghi, Emmanuel Vincent
Reference: European Signal Processing Conference (EUSIPCO 2025), Sep 2025, Palermo, Italy

Title: Diffusion-based Unsupervised Audio-visual Speech Enhancement
Authors: Jean-Eudes Ayilo, Mostafa Sadeghi, Romain Serizel, Xavier Alameda-Pineda
Reference: ICASSP 2025 – International Conference on Acoustics Speech and Signal Processing, IEEE, Apr 2025, Hyderabad, India. pp.1-5

2024

Journal articles

Title: Unsupervised Performance Analysis of 3D Face Alignment with a Statistically Robust Confidence Test
Authors: Mostafa Sadeghi, Xavier Alameda-Pineda, Radu Horaud
Reference: Neurocomputing, 2024, 564, pp.1-16. ⟨10.1016/j.neucom.2023.126941⟩

Conference papers

Title: Improving Speaker Assignment in Speaker-Attributed ASR for Real Meeting Applications
Authors: Can Cui, Imran Ahamad Sheikh, Mostafa Sadeghi, Emmanuel Vincent
Reference: The Speaker and Language Recognition Workshop Odyssey 2024, Jun 2024, Quebec, Canada

Title: Diffusion-based speech enhancement with a weighted generative-supervised learning loss
Authors: Jean-Eudes Ayilo, Mostafa Sadeghi, Romain Serizel
Reference: International Conference on Acoustics Speech and Signal Processing (ICASSP), IEEE, Apr 2024, Seoul, South Korea. ⟨10.48550/arXiv.2309.10457⟩

Title: A weighted-variance variational autoencoder model for speech enhancement
Authors: Ali Golmakani, Mostafa Sadeghi, Xavier Alameda-Pineda, Romain Serizel
Reference: ICASSP 2024 – International Conference on Acoustics Speech and Signal Processing, IEEE, Apr 2024, Seoul, South Korea. pp.1-5, ⟨10.1109/ICASSP48485.2024.10446294⟩

Title: Unsupervised speech enhancement with diffusion-based generative models
Authors: Berné Nortier, Mostafa Sadeghi, Romain Serizel
Reference: International Conference on Acoustics Speech and Signal Processing (ICASSP), IEEE, Apr 2024, Seoul, South Korea. ⟨10.48550/arXiv.2309.10450⟩

Title: Posterior sampling algorithms for unsupervised speech enhancement with recurrent variational autoencoder
Authors: Mostafa Sadeghi, Romain Serizel
Reference: International Conference on Acoustics Speech and Signal Processing (ICASSP), IEEE, Apr 2024, Seoul, South Korea. ⟨10.48550/arXiv.2309.10439⟩

2023

Journal articles

Title: Expression-preserving face frontalization improves visually assisted speech processing
Authors: Zhiqi Kang, Mostafa Sadeghi, Radu Horaud, Xavier Alameda-Pineda
Reference: International Journal of Computer Vision, 2023, 131 (5), pp.1122-1140. ⟨10.1007/s11263-022-01742-1⟩

Conference papers

Title: End-to-end Multichannel Speaker-Attributed ASR: Speaker Guided Decoder and Input Feature Analysis
Authors: Can Cui, Imran Ahamad Sheikh, Mostafa Sadeghi, Emmanuel Vincent
Reference: 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU 2023), Dec 2023, Taipei, Taiwan. ⟨10.1109/ASRU57964.2023.10389729⟩

Title: The CHiME-7 UDASE task: Unsupervised domain adaptation for conversational speech enhancement
Authors: Simon Leglaive, Léonie Borne, Efthymios Tzinis, Mostafa Sadeghi, Matthieu Fraticelli, Scott Wisdom, Manuel Pariente, Daniel Pressnitzer, John R. Hershey
Reference: 7th International Workshop on Speech Processing in Everyday Environments (CHiME), Aug 2023, Dublin, Ireland. ⟨10.21437/CHiME.2023-2⟩

Title: Audio-visual speech enhancement with a deep Kalman filter generative model
Authors: Ali Golmakani, Mostafa Sadeghi, Romain Serizel
Reference: International Conference on Acoustics Speech and Signal Processing (ICASSP), IEEE, Jun 2023, Rhodes Island, Greece

Title: Fast and efficient speech enhancement with variational autoencoders
Authors: Mostafa Sadeghi, Romain Serizel
Reference: International Conference on Acoustics Speech and Signal Processing (ICASSP), IEEE, Jun 2023, Rhodes Island, Greece

Poster communications

Title: End-to-end Multichannel Speaker-Attributed ASR: Speaker Guided Decoder and Input Feature Analysis
Authors: Can Cui, Imran Ahamad Sheikh, Mostafa Sadeghi, Emmanuel Vincent
Reference: Rencontre des Jeunes Chercheurs en Parole 2023 – 10E Edition, Nov 2023, Grenoble, France

Preprints, Working Papers, …

Title: The CHiME-7 UDASE task: Unsupervised domain adaptation for conversational speech enhancement
Authors: Simon Leglaive, Léonie Borne, Efthymios Tzinis, Mostafa Sadeghi, Matthieu Fraticelli, Scott Wisdom, Manuel Pariente, Daniel Pressnitzer, John Hershey
Reference: 2023

2022

Journal articles

Title: Non-Smooth Regularization: Improvement to Learning Framework through Extrapolation
Authors: Sajjad Amini, Mohammad Soltanian, Mostafa Sadeghi, Shahrokh Ghaemmaghami
Reference: IEEE Transactions on Signal Processing, 2022, 70, pp.1213-1223. ⟨10.1109/TSP.2022.3154969⟩

Conference papers

Title: A Sparsity-promoting Dictionary Model for Variational Autoencoders
Authors: Mostafa Sadeghi, Paul Magron
Reference: INTERSPEECH 2022, Sep 2022, Incheon, South Korea

Title: The Impact of Removing Head Movements on Audio-visual Speech Enhancement
Authors: Zhiqi Kang, Mostafa Sadeghi, Radu Horaud, Xavier Alameda-Pineda, Jacob Donley, Anurag Kumar
Reference: ICASSP 2022 – IEEE International Conference on Acoustics, Speech and Signal Processing, IEEE Signal Processing Society, May 2022, Singapore, Singapore. pp.1-5, ⟨10.1109/ICASSP43922.2022.9746401⟩

2021

Journal articles

Title: Mixture of Inference Networks for VAE-based Audio-visual Speech Enhancement
Authors: Mostafa Sadeghi, Xavier Alameda-Pineda
Reference: IEEE Transactions on Signal Processing, 2021, 69, pp.1899-1909. ⟨10.1109/TSP.2021.3066038⟩

Conference papers

Title: Deep Variational Generative Models for Audio-visual Speech Separation
Authors: Viet-Nhat Nguyen, Mostafa Sadeghi, Elisa Ricci, Xavier Alameda-Pineda
Reference: MLSP 2021 – IEEE International Workshop on Machine Learning for Signal Processing, Oct 2021, Gold Coast, Australia. pp.1-6, ⟨10.1109/MLSP52302.2021.9596406⟩

Title: Robust Face Frontalization For Visual Speech Recognition
Authors: Zhiqi Kang, Radu Horaud, Mostafa Sadeghi
Reference: ICCVW 2021 – International Conference on Computer Vision Workshops, IEEE, Oct 2021, Montreal – Virtual, Canada. pp.2485-2495, ⟨10.1109/ICCVW54120.2021.00281⟩

Title: Switching Variational Auto-Encoders for Noise-Agnostic Audio-visual Speech Enhancement
Authors: Mostafa Sadeghi, Xavier Alameda-Pineda
Reference: ICASSP 2021 – 46th International Conference on Acoustics, Speech, and Signal Processing, Jun 2021, Toronto / Virtual, Canada. pp.1-5, ⟨10.1109/ICASSP39728.2021.9414097⟩

2020

Journal articles

Title: Audio-Visual Speech Enhancement Using Conditional Variational Auto-Encoders
Authors: Mostafa Sadeghi, Simon Leglaive, Xavier Alameda-Pineda, Laurent Girin, Radu Horaud
Reference: IEEE/ACM Transactions on Audio, Speech and Language Processing, 2020, 28, pp.1788-1800. ⟨10.1109/TASLP.2020.3000593⟩

Conference papers

Title: Robust Unsupervised Audio-visual Speech Enhancement Using a Mixture of Variational Autoencoders
Authors: Mostafa Sadeghi, Xavier Alameda-Pineda
Reference: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), May 2020, Barcelona, Spain. pp.7534-7538, ⟨10.1109/ICASSP40776.2020.9053730⟩

Title: Low Mutual and Average Coherence Dictionary Learning Using Convex Approximation
Authors: Javad Parsa, Mostafa Sadeghi, Massoud Babaie-Zadeh, Christian Jutten
Reference: ICASSP 2020 – IEEE International Conference on Acoustics, Speech and Signal Processing, IEEE, May 2020, Barcelona (virtual), Spain. pp.3417-3421, ⟨10.1109/ICASSP40776.2020.9052901⟩

Preprints, Working Papers, …

Title: Face Frontalization Based on Robustly Fitting a Deformable Shape Model to 3D Landmarks
Authors: Zhiqi Kang, Mostafa Sadeghi, Radu Horaud
Reference: 2020

Title: Unsupervised Performance Analysis of 3D Face Alignment
Authors: Mostafa Sadeghi, Sylvain Guy, Adrien Raison, Xavier Alameda-Pineda, Radu Horaud
Reference: 2020

2018

Journal articles

Title: Sparse Signal Recovery Using Iterative Proximal Projection
Authors: Fatemeh Ghayyem, Mostafa Sadeghi, Massoud Babaie-Zadeh, Saikat Chatterjee, Mikael Skoglund, Christian Jutten
Reference: IEEE Transactions on Signal Processing, 2018, 66 (4), pp.879-894. ⟨10.1109/TSP.2017.2778695⟩

2017

Conference papers

Title: Accelerated Dictionary Learning for Sparse Signal Representation
Authors: Fateme Ghayem, Mostafa Sadeghi, Massoud Babaie-Zadeh, Christian Jutten
Reference: LVA/ICA 2017 – 13th International Conference on Latent Variable Analysis and Signal Separation, Olivier Michel; Nadège Thirion-Moreau, Feb 2017, Grenoble, France. pp.531-541, ⟨10.1007/978-3-319-53547-0_50⟩

2014

Journal articles

Title: Learning Overcomplete Dictionaries Based on Atom-by-Atom Updating
Authors: Mostafa Sadeghi, Massoud Babaie-Zadeh, Christian Jutten
Reference: IEEE Transactions on Signal Processing, 2014, 62 (4), pp.883-891. ⟨10.1109/TSP.2013.2295062⟩

2013

Journal articles

Title: Dictionary Learning for Sparse Decomposition: A Novel Approach
Authors: Mostafa Sadeghi, Massoud Babaie-Zadeh, Christian Jutten
Reference: IEEE Signal Processing Letters, 2013, 20 (12), pp.1195-1198. ⟨10.1109/LSP.2013.2285218⟩

Conference papers

Title: Learning overcomplete dictionaries based on parallel atom-updating
Authors: Mostafa Sadeghi, Massoud Babaie-Zadeh, Christian Jutten
Reference: MLSP 2013 – IEEE 23rd International Workshop on Machine Learning for Signal Processing, Sep 2013, Southampton, United Kingdom. 5 p

Title: A new algorithm for learning overcomplete dictionaries
Authors: Mostafa Sadeghi, Massoud Babaie-Zadeh, Christian Jutten
Reference: EUSIPCO 2013 – 21st European Signal Processing Conference, Sep 2013, Marrakech, Morocco. pp.EUSIPCO 2013 1569746047

Title: Sequential subspace finding: a new algorithm for learning low-dimensional linear subspaces
Authors: Mostafa Sadeghi, Mohsen Joneidi, Massoud Babaie-Zadeh, Christian Jutten
Reference: EUSIPCO 2013 – 21st European Signal Processing Conference, Sep 2013, Marrakech, Morocco. pp.EUSIPCO 2013 1569746207