Publications
Selected Publications
* denotes equal contribution as first author.
de Seyssel, M., D'Avirro, A., Williams, A., & Dupoux, E. (2024). EmphAssess: a Prosodic Benchmark on Assessing Emphasis Transfer in Speech-to-Speech Models. arXiv preprint arXiv:2312.14069. [pdf]
de Seyssel, M.*, Lavechin, M.*, & Dupoux, E. (2023) Realistic and broad-scope learning simulations: First results and challenges. Journal of Child Language, 1-24. doi:10.1017/S0305000923000272 [pdf]
de Seyssel, M., Lavechin, M., Titeux, H., Thomas, A., Virlet, G., Santos Revilla, A., Wisniewski, G., Ludusan, B., & Dupoux, E. (2023) ProsAudit, a prosodic benchmark for self-supervised speech models. In Proc. Interspeech 2023. [pdf] [benchmark] [leaderboard]
Endress, A., & de Seyssel, M. (2022). The limits of statistical learning in word segmentation: Accumulation of predictive information from unstructured input in the absence of (declarative) memory. Retrieved from psyarxiv.com/u9z4a [pdf]
Nguyen, T. A.*, de Seyssel, M.*, Rozé, P., Rivière, M., Kharitonov, E., Baevski, A., Dunbar, E., & Dupoux,E. (2020). The zero resource speech benchmark 2021: Metrics and baselines for unsupervised spoken language modeling. In Neurips Workshop on Self-Supervised Learning for Speech and Audio Processing. [pdf] [video]
Academic dissertations
de Seyssel, M. (2023). Unsupervised multilingual models of speech representation, an approach inspired by cognitive science. [PhD Dissertation]. Ecole Normale Supérieure. [pdf]
de Seyssel, M. (2017). Active learning for training data selection for Automatic Speech Recognition using Unreliable Transcriptions. [Unpublished MSc Dissertation]. University of Edinburgh. (restricted access rights - manuscript available on demand)
de Seyssel, M. (2016). The Role of Statistical and Crosslinguistic Prosodic Cues in Segmenting Groups of Words. [Unpublished BSc Dissertation]. City, University of London. [pdf]
Other publications
Lavechin, M., de Seyssel, M., Métais, M., Metze, F., Mohamed, A., Bredin, H., Dupoux, E. & Cristia, A. (2023). Statistical learning models of early phonetic acquisition struggle with child-centered audio data. Retrieved from psyarxiv.com/hav58 [full article] [preprint]
Lavechin, M., de Seyssel, M., Gautheron, L., Dupoux, E., & Cristia, A. (2021). Reverse-engineering language acquisition with child-centered long-form recordings. Annual Review of Linguistics, 8, 389-407. [pdf]
Dunbar, E., Bernard, M., Hamilakis, N., Nguyen, T.A., de Seyssel, M., Rozé, P., Rivière, M., Kharitonov, E. & Dupoux, E. (2021). The Zero Resource Speech Challenge 2021: Spoken Language Modelling. In Proc. Interspeech 2021, 1574-1578, doi: 10.21437/Interspeech.2021-1755. [pdf]
Nguyen, T.A., de Seyssel, M., Algayres, R., Roze, P., Dunbar, E., Dupoux, E. (2020). Are word boundaries useful for unsupervised language learning? CoML Technical Report, September 2020. [pdf]
Maudet, E., Cattan, O., de Seyssel, M., & Servan, C. (2019). Qwant Research@ DEFT 2019: appariement de documents et extraction d’informations à partir de cas cliniques (Document matching and information retrieval using clinical cases). In Actes de la Conférence sur le Traitement Automatique des Langues Naturelles (TALN) PFIA 2019. Défi Fouille de Textes (atelier TALN-RECITAL) (pp. 67-80). [pdf]