About me

My research in computational linguistics focuses on language evolution and typology. I see computational tools as an opportunity to systematize linguistic analyses, a solution to study precisely large amounts of data, and a necessary methodological step towards typological investigation.

In February 2021, I joined the Surrey Morphology Group as a Newton International Fellow, to work on a typological study of exponence. The title of the project is: Solving the word puzzle: morphological analysis beyond stem and affixes.

Before joining the SMG, I was a post-doctoral researcher at the Max Planck Institute EVA, in the department of Linguistic and Cultural Evolution (DLCE), for a year and a half. I worked on inflectional lexicons, evolutionary models of inflectional paradigms and sound correspondence.

During my PhD, I studied the typological variation of inflection classes (declensions or conjugations) using computational methods.

Interests

  • Computational approaches to linguistic theory
  • Typology (and in particular canonical typology)
  • Morphology
  • Inflectional paradigms
  • Data curation and Inflected lexicons

Education

  • PhD in Linguistics, 2018

    Université Paris 7

  • MA in Language Sciences & NLP, 2014

    Université Paris 7

Publications

(2021). The fine implicative structure of European Portuguese conjugation. Isogloss. Open Journal of Romance Linguistics. PDF Publication Dataset
Recent literature has highlighted the extent to which inflectional paradigms are organised into systems of implications allowing speakers to make full use of the inflection system on the basis of exposure to only a few forms of each word. The present paper contributes to this line of research by investigating in detail the implicative structure of European Portuguese verbal paradigms. After …
(2021). Leaving the stem by itself. All Things Morphology: Its independence and its interfaces. PDF Publication
Stem allomorphy plays a central role in the recent history of morphology, in no small part thanks to a research program initiated by Aronoff (1994). Yet, there is no agreed upon way of deciding whether some bit of form should be considered a proper part of a stem allomorph or an independent exponent. We explore the possibility of just doing away with the notion of stem allomorphy in inflection. We …
(2021). One lexeme, many classes: inflection class systems as lattices. One-to-Many Relations in Morphology, Syntax and Semantics. PDF Publication
This paper discusses the nature of inflection classes (ICs) and provides a fully im- plemented methodology to conduct typological investigations into their structure. ICs (conjugations or declensions) are sets of lexemes which inflect similarly. They are often described as partitioning the set of lexemes, but similarities across classes lead some authors to favor hierarchical descriptions. While …

Talks

Spontaneous emergence of inflectional class systems via attraction-repulsion dynamics

Inflectional classes are ubiquitous in the world’s inflectional systems, but where do they come from? We introduce a simple, …

Towards automatic morphological analysis: aligning inflected forms

Several hypotheses exist according to which defectivity and overabundance can arise as a result of specific properties in the implicative relations which hold between paradigm forms. This presentation addresses the fundamental question of how we can obtain automatically good characterizations of these relations, starting from raw unsegmented inflected forms.

Simulating paradigm Evolution

analogical change and morphomic patterns

Datasets

European Portuguese Verbal Paradigms in Phonemic Notation

This is a collection of European Portuguese verbal paradigms, in phonemic notation. They are suited for both computational and manual …

Inflected lexicon of Russian Nouns in IPA notation

This inflected lexicon of Russian Nouns is based on data generated by a DATR fragment for the nominal system of Russian (Dunstan Brown …

Romance Verbal Inflection Dataset 2.0

The Romance Verbal Inflection Dataset 2.0 is a multilingual lexicon of Romance inflection covering 73 varieties. It provides verbal …

Software

Surrey Morphology Group Wordle

A wordle game for some languages studied at the Surrey Morphology Group: Archi and Nuer.

Feature Viz

This script generates natural class lattices for phoneme inventories defined by distinctive features. It is useful to visualize the natural classes implied by distinctive features.

Gitlab2Zenodo

Make your code and data citable with Zenodo and Gitlab !

IPA Keyboard

A keyboard layout for Onboard Keyboard, allowing for easily typing International Phonetic Alphabet symbols in utf-8 on linux.

Qumín

Qumín (Quantitative Modelling of Inflection) is a set of scripts written during my PhD to explore the structure of inflection class systems.