Welcome to my own little personal space...


I'm a tenured associate professor (Maître de Conférences) at the University Paris-Sorbonne. From September 2006 to December 2009, I was a member of the Lalic team, then of Alpage until the end of that project. Now, I'm a member of the Almanach team.

I used to be a postdoc in DCU's GramLab group.
Before that I was a PhD student in the Langue et Dialogue team at Loria in France.

My interests cover all parts of Natural Language Processing, mainly syntactic analysis, wide-coverage parsing, the syntax-semantics interface, etc.
These days I'm working on user-generated content through treebanking and parsing, focusing on context-aware models. I'm also more and more involved in contextual neural language models for under-resourced languages. Contact me if you're interested!


We got a paper accepted at NAACL, bravo Ghazi and Joseph!

I gave an interview in January on France Culture for La méthode scientifique ("7-minute report")

Great news! I gave a tutorial at the NLP winter school in the Alps (sometime between January 17th and January 21st, 2022)!!

I also gave a cool talk at ENS Lyon as part of their IXXI seminar series. Recording (in French) here


I'm invited to give a talk at the DFKI on CamemBERT and character-based BERT models (October 12th)!

Another cool year (2021): one huge European project accepted! 3 papers accepted at EACL, NAACL and EMNLP! Many work-in-progress papers accepted at cool workshops! Congrats to my students!

Great year so far (2020) (besides that whole global pandemic, of course): 3 papers accepted at ACL, one at LREC. Co-organization of the first two IWPT shared tasks on parsing Enhanced Universal Dependencies, and super interesting collaborative work on UGC treebanking.

Another talk given last January at the NLP Paris Meetup about CamemBERT, the French contextual language model that was actually the first large-scale BERT model to be released for a language other than English. A talk on the same topic was also given to all of Axa's worldwide R&D groups.

Together with Marie Candito, I'll be organizing the 2019 edition of the Treebanks and Linguistic Theories conference in Paris. It'll be a joint event with Depling 2019 and the UD 2019 workshop. (Actually it was a whole week of conferences, called the SyntaxFest 2019. Check it out: SyntaxFest 2019 Website)

Our 2017 #ParsingTragedy system was re-evaluated and finally ranked #3 overall :) This year's system (mostly by my students Ganesh Jawahar and Benjamin Muller) ranked #10, with a nice model though (ELMo and external lexicon features, Dozat's neural model).

We teamed up with the Stanford NLP group (Sebastian Schuster and Chris Manning) and ranked #1 and #3 at the Extrinsic Evaluation 2017 Shared Task :)

We unofficially scored #6 at the #ParsingTragedy shared task. Why unofficial? Because a bug in the official metadata led our parser to fall back on delexicalized mode for all languages. Officially ranked #26, but #3 on POS tagging. Check out our paper


Co-organizing the SPMRL 2014 Shared Task (with Reut Tsarfaty and Sandra Kübler)
Check it out: http://www.spmrl.org/spmrl2014-sharedtask.html it's still ongoing. This year with an emphasis on semi-supervised parsing!

Honorably participating in SemEval Task 8, broad-coverage semantic parsing (two transition-based graph parsers + syntactic features)

Co-organizing the SPMRL 2013 Shared Task (with Reut Tsarfaty and Sandra Kübler)
Check it out: http://www.spmrl.org/spmrl2013-sharedtask.html

A more up-to-date wiki is here:

Ranked #2 and #3 on the constituency parsing track of the SANCL 2012 Google shared task, with a system based on self-training, delicate part-of-speech, normalisation and hard clustering (with Benoit Sagot)

(last updated April 17th, 2022)