Sotabase

Career

· Researcher in Natural Language Processing, Columbia University2024–
· Teaching Assistant, Columbia University2016–
· Intern, Google2015–
· Co-organizer, Columbia University2014–
· Graduate Vice President, Columbia University2014–
· Member of GSAC, Columbia University2014–
· Research Assistant, Columbia University2012–
· Graduate Student, Columbia University

Publications (43)

MADAMIRA: A Fast, Comprehensive Tool for Morphological Analysis and Disambiguation of Arabic
International Conference on Language Resources and Evaluation · 2014
669
cited
MADA + TOKAN : A Toolkit for Arabic Tokenization , Diacritization , Morphological Disambiguation , POS Tagging , Stemming and Lemmatization
2009
348
cited
Annual Meeting of the Association for Computational Linguistics · 2008
173
cited
Annual Meeting of the Association for Computational Linguistics · 2009
149
cited
Morphological Analysis and Disambiguation for Dialectal Arabic
North American Chapter of the Association for Computational Linguistics · 2013
141
cited
61
cited
Syntactic Annotation in the Columbia Arabic Treebank
2009
59
cited
Automatic Morphological Enrichment of a Morphologically Underspecified Treebank
North American Chapter of the Association for Computational Linguistics · 2013
24
cited
Using Deep Morphology to Improve Automatic Error Detection in Arabic Handwriting Recognition
Annual Meeting of the Association for Computational Linguistics · 2011
21
cited
arTenTen: a new, vast corpus for Arabic
2013
17
cited
Sotabase