I am a PhD student at CMU in the Language Technologies Institute (LTI) advised by Mona Diab. I am interested in building responsible and controllable NLP systems through understanding the internals of language models with an eye towards steering their generations in a reliable, trustworthy, and efficient manner. I also have experience working on large language model initiatives such as BLOOM and am currently involved in the OLMo initiative at AI2.
Before CMU, I was a predoctoral researcher at The Allen Institute for Artificial Intelligence on the AllenNLP team, where I worked with Matt Peters on controllable text generation and steering language models. I also collaborated with Margaret Mitchell and Sasha Luccioni on personal information in large web corpora. I am also an NLP researcher affiliated with Masakhane, an open source and distributed research effort for NLP for African languages. I have also spent time in industry working on controlling language models, document understanding, optical character recognition, fake speech detection, and speech syntehsis at different companies in both an applied and research context. I completed my MS in Computer Science at the Courant Institute at NYU in the CILVR group focusing on deep learning applied to NLP. I also completed my B.A. in Statistics and M.S. in Computer Science focusing on ML and NLP at Northwestern University working with Doug Downey.
Broadly, my research interests are:
I follow both international and club football (soccer), NBA basketball, and professional tennis very closely. I’m a huge supporter of Borussia Dortmund from the German Bundesliga.
We audit web-crawled multilingual datasets and find that many corpora are completely erroneus. Furthermore, we find that for many languages, less than 50% of sentences are of acceptable quality.
PhD Student at Carnegie Mellon University (Language Technologies Institute)
Predoctoral Young Investigator at the Allen Institute for AI (AllenNLP team)
Predoctoral Resident at Intel’s Intelligent Systems Lab
ML Research Scientist at Scale AI
Research Scientist at AI Foundation
MS in Computer Science (Machine Learning) at New York University
Deep Learning Research Intern at Salesforce Research
Research Assistant in Deep Learning & NLP at Northwestern University
MS in Computer Science at Northwestern University
Master’s Exchange Student in Computer Science at ETH Zurich
Research Assistant in Biomedical Informatics at Stanford University
Research Assistant in Neural Network Language Modeling at Northwestern University
BA in Statistics at Northwestern University
Drop me an email if you are interested in collaborating on research or have any questions regarding my projects.