news

Jun 2025 New preprint: πŸ•΅ Model Internal Sleuthing: Finding Lexical Identity and Inflectional Morphology in Modern Language Models led by my undergrad mentee Michael Li is out!
May 2025 Started as a student researcher (intern) on the Cloud AI research team at Google Cloud working with Hamid Palangi on actionable interpretability πŸ”Ž to supercharge tool-using agents πŸ€–
Apr 2025 At NAACL in Albuquerque :cactus: to present :mouse:MICE for CATs! Reach out if you want to chat about interpretability things πŸ”Ž
Apr 2025 :mouse: MICE for CATs: Model-Internal Confidence Estimation for Calibrating Agents with Tools won a Best Paper Runner Up prize at the LTI Student Research Symposium :tada:
Jan 2025 :mouse: MICE for CATs: Model-Internal Confidence Estimation for Calibrating Agents with Tools has been accepted to NAACL2025 as a main conference paper! :tada: See you in Albuquerque in April!
Aug 2024 OLMo won an outstanding paper award at ACL 2024!
Aug 2024 Dolma won an outstanding resource paper award at ACL 2024!
Jun 2024 At NAACL in Mexico City :mexico:; come say hi!
Jun 2024 In Seattle for the summer πŸ”οΈ - started as a PhD research intern on the semantic machines team at Microsoft Research working with Sam Thomson and Yu Su on calibrating tool-using agents πŸ€–
May 2024 OLMo and Dolma accepted to the main conference at ACL. See my wonderful coauthors in Bangkok :thailand: in August!
Apr 2024 Evaluating Personal Information Parroting in Language Models has been accepted to TrustNLP! See you in Mexico City :mexico: in June!
Nov 2023 Had a wonderful time giving a talk on Steering vectors: an alternative way to steer language models at in Annie En-Shiun Lee’s group at OntarioTech!
Aug 2023 Started my PhD :mortar_board: at CMU LTI with Mona Diab on model interpretability :mag_right: