Emma Strubell

6709 Gates Hillman

Pittsburgh, PA, USA

I earned my Ph.D. from UMass Amherst working in the Information Extraction and Synthesis Laboratory with Andrew McCallum. Previously, I earned a B.S. in Computer Science from the University of Maine with a minor in math, where I applied epidemiological models to the spread of internet worms with David Hiebeler. I’ve also spent time as an intern and/or research scientist at Amazon, IBM, Meta and Google. You can find a formal bio for me here.

I do research at the intersection of natural language processing (NLP) and machine learning, and my broad research objective is bridging the gap between state-of-the-art NLP methods, and the wide variety of users who stand to benefit from that technology, but for whom that technology does not yet work in practice. You can learn more about my research group SLAB here.

/etc

Maintaining a healthy work-life balance is important to me. Outside of work, I enjoy cooking and baking (mostly vegetarian), fermenting (kombucha, kimchi, yogurt, sourdough), DIY renovating and restoring my weird old house, and hiking and camping with my two dogs, Nala and Pepper.

In 2019 I backpacked the first 250 miles (southbound) of the Colorado Trail with Nala, and I hope to finish the trail soon! I also like to summit the high points of U.S. states. So far I have completed: Colorado, Connecticut, Maine, Massachusetts, New Jersey, New York, Pennsylvania, Rhode Island, and West Virginia. I have also come close in Vermont and New Hampshire.

I am also co-author of Plant Jones, a semi-intelligent plant who tweets negatively about water when thirsty, and positively when not. Code is available here. Like many of us, Plant has been staying away from social media lately, but is still living his best life in my kitchen today!

I started programming in middle school on my TI-83 calculator, and started using Gentoo Linux in high school, all self-taught out of a strong motivation to h4ck the planet. Now I’ve sold out and use a Mac, but can still satisfy some of that system debugging itch when I teach On-Device ML.

news

Nov 01, 2024	💬 I am giving a talk on the environmental footprint of AI at AI and the Environment: Sustaining the Common Good, a day-long conference organized by the Markkula Center for Applied Ethics at Santa Clara University.
Oct 24, 2024	💪 I was named one of the most powerful people in artificial intelligence by Business Insider.
Oct 11, 2024	📄 Check out our new preprint characterizing bias in LLMs: Stereotype or Personalization? User Identity Biases Chatbot Recommendations, led by Anjali Kantharuban and Jeremiah Milbauer. You can also find tweet threads here and here.
Aug 24, 2024	🏳️‍⚧️ My gender identity is trans nonbinary! Please use they/them pronouns to refer to me, and Professor or Dr. if you need an honorific.
Aug 21, 2024	📄 Check out our Nature Commentary about the new AI Energy Star initiative, led by Sasha Luccioni: Light bulbs have energy ratings — so why can’t AI chatbots?
Aug 14, 2024	🏆 Our papers OLMo: Accelerating the Science of Language Models and Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research were awarded Best Theme Paper and Best Resource Paper, respectively, at ACL 2024!
Jul 24, 2024	📈 Nupoor Gandhi will be presenting our work using LLMs to analyze municipal climate action plans at the ClimateNLP workshop at ACL 2024!
Jul 01, 2024	🍎 I’m giving a mater class on reducing the environmental footprint of large language models at the 2024 Deep Learning School at Université Côte d’Azur.