Fereshte Khani

I am a researcher at OpenAI. Before joining OpenAI, I was at Microsoft and before that I got my Ph.D. at Stanford under supervision of Percy Liang and my M.S. at MIT under supervision of Martin Rinard.

Research

Alignment. Alignment is ensuring machine learning models conform to human values and intentions. In NeurIPS2023, we focus on collaborative alignment, where multiple individuals engage with the model and each other to align the model to their preference without interferring other users. I envision a future where NLP models are developed in a collaborative fashion, similar to open source software or Wikipedia, benefiting from diverse user inputs for improved quality and fairness. For this scenario to materialize, we need to help users to convey knowledge, and verify the impact of their proposed changes to models, similar to “diffs” or “regression tests”. CoDev is a small step in this direction. In TACL2018, we showed how human use pragmatics, planning, and inference in dialogue to convey information.

Robustness. One main concern about utilizing ML models in the real world is their poor performance in new domains, even when the new domain is slightly different from the training domain. In my previous work, we have shown how to leverage unlabeled data through pretraining and finetuning to achieve better performance in a new domain (ICLR2021); and how to use unlabeled data to make a model robust against spurious features (FAccT2021). In ACL2023, we showed how to use LLMs as a large pool of unlabeled data to augment groups that the model has low performance on.

Reliability. How can we have a reliable model that knows when it does not know? I am interested in selective classification where a model can abstain if it cannot provide a true prediction with high probability. Previously, I proposed a model that only predicts if all models consistent with training data unanimously agree, resulting in 100% precision (ACL2016). However, as models become more complicated we cannot do the unanimous agreement anymore! In (Neurips2022), we show how to find two disjoint models and only predicts if those two model agree.

Fairness. ML Models can lead to discrimination, and in light of their increasing prevalence, it is necessary to address this problem (see this post). In my research I try to understand how we can investigate and mitigate discrimination of ML models, and how to study the feedback loops created by ML models. Previously I have shown unexpected reasons that cause ML models to exhibit discrimination. For example, adding the same amount of feature noise to all individuals (ICML2020, blog post), or the inductive bias in overparameterized regimes (FAccT2021, blog post) can lead to discrimination. In addition, in scenarios when protected groups are not known a priori, or there is an exponential number of such groups, I showed that what kind of statistical measure is possible to measure loss discrepancy among groups (SafeML,ICLR2019).

Honors and Awards

EDGE Doctoral Fellowship, 2016-2022.
Ranked 3rd in the Asia Regional ACM-ICPC Contest, Iran, 2010.
Silver Medal in International Olympiad in Informatics (IOI) 2009, Plovdiv, Bulgaria.
Selected as a national scientific elite and the recipient of the grant from the Iranian National Elites Foundation 2009 - 2013.
Gold Medal in Iranian National Olympiad in Informatics (INOI), 2008.
Awarded winter school grant from Chinese University of Hong Kong, Hong Kong, 2011

Teaching and Services

Teaching Assistant for Machine Learning, Stanford, Summer 2020, Fall 2020 [review of linear algebra][probability theory review]
Teaching Assistant for Data Structure (2011), Artificial Intelligence (2011), and Design of Algorithm (2012), Sharif University of Technology.
Lecturer in Summer Math Camp , Farzanegan Highschool, 2007 - 2013
Teaching students for Iranian National Olympiad in Informatics (INOI), 2009 - 2011
Contest Designer of Iranian National Olympiad in Informatics (first round), 2011

Reviewer for NeurIPS, ICML, ICLR, ALT, ACL
Stanford computer science admission committee, 2018

Speaker and panelist in trustML-Unlimited, ICLR 2023
Panelist on Application of AI with Peter Norvig at the nightcap, 2023
Co-Moderator in Women in Data Science (WiDS) Conference (NLP breakout), 2017
Stanford CS PhD representative at Grace Hopper conference, 2017
Member of the Scientific Committee, Computer Vision Workshop, Iran, 2012

Other activities

In my free time, I like reading, writing, running, hiking and backpacking.