Learning from Imperfect Human Feedback: a Tale from Corruption-Robust Dueling

Publication
ICLR, 2025
Fan Yao
Ph.D. student at CS@UVa

A theory-obsessed pragmatist, a crazy tennis player, and an underachieving daydreamer.