Gender Bias in Pre-Trained Vision-and-Language Models
Advisor: Yonatan Bisk
We analyze intra- and inter-modality gender biases encoded by pre-trained vision-and-language models, which often reinforce stereotypes rather than faithfully describe the visual scene.
Multimodal ASR for Recovering Noisy and Corrupted Speech
Collaborator: Ramon Sanabria; Advisors: Desmond Elliott, Florian Metze
We investigate the utility of multimodal ASR under noisy conditions, showing that the visual context can be leveraged to recover masked words in the speech signal.