
Computer Vision
Natural Language Processing
Research
Tightly Connecting Vision and Language
Remarkable progress has been made at the intersection of vision and language. While showing great promise, current vision and language models may only weakly “connect” the two modalities and often fail in the wild. In this talk, Goggle’s Soravit Changpinyo will present recent efforts aiming to bridge this gap along two dimensions: informativeness and controllability. […]
Read More