The junior research group aims at advancing multi-modal learning from sound, vision, and text for video understanding. It is funded by the BMFTR (01IS24060), and hosted at the Chair of Computer Vision and Artificial Intelligence at the Technical University of Munich. |
News |
|
|