Vision & Language

Juan C. Pérez
Laura Bravo-Sánchez
Pablo Arbeláez
Collaboration with Edgar Margffoy-Tuay and Emilio Botero.

This line of research focuses on the intersection of computer vision and natural language understanding. In particular, we study tasks that require visual input in the form of images or video as well as linguistic input in the form of text or audio. We aim at designing novel methods and algorithms that can jointly process these diverse and dissimilar types of information.

Presentation Video

Publications

Dynamic Multimodal Instance Segmentation Guided by Natural Language Queries

E Margffoy-Tuay, JC Pérez, E Botero, P Arbeláez

ECCV 2018

"Ideas are easy. It's the execution of ideas that really separates the sheep from the goats" - Sue Grafton

Cra. 1 E No. 19A - 40.  

Mario Laserna Building - School of Engineering 

Bogotá, Colombia

Cód. Postal: 111711

+(571) 332 4327, 332 4328, 332 4329


SOCIAL NETWORKS