Top
Back to All Events

Toronto Vision Seminar: Dima Damen

  • Bahen Centre for Information Technology, Room 5187 40 Saint George Street Toronto, ON, M5S 2E4 Canada (map)

Date: Friday, November 22

Time: 3-4 p.m.

Location: BA5187 and online via Zoom. Enlarge image to scan QR code for Zoom link or visit Zoom meeting registration webpage.

There is no registration required to attend this event in person. However, seating is limited, so arriving early is recommended.

Talk title: “Opportunities in Egocentric Vision”

Abstract: Forecasting the rise of wearable devices equipped with audio-visual feeds, this talk will present opportunities for research in egocentric video understanding. The talk argues for new ways to foresee egocentric videos as partial observations of a dynamic 3D world, with objects out of sight but not out of mind. I’ll review new data collection and annotation that merges video understanding with 3D modelling, showcasing current failures of VLMs in understanding the perspective outside the camera’s field of view — a task trivial for humans.

Bio: Dima Damen is a Professor of Computer Vision at the University of Bristol and Senior Research Scientist at Google DeepMind. Dima is currently an EPSRC Fellow (2020-2025), focusing her research interests in the automatic understanding of object interactions, actions and activities using wearable visual (and depth) sensors. She is best known for her leading works in Egocentric Vision, and has also contributed to novel research questions including mono-to-3D, video object segmentation, assessing action completion, domain adaptation, skill/expertise determination from video sequences, discovering task-relevant objects, dual-domain and dual-time learning as well as multi-modal fusion using vision, audio and language.

She is the project lead for EPIC-KITCHENS, the seminal dataset in egocentric vision, with accompanying open challenges and follow-up works: EPIC-Sounds, VISOR and EPIC Fields. She is part of the large-scale consortium effort Ego4D and Ego-Exo4D. Dima is Associate Editor-in-Chief of IEEE TPAMI and associate editor of IJCV, and was a program chair for ICCV 2021. She is frequently an Area Chair in major conferences and was selected as Outstanding Reviewer in CVPR2021, CVPR2020, ICCV2017, CVPR2013 and CVPR2012.

Dima received her PhD from the University of Leeds (2009), joined the University of Bristol as a Postdoctoral Researcher (2010-2012), Assistant Professor (2013-2018), Associate Professor (2018-2021) and was appointed as chair in August 2021. She supervises 10 PhD students, 4 Visiting PhD students and 2 postdoctoral researchers. At the University of Bristol, Dima leads the Machine Learning and Computer Vision (MaVi) lab, and is the university chair of the Research Data Storage Management Executive Board.

At Google DeepMind, Dima is part of the Vision team, led by Andrew Zisserman, focusing on video understanding research. Her latest contribution is to the Perception Test project on measuring perception in AI models.