Skip to main navigation Skip to Content

Computer Science

University of Toronto
  • U of T Portal
  • Site Map
  • Contact
  • About DCS At U of T
    • Why Study CS at U of T
    • Career Options
    • History of DCS
    • Giving to DCS
    • Gr8 Designs for Gr8 Girls
    • Information for Prospective Undergraduate Students
    • Information for Prospective Graduate Students
    • Computer Science at UofT Mississauga
    • Computer Science at UofT Scarborough
    • Contact
  • Programs & Courses
    • Undergraduate Program
    • Undergraduate Courses
    • Graduate Program
    • Graduate Courses
  • Research
    • Research Groups
    • Industrial Relations
    • Research In Action Showcase
    • Research Profiles
    • Research Sponsors & Partners
    • Awards and Accolades
  • Our People
    • Faculty
    • Staff
    • Post Docs and Visitors
    • M.Sc. Students
    • Ph.D. Students
    • In Memoriam
    • People Profiles
    • Alumni and Friends
    • Women in Computer Science
    • Graduate Student Society
    • Undergraduate Student Union
  • News & Events
    • Current News
    • DCS Events Calendar
    • DCS in the Media
    • Grad Announcements
    • Undergrad News
    • Distinguished Lecture Series
    • Awards and Accolades
    • RSS Feed - News
    • RSS Feed - Events
You are viewing : > Home > Research > Research Profiles > Using Language to Learn Structure Appearance Models for Image Annotation
  • Computational Analysis of Ice Hockey Gameplay
  • Online Music Recommendation and the Problem of Missing Ratings
  • Speech Summarization
  • Novel Interfaces for Molecular Visualization
  • Using a Physical Object to Control a Virtual 3D Object
  • Amigo: Proximity-Based Authentication
  • Grapevine
  • Modelling Complex Financial Instruments
  • Using Language to Learn Structure Appearance Models for Image Annotation
  • Stylization of Character Motion
  • ILoveSketch
  • JSCOOP: A High-Level Concurrency Framework for Java
  • Dezombify
  • SPIDER Data Cleaning Tool
  • Cognitive Orthosis for Assisting Activities in the Home
  • NAViGaTOR Visualizing Protein Interaction Networks
  • Friend Forecaster: Cellphone Software Aiding Memory for Games

Using Language to Learn Structure Appearance Models for Image Annotation

Taj Mahal Vision Research
Manual annotation of new images in large image collections is prohibitively expensive for commercial databases, and overly time-consuming for the home photographer. However, low-cost imaging, storage and communication technologies have already made accessible millions of images that are meaningfully associated with text in the form of captions or keywords. It is tempting to see these pairings of visual and linguistic representations as a kind of distributed Rosetta Stone from which we may learn to automatically translate between the names of things and their appearances. Our algorithm uses the repetition of appearance across an unstructured collection of captioned images and a measure of correspondence with caption words to learn to recognize named objects.

Sven Dickinson Faculty
Yulia Eskin Undergraduate Student
Afsaneh Fazly Post Doctoral Fellow
Mike Jamieson Graduate Student
Suzanne Stevenson Faculty

More Research Profiles

Computer Science

All rights reserved copyright Computer Science, University of Toronto 2010