Skip to main navigation Skip to Content

Computer Science

University of Toronto
  • U of T Portal
  • Site Map
  • Contact
  • About DCS At U of T
    • Why Study CS at U of T
    • Career Options
    • History of DCS
    • Giving to DCS
    • Information for Prospective Undergraduate Students
    • Information for Prospective Graduate Students
    • Computer Science at UofT Mississauga
    • Computer Science at UofT Scarborough
    • Contact
  • Programs & Courses
    • Prospective Undergraduate Students
    • Current Undergraduate Students
    • Prospective Graduate Students
    • Current Graduate Students
  • Research
    • Research Groups
    • Industrial Relations
    • Research In Action Showcase
    • Research Profiles
    • Research Sponsors & Partners
    • Awards and Accolades
    • UTRECS - Undergraduate Toronto Research Experience in Computer Science
  • Our People
    • Faculty
    • Staff
    • In Memoriam
    • People Profiles
    • Alumni and Friends
    • Women in Computer Science
    • Graduate Student Society
    • Undergraduate Student Union
    • Undergraduate Artificial Intelligence Group
  • News & Events
    • Current News
    • DCS Events Calendar
    • DCS in the Media
    • @dcs Newsletter
    • Undergrad News
    • Distinguished Lecture Series
    • Awards and Accolades
    • DCS Facebook Page
    • DCS Twitter Feed
    • RSS Feed - News
    • RSS Feed - Events
You are viewing: > Home > News & Events > DCS Events Calendar > Theory & Machine Learning Seminar: May 28
  • Current News
  • DCS Events Calendar
  • DCS in the Media
  • @dcs Newsletter
  • Undergrad News
  • Distinguished Lecture Series
  • Awards and Accolades
  • DCS Facebook Page
  • DCS Twitter Feed
  • RSS Feed - News
  • RSS Feed - Events

Theory & Machine Learning Seminar: May 28

Event date: Monday, May 28, 2012, at 11:10 AM
Location: GB244

Speaker: Avrim Blum
Carnegie Mellon University

Title: Active (and Passive) Property Testing

Abstract:

Property testing concerns the problem of determining whether an unknown function has a certain desirable property, or is far from having that property, by making only a small number of queries to it. One of the motivations for testing is that it can be a useful preprocessing step before actually aiming to learn the function.  However, machine learning applications often do not allow one to query for the label of arbitrary points, a capability needed by most property testing algorithms. Instead, the dominant query paradigm in machine learning--called active learning--is one where the algorithm may ask for examples to be labeled, but only from among those  appearing in a given large unlabeled sample (e.g., images or documents appearing on the web, patients in a medical study, etc).

In this work, we consider property testing under these constraints and show testing can still yield substantial benefits in this setting, for a number of important properties for learning. We also give new results for passive testing, where the algorithm cannot query at all and is just given a small random sample. For example, for testing whether the function is a linear separator in R^n, we show that both active and passive testing can be done with O(sqrt{n}) queries, substantially less than the Omega(n) needed to learn such a function well. We also give a method for combining testable properties out of others, as well as develop notions of the "testing dimension" of a given property that characterize the intrinsic number of label requests needed to test that property.

This is joint work with Nina Balcan, Eric Blais, and Liu Yang.

Computer Science

All rights reserved copyright Computer Science, University of Toronto