Multiple instance learning with manifold bags

Research output: Contribution to journalConference articleResearchpeer-review

In many machine learning applications, labeling every instance of data is burdensome. Multiple Instance Learning (MIL), in which training data is provided in the form of labeled bags rather than labeled instances, is one approach for a more relaxed form of supervised learning. Though much progress has been made in analyzing MIL problems, existing work considers bags that have a finite number of instances. In this paper we argue that in many applications of MIL (e.g. image, audio, etc.) the bags are better modeled as low dimensional manifolds in high dimensional feature space. We show that the geometric structure of such manifold bags affects PAC-learnability. We discuss how a learning algorithm that is designed for finite sized bags can be adapted to learn from manifold bags. Furthermore, we propose a simple heuristic that reduces the memory requirements of such algorithms. Our experiments on real-world data validate our analysis and show that our approach works well.

Original languageEnglish
JournalProceedings of the 28th International Conference on Machine Learning, ICML 2011
Pages (from-to)81-88
Number of pages8
Publication statusPublished - 2011
Externally publishedYes
Event28th International Conference on Machine Learning, ICML 2011 - Bellevue, WA, United States
Duration: 28 Jun 20112 Jul 2011

Conference

Conference28th International Conference on Machine Learning, ICML 2011
CountryUnited States
CityBellevue, WA
Period28/06/201102/07/2011
SponsorAmazon.com, Inc. (Biz), NSF, Microsoft, Google, Yahoo! Labs

ID: 301831085