Word spotting in the wild

Research output: Contribution to journal › Conference article › Research › peer-review

Kai Wang
Belongie, Serge

We present a method for spotting words in the wild, i.e., in real images taken in unconstrained environments. Text found in the wild has a surprising range of difficulty. At one end of the spectrum, Optical Character Recognition (OCR) applied to scanned pages of well formatted printed text is one of the most successful applications of computer vision to date. At the other extreme lie visual CAPTCHAs - text that is constructed explicitly to fool computer vision algorithms. Both tasks involve recognizing text, yet one is nearly solved while the other remains extremely challenging. In this work, we argue that the appearance of words in the wild spans this range of difficulties and propose a new word recognition approach based on state-of-the-art methods from generic object recognition, in which we consider object categories to be the words themselves. We compare performance of leading OCR engines - one open source and one proprietary - with our new approach on the ICDAR Robust Reading data set and a new word spotting data set we introduce in this paper: the Street View Text data set. We show improvements of up to 16% on the data sets, demonstrating the feasibility of a new approach to a seemingly old problem.

Original language	English
Journal	Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Issue number	PART 1
Pages (from-to)	591-604
Number of pages	14
ISSN	0302-9743
DOIs	https://doi.org/10.1007/978-3-642-15549-9_43
Publication status	Published - 2010
Externally published	Yes
Event	11th European Conference on Computer Vision, ECCV 2010 - Heraklion, Crete, Greece Duration: 10 Sep 2010 → 11 Sep 2010

Conference

Conference	11th European Conference on Computer Vision, ECCV 2010
Country	Greece
City	Heraklion, Crete
Period	10/09/2010 → 11/09/2010
Sponsor	DAGM, IBM, NICTA

ID: 302047865

Department of Computer Science

Word spotting in the wild

Conference