About CVL OCR DB
CVL OCR DB is a public annotated image dataset of text in natural scenes. Images include signboards, shop names, traffic signs, jumbo posters etc. CVL OCR DB is suitable for evaluating both text detection and text recognition methods. Two types of annotation are supported: binary annotation and polygon annotation. Dataset includes additional cropped single character images suitable for OCR training and testing.
Features:
- 120 binary annotated (text/non-text) images of text in natural scenes (NEW!!!)
- 341 polygon annotated images of text in natural scenes
- 7014 cropped single character images suitable for OCR training and testing
- all images are in JPEG image format
- includes TextAnnotator - a powerful annotation software for annotating your own images
- publicly available free of charge
Authors: Andrej Ikica & Peter Peer
Current version:
1.3
Year of publication:
2011
Binary annotation (NEW!!!)
Each binary annotated image in CVL OCR DB dataset has a corresponding binary (text/non-text) BMP image (see figure below).
|
|
(a) |
(b) |
Figure: An example of binary annotation.
(a) Original JPEG image. (b) Binary annotated BMP image. |
Polygon annotation & cropped character images
Instead of annotating text regions with rectangles all text regions in CVL OCR DB images are annotated with poylgons (see figure below). Each polygon annotated image in CVL OCR DB has a corresponding XML annotation file (see Xml annotation file for details). Additionally, images of cropped characters are available for each CVL OCR DB image, thus making CVL OCR DB also suitable for OCR training and testing.
|
|
|
(a) |
(b) |
(c) |
Figure: An example of polygon annotation.
(a) Original JPEG image. (b) Text region annotated with n-polygon. (c) Cropped characters. |
Material & documentation
Material & documentation
- CVL OCR DB dataset & TextAnnotator will be sent to you personally after we receive hand signed license agreement from you
- CVL OCR DB documentation:
- a journal paper: A.Ikica, P. Peer, "CVL OCR DB, an annotated image database of text in natural scenes, and its usability", Info. MIDEM, vol.41, no.2, pp.150-154, 2011 (errata sheet)
- CVL OCR DB usage example:
- a journal paper: A. Ikica, P. Peer, "SWT voting-based color reduction for text detection in natural scene images", EURASIP Journal on Advances in Signal Processing 2013(95), 2013
- a conference paper: A. Ikica, P. Peer, "An improved edge profile based method for text detection in images of natural scenes", EUROCON - International Conference on Computer as a Tool (EUROCON), 2011 IEEE, pp.1-4, 27-29 April 2011
Additional documentation:
Obtaining CVL OCR DB
Terms of usage:
- license agreement must be signed
- non-commercial usage
- research in computer vision
- a proper citation must be included in your publications
Contact:
Send the scan of the filled and hand signed license agreement to andrej.ikica@fri.uni-lj.si in order to obtain the CVL OCR DB dataset.
|