Back to Main Conference 2008
LREC 2008main

The UJIpenchars Database: a Pen-Based Database of Isolated Handwritten Characters

Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC 2008)

DOI:10.63317/27g7q8ydf7gb

Abstract

The availability of large amounts of data is a fundamental prerequisite for building handwriting recognition systems. Any system needs a test set of labelled samples for measuring its performance along its development and guiding it. Moreover, there are systems that need additional samples for learning the recognition task they have to cope with later, i.e. a training set. Thus, the acquisition and distribution of standard databases has become an important issue in the handwriting recognition research community. Examples of widely used databases in the online domain are UNIPEN, IRONOFF, and Pendigits. This paper describes the current state of our own database, UJIpenchars, whose first version contains online representations of 1,364 isolated handwritten characters produced by 11 writers and is freely available at the UCI Machine Learning Repository. Moreover, we have recently concluded a second acquisition phase, totalling more than 11,000 samples from 60 writers to be made available in short as UJIpenchars2.

Details

Paper ID
lrec2008-main-467
Pages
N/A
BibKey
llorens-etal-2008-ujipenchars
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
2-9517408-4-0
Conference
Sixth International Conference on Language Resources and Evaluation
Location
Marrakech, Morocco
Date
28 May 2008 30 May 2008

Authors

  • DL

    D. Llorens

  • FP

    F. Prat

  • AM

    A. Marzal

  • JV

    J. M. Vilar

  • MC

    M. J. Castro

  • JA

    J. C. Amengual

  • SB

    S. Barrachina

  • AC

    A. Castellanos

  • SE

    S. España

  • JG

    J. A. Gómez

  • JG

    J. Gorbe

  • AG

    A. Gordo

  • VP

    V. Palazón

  • GP

    G. Peris

  • RR

    R. Ramos-Garijo

  • FZ

    F. Zamora

Links