SCUT-COUCH2009 Database Specification

Datasets

Number of sets   Categories per set   Total samples   Details
130              8 888                1 155 440       8 888 frequently used Chinese words
5                44 208               221 040         Including phrases of "The Contemporary Chinese Dictionary", the fourth edition
10               17 366               173 660         Frequently used phrases in PowerWord
188              3 755                705 940         3 755 characters in GB Set1
195              3 008                586 560         3 008 characters in GB Set2
195              52                   10 140          52 English upper-case and lower-case alphabets
195              10                   1 950           10 numeric digits
130              122                  23 790          122 frequently used symbols
130              2 010                261 300         2 010 Pinyin
130              1 384                179 920         1 384 traditional characters in GB Set1
65               5 401                351 065         5 401 BIG5 traditional Chinese
1                2 632                159 866         8 809 online text lines
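
For most rows, the listed total appears to be the number of sets multiplied by the number of categories per set. The following is a minimal sketch of that consistency check, assuming this product relationship holds for the isolated character, word, and symbol subsets. The names `ROWS` and `check_totals` are hypothetical and are not part of the database distribution. The online text-line row is left out because its total is evidently counted differently.

```python
# Figures copied from the specification table:
# (details, number of sets, categories per set, listed total).
# The text-line row is omitted: its total is not a sets-times-categories count.
ROWS = [
    ("8 888 frequently used Chinese words", 130, 8888, 1155440),
    ("Contemporary Chinese Dictionary phrases", 5, 44208, 221040),
    ("Frequently used phrases in PowerWord", 10, 17366, 173660),
    ("3 755 characters in GB Set1", 188, 3755, 705940),
    ("3 008 characters in GB Set2", 195, 3008, 586560),
    ("52 English letters", 195, 52, 10140),
    ("10 numeric digits", 195, 10, 1950),
    ("122 frequently used symbols", 130, 122, 23790),
    ("2 010 Pinyin", 130, 2010, 261300),
    ("1 384 traditional characters in GB Set1", 130, 1384, 179920),
    ("5 401 BIG5 traditional Chinese", 65, 5401, 351065),
]


def check_totals(rows):
    """Return (details, product, listed) for rows where
    sets * categories per set differs from the listed total."""
    mismatches = []
    for details, num_sets, per_set, listed in rows:
        product = num_sets * per_set
        if product != listed:
            mismatches.append((details, product, listed))
    return mismatches


if __name__ == "__main__":
    for details, product, listed in check_totals(ROWS):
        print(f"{details}: product {product}, listed {listed}")
```

With the figures as listed, the symbols row is the only one where the product (15 860) differs from the listed total (23 790). The listed total would correspond to 195 sets rather than 130, so that entry may be worth confirming against the paper or the downloaded data.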

note

The SCUT-COUCH2009 database is freely available to the academic community for research purposes. To obtain it, fill in a letter of commitment and send it to us by email (lianwen.jin@gmail.com). Once your letter has been received and approved, we will send you the decompression password needed to access the database.

For more details on the SCUT-COUCH2009 database, please refer to our paper: Lianwen Jin, Yan Gao, Gang Liu, Yunyang Li, Kai Ding. "SCUT-COUCH2009----A Comprehensive Online Unconstrained Chinese Handwriting Database and Benchmark Evaluation", to appear in International Journal of Document Analysis and Recognition, 2010.