SCUT-COUCH2009 Database Specification
datasets |
num of sets |
category num per set |
data total num |
details |
samples |
download |
|
130 |
8 888 |
1 155 440 |
8 888 frequently used Chinese words |
||||
5 |
44 208 |
221 040 |
Including phrases of "The Contemporary Chinese Dictionary", the fourth edition |
||||
10 |
17 366 |
173 660 |
Frequently used phrases in PowerWord |
||||
188 |
3 755 |
705 940 |
3 755 characters in GB Set1 |
||||
195 |
3 008 |
586 560 |
3 008 characters in GB Set2 |
||||
195 |
52 |
10 140 |
52 English upper-case and lower-case alphabets |
||||
195 |
10 |
1 950 |
10 numeric digits |
||||
130 |
122 |
23 790 |
122 frequently used symbols |
||||
130 |
2 010 |
261 300 |
2 010 Pinyin |
||||
130 |
1 384 |
179 920 |
1 384 traditional characters in GB Set1 |
||||
65 |
5 401 |
351 065 |
5 401 BIG5 traditional Chinese |
||||
1 |
2 632 |
159 866 |
8 809 online text lines |
note:
The SCUT-COUCH2009 database is public free to the academic community for research purpose usage. You should fill in a letter of commitment and send it via email to us (lianwen.jin@gmail.com). We will give you the decompression password to access the database after your letter has been received and approved.
For more details of SCUT-COUCH2009 database, please refer our paper: Lianwen Jin, Yan Gao, Gang Liu, Yunyang Li, Kai Ding. "SCUT-COUCH2009----A Comprehensive Online Unconstrained Chinese Handwriting Database and Benchmark Evaluation", to appear in International Journal of Document Analysis and Recognition. 2010.