SCUT Online Handwritten Chinese Character Testing Database (SCUT-onHCCTestDB)
Introduction of SCUT-onHCCTestDB
SCUT-onHCCTestDB is an online handwritten Chinese character database. It contains five datasets that are simplified Chinese dataset (denoted as SimpleChar) in GB2312-80 standard, traditional Chinese dataset (denoted as TradChar) in Big5 standard, mixed simplified and traditional Chinese dataset (denoted as SimpTradChar), rarely-used Chinese character dataset (denoted as RarelyUsedChar), and symbol dataset (denoted as SymbolChar). Each of the above dataset includes 5 subsets (indexed from 1 to 5) respectively. It is worth mentioning that the SymbolChar dataset comprises uppercase letters, lowercase letters, digits, punctuation, common symbols, and so on.
The SCUT-onHCCTestDB is publicly available for academic research usage. These dataset files are packed in ZIP format that you can download by clicking the links below.
※ onHCCTestDB-SimpleChar (23.5M)
※ onHCCTestDB-TradChar (24.8M)
※ onHCCTestDB-SimpTradChar (24.3M)
※ onHCCTestDB-RarelyUsedChar (25.0M)
※ onHCCTestDB-SymbolChar (4.5M)
Contact:
Email:lianwen.jin@gmail.com