Academia Sinica Balanced Corpus of Modern Chinese

"Academia Sinica Balanced Corpus of Modern Chinese", simplified as Sinica Corpus, is designed for analyzing modern Chinese. Every text in the corpus is segmented and each segmented word is tagged with its part-of-speech. Texts are collected from different areas and classified according to five criteria: genre, style, mode, topic, and source. Therefore, this corpus is a representative sample of modern Chinese language.

http://www.sinica.edu.tw/SinicaCorpus/

 

 

 

 


There are no documents in this category