Thursday, August 25, 2005

Corpora for text mining

Several corpora are available for research and development of text-mining techniques:
RCV1 Reuters Corpus
TREC Collections
Linguistic Data Consortium
ICAME (International Computer Archive of Modern and Medieval English)
TEI (Text Encoding Initiative)
Corpus Linguistics Links
