Corpora for text mining
A number of corpora are available for research and development of text-mining techniques:
RCV1 Reuters Corpus
TREC Collections
Linguistic Data Consortium
ICAME (International Computer Archive of Modern and Medieval English)
TEI (Text Encoding Initiative)
Corpus Linguistics Links