Our Datasets

For detailed eplanations of datasets please see readme.doc files in zip files.

Dataset Name Explanation Download Size
500 Columns 2000 Tweets 500 columns and 2000 tweets from 10 different Columnists 500koseyazisi2000tweet.zip 1.4 MB
2000 Parsed Sentence 2000 Parsed Turkish Sentences 2000cumle_oge.rar 0.1 MB
20 million Turkish Tweets 20 million Turkish Tweets 20milyonTweet.rar 640 MB
1200 words Most used Words in Turkish Children Literature 1200kelime.rar 0.1 MB
2 million Turkish Tweets 2 million Turkish Tweets 2milyonTweet.rar 91 MB
42 K news 42K Turkish news texts in 13 classes 42bin_haber.rar 42 MB
3000 Tweets 3000 Turkish Tweets for sentiment analysis (3 classes) 3000tweets.rar 1.6 MB
2500 Columns 2500 Columns from 50 different Columnists 2500koseyazisi.rar 5 MB
1500 Columns 1500 Columns from 30 different Columnists, in raw and arff formats. 30Columnists.zip 7 MB
157 blogs 157 Blogs for sentiment analysis, in raw and arff formats. ruh_hali.zip 0.7 MB
105 Reviews 105 positive, negative and neutral movie reviews, in raw and arff formats.  film_yorumlari.zip 0.2 MB
69 Authors 910 articles from 69 different Turkish authors, in raw and arff formats. 69yazar.zip 3.2 MB
1150 News 1150 Turkish news texts in 5 classes, in raw and arff formats. 1150haber.rar 3.7 MB
630 Articles 630 articles from 18 different Turkish authors, in raw and arff formats.  630koseyazisi.zip 3.5 MB
270 Articles 270 articles from 18 different Turkish authors, in raw and arff formats.  270koseyazisi.zip 1.6 MB
140 Poems 140 poems from 7 different Turkish poets, in raw and arff formats.  140siir.zip 0.4 MB
90 Articles 90 articles from 9 different Turkish authors, in raw and arff formats.  90koseyazisi.zip 0.7 MB
75 News 75 Turkish news texts in 5 classes, in raw and arff formats. 75haber.zip 0.4 MB
Word Meanings The meanings of Turkish 10535 words. Kelime_anlamlari.zip 0.3 MB
Term Groups 2750 Turkish terms in 27 term groups. Terim_gruplari.zip 0.1 MB
Semantic Roles 5849 Turkish words, word types, suffixes and semantic roles in arff format. semantik_rol.zip 0.1 MB
28567 Turkish Web Pages 28567 Turkish web pages in 758 hierarchical classes under 13 main category. trk_original.zip 173 MB
Turkish syllable statistics Turkish syllable statistics extracted from 24 million Turkish words hece_ist.zip 0.2 MB
Turkish Semantic Relation Database 127203 semantic relations automatically derived from TDK and Viki dictionaries. TDK_viki.rar 0.8 MB

 

Note: To work with arff files please download the weka software from  http://www.cs.waikato.ac.nz/ml/weka/ link.

 

_LANGUAGE

_TURKISH _ENGLISH

_LOGIN

_LOST_PASSWORD
 
© 2009-... Kemik@ce.yildiz.edu.tr - phpComasy Valid XHTML 1.0 Transitional
powered by phpComasy