Interactions between term weighting and feature selection methods on the sentiment analysis of Turkish reviews


PARLAR T., ÖZEL S. A., Song F.

17th International Conference on Intelligent Text Processing and Computational Linguistics, CICLing 2016, Konya, Türkiye, 3 - 09 Nisan 2016, cilt.9624 LNCS, ss.335-346, (Tam Metin Bildiri) identifier

  • Yayın Türü: Bildiri / Tam Metin Bildiri
  • Cilt numarası: 9624 LNCS
  • Doi Numarası: 10.1007/978-3-319-75487-1_26
  • Basıldığı Şehir: Konya
  • Basıldığı Ülke: Türkiye
  • Sayfa Sayıları: ss.335-346
  • Anahtar Kelimeler: Feature selection, Sentiment analysis, Term weighting
  • Hatay Mustafa Kemal Üniversitesi Adresli: Evet

Özet

Term weighting methods assign appropriate weights to the terms in a document so that more important terms receive higher weights for the text representation. In this study, we consider four term weighting and three feature selection methods and investigate how these term weighting methods respond to the reduced text representation. We conduct experiments on five Turkish review datasets so that we can establish baselines and compare the performance of these term weighting methods. We test these methods on the English reviews so that we can identify their differences with the Turkish reviews. We show that both tf and tp weighting methods are the best for the Turkish, while tp is the best for the English reviews. When feature selection is applied, tf * idf method with DFD and χ2 has the highest accuracies for the Turkish, while tf * idf and tp methods with χ2 have the best performance for the English reviews.