SinaiSACorpus

Resource type:

Corpora

Description:

This corpus was prepared by the SINAI group in December 2008. SINAI SA (Sentiment Analysis) was created by crawling the Amazon website. Nearly 2,000 comments about different cameras were extracted.

Structure: The SINAI SA corpus contains 5 directories, one per star rating (e.g., directory 1 contains the comments rated with one star). Each directory contains one plain-text file per document/comment.

The number of comments per star rating is as follows:

    • 1 star: 78 comments
    • 2 stars: 67 comments
    • 3 stars: 97 comments
    • 4 stars: 411 comments
    • 5 stars: 1,290 comments

Total: 1,943 comments
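
The directory layout described above can be read with a few lines of Python. This is only a minimal sketch, not an official loader: it assumes the five directories are literally named "1" to "5", that every file inside them is one comment, and that the text can be decoded as UTF-8 (undecodable bytes are replaced). The function names `load_corpus` and `count_by_stars` are hypothetical and do not ship with the corpus.

```python
from collections import Counter
from pathlib import Path


def load_corpus(root):
    """Return a list of (stars, text) pairs read from root/1 ... root/5."""
    samples = []
    for stars in range(1, 6):
        # Assumption: each rating directory is named after its star count.
        star_dir = Path(root) / str(stars)
        if not star_dir.is_dir():
            continue  # tolerate a missing rating directory
        for path in sorted(star_dir.iterdir()):
            if path.is_file():
                # Assumption: UTF-8 text; undecodable bytes are replaced.
                text = path.read_text(encoding="utf-8", errors="replace")
                samples.append((stars, text))
    return samples


def count_by_stars(samples):
    """Count how many comments carry each star rating."""
    return Counter(stars for stars, _ in samples)
```

If the unpacked archive matches these assumptions, calling `count_by_stars(load_corpus(path_to_unpacked_corpus))` should give per-rating counts that can be compared with the figures listed above.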

The number of comments per camera is as follows:

Camera           Comments
CanonA590IS      400
CanonA630        300
CanonSD1100IS    426
KodakCx7430      64
KodakV1003       95
KodakZ740        155
Nikon5700        119
Olympus1030SW    168
PentaxK10D       126
PentaxK200D      90
Total            1,943

Rushdi-Saleh, M., Martín-Valdivia, M. T., Montejo-Ráez, A., & Ureña-López, L. A. (2011). Experiments with SVM to classify opinions in different domains. Expert Systems with Applications.
http://dx.doi.org/10.1016/j.eswa.2011.05.070

Resource files:

SINAI-SA-corpus.zip