OffendES_spans is a Spanish corpus created in the spirit of the original OffendES dataset, but it additionally includes offensive spans that were labeled automatically using SHARE, a lexicon of offensive terms and expressions. The corpus consists of 11,035 comments annotated with offensive spans.
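To illustrate what lexicon-based span labeling can look like, here is a minimal sketch in Python. It is not the procedure used to build this corpus: the function name label_offensive_spans, the toy lexicon, the case-insensitive whole-word matching, and the (start, end, text) output format are all assumptions made for the example, and the actual file format of the corpus and of SHARE may differ.

```python
import re

def label_offensive_spans(comment, lexicon):
    """Return (start, end, text) character spans where lexicon entries occur.

    Hypothetical helper: matching is case-insensitive and restricted to
    whole words, which is an assumption and not the documented SHARE method.
    """
    if not lexicon:
        return []
    # Try longer entries first so multi-word expressions win over sub-terms.
    terms = sorted(lexicon, key=len, reverse=True)
    pattern = re.compile(
        r"(?<!\w)(?:" + "|".join(re.escape(t) for t in terms) + r")(?!\w)",
        re.IGNORECASE,
    )
    return [(m.start(), m.end(), m.group(0)) for m in pattern.finditer(comment)]

# Toy two-entry lexicon invented for this example (not taken from SHARE).
toy_lexicon = ["idiota", "pedazo de idiota"]
spans = label_offensive_spans("Eres un pedazo de idiota, idiota.", toy_lexicon)
print(spans)  # [(8, 24, 'pedazo de idiota'), (26, 32, 'idiota')]
```

Spans are returned as half-open character offsets, so comment[start:end] recovers the matched text. Inflected forms such as plurals are not matched by this sketch; handling them would require lemmatization or an extended lexicon.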
TERMS OF USE:
- The resource is available free for research purposes.
- Do not redistribute the data.
- SINAI disclaims any responsibility for the use of the resource and does not provide technical support. However, the following contacts will be happy to answer questions and requests for clarification: fmplaza@ujaen.es, maite@ujaen.es.
If you use this resource, please cite the following paper:
@inproceedings{plaza-del-arco-etal-2021-offendes,
    title = "{O}ffend{ES}: A New Corpus in {S}panish for Offensive Language Research",
    author = "Plaza-del-Arco, Flor Miriam and
      Montejo-R{\'a}ez, Arturo and
      Ure{\~n}a-L{\'o}pez, L. Alfonso and
      Mart{\'\i}n-Valdivia, Mar{\'\i}a-Teresa",
    booktitle = "Proceedings of the International Conference on Recent Advances in Natural Language Processing (RANLP 2021)",
    month = sep,
    year = "2021",
    address = "Held Online",
    publisher = "INCOMA Ltd.",
    url = "https://aclanthology.org/2021.ranlp-main.123",
    pages = "1096--1108",
}