Description: Catalog of abusive language data (PLoS 2020)
This page catalogues datasets annotated for hate speech, online abuse, and offensive language. They may be useful for e.g. training a natural language processing system to detect this language.
The list is maintained by Leon Derczynski , Bertie Vidgen , Hannah Rose Kirk , Pica Johansson, Yi-Ling Chung , Mads Guldborg Kjeldgaard Kongsbak, Laila Sprejer , and Philine Zeinert.
We provide a list of datasets and keywords . If you would like to contribute to our catalogue or add your dataset, please see the instructions for contributing .