hatespeechdata.com - Hate Speech Dataset Catalogue | hatespeechdata

Description: Catalog of abusive language data (PLoS 2020)

Example domain paragraphs

This page catalogues datasets annotated for hate speech, online abuse, and offensive language. They may be useful for e.g. training a natural language processing system to detect this language.

The list is maintained by Leon Derczynski , Bertie Vidgen , Hannah Rose Kirk , Pica Johansson, Yi-Ling Chung , Mads Guldborg Kjeldgaard Kongsbak, Laila Sprejer , and Philine Zeinert.

We provide a list of datasets and keywords . If you would like to contribute to our catalogue or add your dataset, please see the instructions for contributing .

Links to hatespeechdata.com (1)

punyajoy.github.io Punyajoy Saha