Data set for automatic detection of online misogynistic speech

Data Brief. 2019 Aug 22:26:104223. doi: 10.1016/j.dib.2019.104223. eCollection 2019 Oct.

Abstract

The data set is composed of 2285 definitions posted on the Urban Dictionary platform from 1999 to May 2016. The data was classified as misogynistic and non-misogynistic by three independent researchers with domain knowledge. The data set is available in public repository in a table containing two columns: the text-based definition from Urban Dictionary and its respective classification (1 for misogynistic and 0 for non-misogynistic).

Keywords: Hate speech; Misogynistic speech; Misogyny detection; Online speech; Urban dictionary.