Two balanced datasets have been created, one for Italian and one for English. The corpora have been manually labelled by several annotators at three levels:
- Misogyny (Misogyny vs Not Misogyny)
- Misogynistic Category (Discredit, Derailing, Dominance, Sexual Harassment & Threats of Violence, Stereotype & Objectification)
- Target (Active vs Passive)
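The three annotation levels above can be pictured as a small record type. The following is only a minimal sketch, not the official data format: the class name `Annotation`, its field names, and the rule that a category and a target are given only for misogynous texts are assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import Optional

# Label sets copied from the three annotation levels listed above.
CATEGORIES = {
    "Discredit",
    "Derailing",
    "Dominance",
    "Sexual Harassment & Threats of Violence",
    "Stereotype & Objectification",
}
TARGETS = {"Active", "Passive"}


@dataclass(frozen=True)
class Annotation:
    """Hypothetical container for one labelled text (names are illustrative)."""

    misogynous: bool
    category: Optional[str] = None
    target: Optional[str] = None

    def __post_init__(self) -> None:
        if self.misogynous:
            # Assumption: misogynous texts carry both a category and a target.
            if self.category not in CATEGORIES:
                raise ValueError(f"unknown category: {self.category!r}")
            if self.target not in TARGETS:
                raise ValueError(f"unknown target: {self.target!r}")
        elif self.category is not None or self.target is not None:
            # Assumption: non-misogynous texts carry neither label.
            raise ValueError("category and target apply only to misogynous texts")


example = Annotation(misogynous=True, category="Discredit", target="Active")
```

Validating at construction time means an inconsistent combination of labels is rejected as soon as a record is built, rather than surfacing later during training or evaluation.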
Training and Testing Set
The training and testing sets will be made available through the AMI Google Group amievalita2018.