Label Propagation-Based Semi-Supervised Learningfor Hate Speech Classification

Speaker: Ashwin Geet D’Sa

Date and place: November 12, 2020 at 10:30, VISIO-CONFERENCE

Abstract:

Research on hate speech classification has received increased attention. In real-life scenarios, a small amount of labeled hate speech data is available to train a reliable classifier. Semi-supervised learning takes advantage ofa small amount of labeled data and a large amount of unlabeled data. In this work, label propagation-based semi-supervised learning is explored for the task of hate speech classification. The quality of labeling the unlabeled set depends on the input representations. We show that pre-trained representations are label agnostic, and when used with label propagation yield poor results. Neural network-based fine-tuning can be adopted to learn task-specific representations using asmall amount of labeled data. We show that fully fine-tuned representations may not always be the best representations for the label propagation and intermediate representations may perform better in a semi-supervised setup.