Author's posts
May 12
Multimodal speech animation
Speaker: Louis Abel Data and place: May 12, 2022, at 10:30 – Hybrid Abstract: Multimodal speech animation is the next step to speech synthesis, combining visuals with audio allows the creation of embodied conversational agent (ECA) which can convey more information than a classic text-to-speech approach, several works have been done in the team to progress in …
Apr 28
Expanding the training data for neural network based hate speech classification
Speaker: Ashwin Geet D’Sa Data and place: April 28, 2022, at 10:30 – Hybrid Abstract: The phenomenal increase in internet usage, catering to the dissemination of knowledge and expression, has also led to an increase in online hate speech. Online hate speech is anti-social communicative behavior, which leads to the threat and violence toward an individual or a group. …
Mar 31
Domain adaptation for accented speech
Speaker: Robin San Roman Data and place: March 31, 2022, at 10:30 – Hybrid Abstract: Automatic speech recognition (ASR) systems tend to have a performance drop with accented speech compared to standard speech. This is due to linguistic differences between the different domains of speech. In this talk, I will present a method to adapt CPC models …
Mar 24
Cross-corpora Hate-speech detection with Dynamically Refined Regularization
Speaker: Tulika Bose Data and place: March 24, 2022, at 10:30 – Hybrid Abstract: Hatespeechclassifiersexhibitsubstantial performance degradation when evaluated on datasets different from the source. This is due to learning spurious correlations between words that are not necessarily relevant to hateful language and hate speech labels from the training corpus. Previous work has attempted to mitigate this problem by regularizing specific terms from pre-defined static dictionaries. While this has been demonstrated to improve the generalizability of classifiers, the coverage of …
Mar 24
Cross-corpora Hate-speech detection with Dynamically Refined Regularization
Speaker: Tulika Bose Data and place: March 24, 2022, at 10:30 – Hybrid Abstract: Hate speech classifiers exhibit substantial performance degradation when evaluated on datasets different from the source. This is due to learning spurious correlations between words that are not necessarily relevant to hateful language and hate speech labels from the training corpus. Previous work has attempted to mitigate this problem by regularizing specific terms from pre-defined static dictionaries. While this has been demonstrated to improve the generalizability of classifiers, the coverage of such methods is …
Mar 17
Identification of Multiword Expressions in Tweets for Hate Speech Detection
Speaker: Nicolas Zampieri Data and place: March 17, 2022, at 10:30 – Hybrid Abstract: Multiword expression (MWE) identification in tweets is a complex task due to the complex linguistic nature of MWEs combined with the non-standard language use on social networks. MWE features were shown to be helpful for hate speech detection (HSD). In this article, we …
Mar 03
Low-resource MT: few-shot learning and historical language normalisation
Speaker: Rachel Bawden Data and place: March 3, 2022, at 10:30 – Hybrid Abstract: Huge progress has been seen in machine translation (MT), spurred on by advances in neural architectures. However, challenges still remain. One of the big challenges is dealing with scenarios that are low-resource (where there is little parallel data available). In this talk, I will …
Feb 03
Privacy-Preserving Speech Representation Learning using Vector Quantization
Speaker: Pierre Champion Data and place: February 3, 2022, at 10:00 – Hybrid Abstract: Speech signals are a rich source of speaker-related information, including sensitive attributes like gender, identity, etc. Those sensitive attributes can be extracted and used for malicious purposes like voice spoofing. Despite the inherent sensitivity of speech signals, more and more services, mainly …
Feb 02
Task representation learning
Speaker: Damien Siléo Data and place: February 2, 2022, at 10:00 – Videoconference Abstract: Current machine learning systems are often tuned for the desired task through annotated examples. This method is brittle or/and costly, as neural networks are still not very interpretable, and specifying their behavior through examples leads to many problems (unfairness and shallow reasoning). Task embeddings …
Jan 20
An Analogy based Approach for Solving Target Sense Verification
Speaker: Georgios Zervakis Data and place: January 20, 2022, at 10:30 – Videoconference Abstract: Contextualized language models are emerging as a de facto standard in natural language processing due to the vast amount of knowledge they acquire during pretraining. Nonetheless, their ability to solve tasks that require reasoning over this knowledge is limited. Certain tasks can …