Affevtive Bias in Large Pre-trained Language Models

Blacks is to Anger as Whites is to Joy? Understanding Latent Affective Bias in Large Pre-trained Neural Language Models
Anoop K¹, Deepak P.², Sahely Bhadra³, Manjary P Gangan¹, and Lajish V L¹
¹ Department of Computer Science, University, University of Calict, Kerala, India.
² School of Electronics, Electrical Engineering and Computer Science, Queen’s University Belfast, Northern Ireland, UK.
³ Department of Data Science, Indian Institute of Technology, Palakkad, India

📝 pre-print : https://arxiv.org/abs/2301.09003
🌏 Link : https://dcs.uoc.ac.in/cida/projects/ac/affective-bias.html

Abstract:Groundbreaking inventions and highly significant performance improvements in deep learning based Natural Language Processing are witnessed through the development of transformer based large Pre-trained Language Models (PLMs). The wide availability of unlabeled data within human generated data deluge along with self-supervised learning strategy helps to accelerate the success of large PLMs in language generation, language understanding, etc. But at the same time, latent historical bias/unfairness in human minds towards a particular gender, race, etc., encoded unintentionally/intentionally into the corpora harms and questions the utility and efficacy of large PLMs in many real-world applications, particularly for the protected groups. In this paper, we present an extensive investigation towards understanding the existence of “Affective Bias” in large PLMs to unveil any biased association of emotions such as anger, fear, joy, etc., towards a particular gender, race or religion with respect to the downstream task of textual emotion detection. We conduct our exploration of affective bias from the very initial stage of corpus level affective bias analysis by searching for imbalanced distribution of affective words within a domain, in large scale corpora that are used to pre-train and fine-tune PLMs. Later, to quantify affective bias in model predictions, we perform an extensive set of class-based and intensity-based evaluations using various bias evaluation corpora. Our results show the existence of statistically significant affective bias in the PLM based emotion detection systems, indicating biased association of certain emotions towards a particular gender, race, and religion.

For other inquiries, please contact:

Anoop K, University of Calicut, Kerala, India. 📧 anoopk_dcs@uoc.ac.in
Deepak P., Queen’s University Belfast, Northern Ireland, UK. 📧 deepaksp@acm.org
Sahely Bhadra, Indian Institute of Technology, Palakkad, India 📧 sahely@iitpkd.ac.in
Manjary P Gangan, University of Calicut, Kerala, India. 📧 manjaryp_dcs@uoc.ac.in
Lajish V. L., University of Calicut, Kerala, India. 📧 lajish@uoc.ac.in

Citation

will update soon...

Acknowledgement

The authors would like to thank the authors of [1] for making their source codes publicly available and the authors of [2,3,4] for making their evaluation corpora publicly available. The authors would like to thank Chanjal V.V., Master’s student (2018-20) of the Department of Women Studies, University of Calicut for her involvement and cooperation to create the list of target terms related to non-binary gender to conduct the corpus level experiments. The first author would like to thank Indian Institute of Technology Palakkad for organizing the GIAN course on Fairness in Machine Learning. The third author would like to thank the Department of Science and Technology (DST) of the Government of India for financial support through the Women Scientist Scheme-A (WOS-A) for Research in Basic/Applied Science under the Grant SR/WOS-A/PM-62/2018.

References

[1] Yi Chern Tan and L. Elisa Celis. 2019. Assessing Social and Intersectional Biases in Contextualized Word Representations. Curran Associates Inc., Red Hook, NY, USA, 13230–13241. https://dl.acm.org/doi/10.5555/3454287.3455472
[2] Svetlana Kiritchenko and Saif Mohammad. 2018. Examining Gender and Race Bias in Two Hundred Sentiment Analysis Systems. In Proceedings of the Seventh Joint Conference on Lexical and Computational Semantics. Association for Computational Linguistics, New Orleans, Louisiana, 43–53. https://doi.org/10.18653/v1/S18-2005
[3] Nikita Nangia, Clara Vania, Rasika Bhalerao, and Samuel R. Bowman. 2020. CrowS-Pairs: A Challenge Dataset for Measuring Social Biases in Masked Language Models. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP). Association for Computational Linguistics, Online, 1953–1967. https://doi.org/10.18653/v1/2020.emnlp-main.154
[4] Pranav Narayanan Venkit and Shomir Wilson. 2021. Identification of Bias Against People with Disabilities in Sentiment Analysis and Toxicity Detection Models. arXiv preprint arXiv:2111.13259 (2021). https://doi.org/10.48550/arXiv.2111.13259

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
Corpus level affective bias		Corpus level affective bias
Evaluation corpora		Evaluation corpora
LargePLMs based Emotion Detection		LargePLMs based Emotion Detection
Prediction level Affective bias		Prediction level Affective bias
images		images
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Corpus level affective bias

Corpus level affective bias

Evaluation corpora

Evaluation corpora

LargePLMs based Emotion Detection

LargePLMs based Emotion Detection

Prediction level Affective bias

Prediction level Affective bias

images

images

LICENSE

LICENSE

README.md

README.md

Repository files navigation

Affevtive Bias in Large Pre-trained Language Models

Citation

Acknowledgement

References

About

Releases

Packages

Contributors 2

Languages

License

anoopkdcs/affective_bias_in_plm

Folders and files

Latest commit

History

Repository files navigation

Affevtive Bias in Large Pre-trained Language Models

Citation

Acknowledgement

References

About

Topics

Resources

License

Stars

Watchers

Forks

Languages