Optimizing Bias Detection in Tweets Using Bayesian Probabilistic Model
- 1 Department of Computer Science and Engineering, University of Visvesvaraya College of Engineering, Bangalore, 560001, India
Abstract
Content moderation on social media faces persistent challenges from inconsistent evaluation shaped by subjective judgment and subtle semantic variations. This work proposes a Bayesian probabilistic framework for detecting bias in tweets using WordNet-based vocabulary filtering, statistical normalization via z-scores, and threshold optimization. The system is stateless, scalable, and dataset-agnostic, requiring no session-specific information. Unlike more complex models such as Support Vector Machines (SVM), Multi-Layer Perceptrons (MLP), and AdaBoost, which tend to exhibit skewed classification patterns, the proposed approach achieves balanced confusion matrices and competitive F1 scores. Experimental evaluation on three benchmark datasets covering hate speech, political partisanship, and racial and gender-based discrimination shows accuracy ranging from 71% to 82.4%, with the highest F1 score of 0.859 on Dataset 1, supporting the framework’s effectiveness for interpretable and balanced bias detection.
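To make the pipeline named in the abstract concrete, the following is a minimal sketch, not the authors' implementation. It assumes a naive-Bayes-style word log-odds score per tweet, normalizes the scores with z-scores, and sweeps a decision threshold to maximize F1. The WordNet-based vocabulary filter is omitted, and the function names, the smoothing constant `alpha`, and the four toy tweets are all hypothetical.

```python
import math
from collections import Counter
from statistics import mean, pstdev


def train_log_odds(docs, labels, alpha=1.0):
    """Per-word log P(w | biased) - log P(w | neutral) with Laplace smoothing.

    `alpha` is an assumed smoothing constant, not a value from the paper.
    """
    counts = {0: Counter(), 1: Counter()}
    for tokens, y in zip(docs, labels):
        counts[y].update(tokens)
    vocab = set(counts[0]) | set(counts[1])
    totals = {y: sum(counts[y].values()) + alpha * len(vocab) for y in (0, 1)}
    return {
        w: math.log((counts[1][w] + alpha) / totals[1])
        - math.log((counts[0][w] + alpha) / totals[0])
        for w in vocab
    }


def raw_score(tokens, log_odds):
    """Sum of word log-odds; words outside the vocabulary contribute 0."""
    return sum(log_odds.get(t, 0.0) for t in tokens)


def z_scores(values):
    """Standardize scores to zero mean and unit (population) variance."""
    mu, sigma = mean(values), pstdev(values)
    return [(v - mu) / sigma if sigma else 0.0 for v in values]


def f1_score(preds, labels):
    """F1 for the positive (biased) class; 0.0 when undefined."""
    tp = sum(1 for p, y in zip(preds, labels) if p == 1 and y == 1)
    fp = sum(1 for p, y in zip(preds, labels) if p == 1 and y == 0)
    fn = sum(1 for p, y in zip(preds, labels) if p == 0 and y == 1)
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


def best_threshold(z, labels):
    """Try each observed z-score as a cut-off and keep the one with highest F1."""
    candidates = sorted(set(z))
    scored = [(f1_score([int(v >= t) for v in z], labels), t) for t in candidates]
    best_f1, best_t = max(scored, key=lambda pair: pair[0])
    return best_t, best_f1


# Hypothetical toy data: 1 = biased, 0 = neutral.
docs = [
    ["they", "always", "lie"],
    ["those", "people", "always", "cheat"],
    ["nice", "weather", "today"],
    ["people", "enjoy", "weather"],
]
labels = [1, 1, 0, 0]

log_odds = train_log_odds(docs, labels)
z = z_scores([raw_score(d, log_odds) for d in docs])
best_t, best_f1 = best_threshold(z, labels)
preds = [int(v >= best_t) for v in z]
```

In this sketch the threshold is tuned on the same four examples used for training, which is only acceptable for illustration; a real evaluation would tune it on held-out data.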
DOI: https://doi.org/10.3844/jcssp.2026.1552.1568
Copyright: © 2026 Prasanth G Rao, Harsha Chigurupati, Krish Hashia, Thriveni J, P Deepa Shenoy and Venugopal K R. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Keywords
- X
- Bias
- Fairness
- SentiWordNet