Predictive linguistic cues for fake news: a societal artificial intelligence problem

Sandhya Aneja, Nagender Aneja, Ponnurangam Kumaraguru

Abstract


Media news are making a large part of public opinion and, therefore, must not be fake. News on web sites, blogs, and social media must be analyzed before being published. In this paper, we present linguistic characteristics of media news items to differentiate between fake news and real news using machine learning algorithms. Neural fake news generation, headlines created by machines, semantic incongruities in text and image captions generated by machine are other types of fake news problems. These problems use neural networks which mainly control distributional features rather than evidence. We propose applying correlation between features set and class, and correlation among the features to compute correlation attribute evaluation metric and covariance metric to compute variance of attributes over the news items. Features unique, negative, positive, and cardinal numbers with high values on the metrics are observed to provide a high area under the curve (AUC) and F1-score.

Keywords


Fake news; Lexicon analysis; Linguistic analysis; Machine learning; Sentiment analysis;

Full Text:

PDF


DOI: http://doi.org/10.11591/ijai.v11.i4.pp%25p

Refbacks

  • There are currently no refbacks.


View IJAI Stats

Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.