Skip to main content

Detection of Hyperpartisan news articles using natural language processing techniques.

Naredla, N.R. and Adedoyin, F. F., 2022. Detection of Hyperpartisan news articles using natural language processing techniques. International Journal of Information Management Data Insights, 2 (1), 100064.

Full text available as:

[img]
Preview
PDF (OPEN ACCESS ARTICLE)
1-s2.0-S2667096822000088-main.pdf - Published Version
Available under License Creative Commons Attribution Non-commercial No Derivatives.

1MB
[img] PDF
Manuscript R3 Clean.pdf - Accepted Version
Restricted to Repository staff only
Available under License Creative Commons Attribution Non-commercial No Derivatives.

446kB

DOI: 10.1016/j.jjimei.2022.100064

Abstract

Yellow journalism has increased the spread of hyperpartisan news on the internet. It is very difficult for online news article readers to distinguish hyperpartisan news articles from mainstream news articles. There is a need for an automated model that can detect hyperpartisan news on the internet and tag them as hyperpartisan so that it is very easy for readers to avoid that news. A hyperpartisan news detection article was developed by using three different natural language processing techniques named BERT, ELMo, and Word2vec. This research used the bi-article dataset published at SEMEVAL-2019. The ELMo word embeddings which are trained on a Random forest classifier has got an accuracy of 0.88, which is much better than other state of art models. The BERT and Word2vec models have got the same accuracy of 0.83. This research tried different sentence input lengths to BERT and proved that BERT can extract context from local words. Evidenced from the described ML models, this study will assist the governments, news’ readers, and other political stakeholders to detect any hyperpartisan news, and also helps policy to track, and regulate, misinformation about the political parties and their leaders.

Item Type:Article
ISSN:2667-0968
Uncontrolled Keywords:BERT ; Transformers ; Word embedding ; ELMo ; natural language processing ; Word2vec ; Tensorflow ; bidirectional
Group:Bournemouth University Business School
ID Code:36683
Deposited By: Symplectic RT2
Deposited On:28 Feb 2022 12:39
Last Modified:14 Mar 2022 14:33

Downloads

Downloads per month over past year

More statistics for this item...
Repository Staff Only -