Naredla, N.R. and Adedoyin, F. F., 2022. Detection of Hyperpartisan news articles using natural language processing techniques. International Journal of Information Management Data Insights, 2 (1), 100064.
Full text available as:
|
PDF (OPEN ACCESS ARTICLE)
1-s2.0-S2667096822000088-main.pdf - Published Version Available under License Creative Commons Attribution Non-commercial No Derivatives. 1MB | |
PDF
Manuscript R3 Clean.pdf - Accepted Version Restricted to Repository staff only Available under License Creative Commons Attribution Non-commercial No Derivatives. 446kB | ||
Copyright to original material in this document is with the original owner(s). Access to this content through BURO is granted on condition that you use it only for research, scholarly or other non-commercial purposes. If you wish to use it for any other purposes, you must contact BU via BURO@bournemouth.ac.uk. Any third party copyright material in this document remains the property of its respective owner(s). BU grants no licence for further use of that third party material. |
DOI: 10.1016/j.jjimei.2022.100064
Abstract
Yellow journalism has increased the spread of hyperpartisan news on the internet. It is very difficult for online news article readers to distinguish hyperpartisan news articles from mainstream news articles. There is a need for an automated model that can detect hyperpartisan news on the internet and tag them as hyperpartisan so that it is very easy for readers to avoid that news. A hyperpartisan news detection article was developed by using three different natural language processing techniques named BERT, ELMo, and Word2vec. This research used the bi-article dataset published at SEMEVAL-2019. The ELMo word embeddings which are trained on a Random forest classifier has got an accuracy of 0.88, which is much better than other state of art models. The BERT and Word2vec models have got the same accuracy of 0.83. This research tried different sentence input lengths to BERT and proved that BERT can extract context from local words. Evidenced from the described ML models, this study will assist the governments, news’ readers, and other political stakeholders to detect any hyperpartisan news, and also helps policy to track, and regulate, misinformation about the political parties and their leaders.
Item Type: | Article |
---|---|
ISSN: | 2667-0968 |
Uncontrolled Keywords: | BERT ; Transformers ; Word embedding ; ELMo ; natural language processing ; Word2vec ; Tensorflow ; bidirectional |
Group: | Bournemouth University Business School |
ID Code: | 36683 |
Deposited By: | Symplectic RT2 |
Deposited On: | 28 Feb 2022 12:39 |
Last Modified: | 14 Mar 2022 14:33 |
Downloads
Downloads per month over past year
Repository Staff Only - |