A Multi-label Classification System to Distinguish among Fake, Satirical, Objective and Legitimate News in Brazilian Portuguese

Janaína Ignacio de Morais; Hugo Queiroz Abonizio; Gabriel Marques Tavares; André Azevedo da Fonseca; Sylvio Barbon Jr

Authors

Janaína Ignacio de Morais Universidade Estadual de Londrina (UEL)
Hugo Queiroz Abonizio Universidade Estadual de Londrina (UEL)
Gabriel Marques Tavares Universidade Estadual de Londrina (UEL)
André Azevedo da Fonseca Universidade Estadual de Londrina (UEL)
Sylvio Barbon Jr Universidade Estadual de Londrina (UEL)

Keywords:

Fake News, Decision Support System, Text Mining, Multi-Label

Abstract

The diffusion of fake news has increased significantly worldwide, especially in politics, where misinformation can be propagated and appears in election debates around the world. However, news with a recreational purpose, such as satirical news, is often confused with objective fake news. In this work, we address the differences between the objectivity and the legitimacy of news documents, treating each article as belonging to two conceptual classes: objective/satirical and legitimate/fake. We therefore propose a DSS (Decision Support System) based on a Text Mining (TM) pipeline with a set of novel textual features, using multi-label methods to classify news articles on these two domains. To this end, a set of multi-label methods was evaluated with a combination of different base classifiers and then compared with a multi-class approach. A set of real-life news data was also collected from several Brazilian news portals for these experiments. The results show our DSS to be adequate (0.80 F1-score) when addressing the scenario of misleading news from the challenging multi-label perspective, where the multi-class methods (0.01 F1-score) were outperformed by the proposed method. Moreover, we analyzed how each group of stylometric features used in the experiments influences the result, aiming to discover whether a particular group is more relevant than the others. As a result, we noted that the complexity group of features could be more relevant than the others.
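
As an illustrative sketch only (not the authors' implementation), the two-label formulation described in the abstract can be contrasted with a four-class formulation in a few lines of Python. The label names, the class names, the helper functions, and the toy predictions are all assumptions introduced for illustration. The abstract does not specify the label encoding or the F1 averaging scheme used in the paper.

```python
from typing import Dict, List, Tuple

# Hypothetical encoding: one binary label per conceptual dimension.
# Position 0: 1 = satirical, 0 = objective.
# Position 1: 1 = fake,      0 = legitimate.
LABELS = ("satirical", "fake")
LabelPair = Tuple[int, int]

# Multi-class view (assumed names): every label combination is one class.
LABELS_TO_CLASS: Dict[LabelPair, str] = {
    (0, 0): "objective_legitimate",
    (0, 1): "objective_fake",
    (1, 0): "satirical_legitimate",
    (1, 1): "satirical_fake",
}


def to_multiclass(pair: LabelPair) -> str:
    """Map a two-label annotation to a single multi-class name."""
    return LABELS_TO_CLASS[pair]


def f1_binary(y_true: List[int], y_pred: List[int]) -> float:
    """F1-score of the positive class; 0.0 when there are no true positives."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


def macro_f1_per_label(y_true: List[LabelPair], y_pred: List[LabelPair]) -> float:
    """Average the per-label F1-scores (one score per conceptual dimension)."""
    scores = [
        f1_binary([y[j] for y in y_true], [y[j] for y in y_pred])
        for j in range(len(LABELS))
    ]
    return sum(scores) / len(scores)


def exact_match(y_true: List[LabelPair], y_pred: List[LabelPair]) -> float:
    """Share of articles whose multi-class name is predicted exactly."""
    hits = sum(1 for t, p in zip(y_true, y_pred)
               if to_multiclass(t) == to_multiclass(p))
    return hits / len(y_true)


# Made-up toy annotations and predictions, not data from the paper.
y_true = [(0, 0), (0, 1), (1, 0), (1, 1)]
y_pred = [(0, 0), (0, 1), (1, 1), (1, 1)]

print("macro F1 over labels:", macro_f1_per_label(y_true, y_pred))
print("multi-class exact match:", exact_match(y_true, y_pred))
```

In this toy setting, an article that is wrong on only one dimension still earns credit on the other under the per-label score, whereas the multi-class view counts it as a complete miss.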

Published

2020-07-31

How to Cite

Morais, J. I. de, Abonizio, H. Q., Tavares, G. M., da Fonseca, A. A., & Barbon Jr., S. (2020). A Multi-label Classification System to Distinguish among Fake, Satirical, Objective and Legitimate News in Brazilian Portuguese. ISys - Brazilian Journal of Information Systems, 13(4), 126–149. Retrieved from https://seer.unirio.br/isys/article/view/9563

Issue

Vol. 13 No. 4 (2020)

Section

EXTENDED VERSIONS OF SELECTED ARTICLES

License

Authors who publish in this journal agree to the following terms: Authors retain copyright and grant the journal the right of first publication, with the work simultaneously licensed under the Creative Commons Attribution License - http://creativecommons.org/licenses/by/3.0/ which allows sharing of the work with acknowledgment of authorship and initial publication in this journal. Authors are permitted to enter into separate, additional contracts for non-exclusive distribution of the version of the work published in this journal (e.g., publishing in an institutional repository or as a book chapter), with acknowledgment of authorship and initial publication in this journal. Authors are permitted and encouraged to publish and distribute their work online (e.g., in institutional repositories or on their personal page) at any point before or during the editorial process, as this can lead to productive changes as well as increase the impact and citation of the published work (adding to this distribution the full citation of the article in iSys).
