Use este identificador para citar ou linkar para este item: http://repositorio.ufla.br/jspui/handle/1/50511
Título: Big data analytics for critical information classification in online social networks using classifier chains
Palavras-chave: Big data
Age-group classifier
Gender classifier
Feature selection
Feature transformation
Multi-label classification
Data do documento: 10-Jan-2022
Editor: Springer
Citação: SILVA, D. H. et al. Big data analytics for critical information classification in online social networks using classifier chains. Peer-to-Peer Networking and Applications, [S.l.], v. 15, p. 626-641, Jan. 2022. DOI: 10.1007/s12083-021-01269-1.
Resumo: Industrial and academic organizations are using online social network (OSN) for different purposes, such as social and economic aspects. Now, OSN is a new mean of obtaining information from people about their preferences, and interests. Due to the large volume of user-generated content, researchers use various techniques, such as sentiment analysis or data mining to evaluate this information automatically. However, the sentiment analysis of OSN content is performed by different methods, but there are some problems to obtain highly reliable results, mainly because of the lack of user profile information, such as gender and age. In this work, a novel dataset is built, which contains the writing characteristics of 160,000 users of the Twitter OSN. Before creating classification models with Machine Learning (ML) techniques, feature transformation and feature selection methods are applied to determine the most relevant set of characteristics. To create the models, the Classifier Chain (CC) transformation technique and different machine learning algorithms are applied to the training set. Simulation results show that the Random Forest, XGBoost and Decision Tree algorithms obtain the best performance results. In the testing phase, these algorithms reached Hamming Loss values of 0.033, 0.033, and 0.034, respectively, and all of them reached the same F1 micro-average value equal to 0.976. Therefore, our proposal based on a multidimensional learning technique using CC transformation overcomes other similar proposals.
URI: https://link.springer.com/article/10.1007/s12083-021-01269-1
http://repositorio.ufla.br/jspui/handle/1/50511
Aparece nas coleções:DCC - Artigos publicados em periódicos

Arquivos associados a este item:
Não existem arquivos associados a este item.


Os itens no repositório estão protegidos por copyright, com todos os direitos reservados, salvo quando é indicado o contrário.

Ferramentas do administrador