Please use this identifier to cite or link to this item: http://repositorio.ufla.br/jspui/handle/1/50511
Title: Big data analytics for critical information classification in online social networks using classifier chains
Keywords: Big data
Age-group classifier
Gender classifier
Feature selection
Feature transformation
Multi-label classification
Issue Date: 10-Jan-2022
Publisher: Springer
Citation: SILVA, D. H. et al. Big data analytics for critical information classification in online social networks using classifier chains. Peer-to-Peer Networking and Applications, [S.l.], v. 15, p. 626-641, Jan. 2022. DOI: 10.1007/s12083-021-01269-1.
Abstract: Industrial and academic organizations are using online social network (OSN) for different purposes, such as social and economic aspects. Now, OSN is a new mean of obtaining information from people about their preferences, and interests. Due to the large volume of user-generated content, researchers use various techniques, such as sentiment analysis or data mining to evaluate this information automatically. However, the sentiment analysis of OSN content is performed by different methods, but there are some problems to obtain highly reliable results, mainly because of the lack of user profile information, such as gender and age. In this work, a novel dataset is built, which contains the writing characteristics of 160,000 users of the Twitter OSN. Before creating classification models with Machine Learning (ML) techniques, feature transformation and feature selection methods are applied to determine the most relevant set of characteristics. To create the models, the Classifier Chain (CC) transformation technique and different machine learning algorithms are applied to the training set. Simulation results show that the Random Forest, XGBoost and Decision Tree algorithms obtain the best performance results. In the testing phase, these algorithms reached Hamming Loss values of 0.033, 0.033, and 0.034, respectively, and all of them reached the same F1 micro-average value equal to 0.976. Therefore, our proposal based on a multidimensional learning technique using CC transformation overcomes other similar proposals.
URI: https://link.springer.com/article/10.1007/s12083-021-01269-1
http://repositorio.ufla.br/jspui/handle/1/50511
Appears in Collections:DCC - Artigos publicados em periódicos

Files in This Item:
There are no files associated with this item.


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Admin Tools