Use este identificador para citar ou linkar para este item: http://repositorio.ufla.br/jspui/handle/1/58594
Registro completo de metadados
Campo DCValorIdioma
dc.creatorOliveira, Douglas Nunes de-
dc.creatorUtsch, Milo Noronha Rocha-
dc.creatorMachado, Diogo Villela Pedro de Almeida-
dc.creatorPena, Nina Goulart-
dc.creatorOliveira, Ramon Gomes Durães de-
dc.creatorCarvalho, Arthur Iperoyg Rodrigues-
dc.creatorMerschmann, Luiz Henrique de Campos-
dc.date.accessioned2023-11-24T15:48:53Z-
dc.date.available2023-11-24T15:48:53Z-
dc.date.issued2023-
dc.identifier.citationOLIVEIRA, D. N. de et al. Evaluating a new auto-ML approach for sentiment analysis and intent recognition tasks. Journal on Interactive Systems, Porto Alegre, v. 14, n. 1, p. 92-105, 2023.pt_BR
dc.identifier.urihttp://repositorio.ufla.br/jspui/handle/1/58594-
dc.description.abstractAutomated Machine Learning (AutoML) is a research area that aims to help humans solve Machine Learning (ML) problems by automatically discovering good ML pipelines (algorithms and their hyperparameters for every stage of a machine learning process) for a given dataset. Since we have a combinatorial optimization problem for which it is impossible to evaluate all possible pipelines, most AutoML systems use a Genetic Algorithm (GA) or Bayesian Optimization (BO) to find a good solution. These systems usually evaluate the performance of the pipelines using the K-fold cross-validation method, for which the more pipelines are evaluated, the higher the chance of finding an overfitted solution. To avoid the aforementioned issue, we propose a system named Auto-ML System for Text Classification (ASTeC), that uses the Bootstrap Bias Corrected CV (BBC-CV) method to evaluate the performance of the pipelines. More specifically, the proposed system combines GA, BO, and BBC-CV to find a good ML pipeline for the text classification task. We evaluated our approach by comparing it with state-of-the-art systems: in the the Sentiment Analysis (SA) task, we compared our approach to TPOT (Tree-based Pipeline Optimization Tool) and Google Cloud AutoML service, and for the Intent Recognition (IR) task, we compared with TPOT and MLJAR AutoML. Concerning the data, we analysed seven public datasets from the SA domain and sixteen from the IR domain. Four out of those sixteen are composed by written English text, while all of the others are in Brazilian Portuguese. Statistical tests show that, in 21 out of 23 datasets, our system's performance is equivalent to or better than the others.pt_BR
dc.languageen_USpt_BR
dc.publisherBrazilian Computing Societypt_BR
dc.rightsacesso abertopt_BR
dc.rightsAttribution 4.0 International*
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/*
dc.sourceJournal on Interactive Systemspt_BR
dc.subjectAutomated Machine Learning (AutoML)pt_BR
dc.subjectBiascorrectioncross-validationpt_BR
dc.subjectGenetic algorithmpt_BR
dc.subjectBayesian optimizationpt_BR
dc.subjectIntent recognitionpt_BR
dc.subjectChatbotpt_BR
dc.titleEvaluating a new auto-ML approach for sentiment analysis and intent recognition taskspt_BR
dc.typeArtigopt_BR
Aparece nas coleções:DAC - Artigos publicados em periódicos

Arquivos associados a este item:
Arquivo Descrição TamanhoFormato 
3161-Article Text-14257-1-10-20230509.pdf284,19 kBAdobe PDFVisualizar/Abrir


Este item está licenciada sob uma Licença Creative Commons Creative Commons