Use este identificador para citar ou linkar para este item:
http://repositorio.ufla.br/jspui/handle/1/14967
Título: | Evaluation and Comparison of Concept Based and N-Grams Based Text Clustering Using SOM |
Autor: | Amine, Abdelmalek Elberrichi, Zakaria Simonet, Michel Malki, Mimoun |
Palavras-chave: | Text clustering Self-Organizing Maps of Kohonen N-grams Concept Similarity Reuters21578 |
Publicador: | Universidade Federal de Lavras |
Data: | 1-Mar-2008 |
Referência: | AMINE, A.; ELBERRICHI, Z.; SIMONET, M.; MALKI, M. Evaluation and Comparison of Concept Based and N-Grams Based Text Clustering Using SOM. INFOCOMP Journal of Computer Science, Lavras, v. 7, n. 1, p. 27-35, Mar. 2008. |
Outras Identificações : | http://www.dcc.ufla.br/infocomp/index.php/INFOCOMP/article/view/203 |
Descrição: | With the great and rapidly growing number of documents available in digital form (Internet, library, CD-Rom…), the automatic classification of texts has become a significant research field and a fundamental task in document processing. This paper deals with unsupervised classification of textual documents also called text clustering using Self-Organizing Maps of Kohonen in two new situations: a conceptual representation of texts and a representation based on n-grams, instead of a representation based on words. The effects of these combinations are examined in several experiments using 4 measurements of similarity. The Reuters-21578 corpus is used for evaluation. The evaluation was done by using the F-measure and the entropy. |
Idioma: | eng |
Aparece nas coleções: | Infocomp |
Arquivos associados a este item:
Não existem arquivos associados a este item.
Os itens no repositório estão protegidos por copyright, com todos os direitos reservados, salvo quando é indicado o contrário.