Clustering XML Documents using Self-Organizing Maps for Structures
SPERDUTI, ALESSANDRO;
2006
Abstract
Self-Organizing Maps capable of encoding structured information will be used for the clustering of XML documents. Documents formatted in XML are appropriately represented as graph data structures. It will be shown that the Self-Organizing Maps can be trained in an unsupervised fashion to group XML structured data into clusters, and that the time required for this task scales linearly with the size of the corpus. It will also be shown that some simple prior knowledge of the data structures is beneficial to the efficient grouping of the XML documents.