Abstract
This paper presents an approach to classify/cluster the web documents by decompositions of hypergraphs. The various levels of co-occurring frequent terms, called association rules (undirected rules), of documents form a hypergraph. Clustering methods is then applied to analyze such hypergraphs; a simple and fast clustering algorithm is used to decomposing hypergraph into connected components. Each connected component represents a primitive concept within the given documents. The documents will then be classified/clustered by such primitive concepts.
Original language | English |
---|---|
Title of host publication | Proceedings of SPIE - The International Society for Optical Engineering |
Editors | B.V. Dasarathy |
Pages | 168-177 |
Number of pages | 10 |
Volume | 5433 |
DOIs | |
Publication status | Published - 2004 |
Event | Data Mining and Knowledge Discovery: Theory, Tools, and Technology VI - Orlando, FL, United States Duration: Apr 12 2004 → Apr 13 2004 |
Other
Other | Data Mining and Knowledge Discovery: Theory, Tools, and Technology VI |
---|---|
Country/Territory | United States |
City | Orlando, FL |
Period | 4/12/04 → 4/13/04 |
Keywords
- Association rules
- Document clustering
- Hypergraph partition
ASJC Scopus subject areas
- Electrical and Electronic Engineering
- Condensed Matter Physics