To organize a huge amount of Web pages into topics, according to their relevance, is the efficient and effective method for information retrieval. Latent Semantic Space (LSS) naturally in the form on some geometric structure in Combinatorial Topology has been proposed for unstructured document clustering. Given a set of Web pages, the set of associations among frequently co-occurring terms in them forms naturally a CONCEPT, which is represented as a set of connected components of the simplicial complexes. Based on these concepts, Web pages can be clustered into meaningful categories.
|名字||Studies in Computational Intelligence|