Formal concept analysis and document clustering via granular computing

Tsau Young Lin, I-Jen Chiang

研究成果: 書貢獻/報告類型會議貢獻

摘要

A text/web document is a knowledge representation of a human idea (a structured set of thoughts). This paper refines TFIDF and Extended TFIDF(ETFIDF)[16]; These values really measures the co-occurrences of tokens. The ETFID captures the semantic more accurately. Tokens with high TFIDF values are called Keywords. The sets of (n+1) Co-occurring keywords with High ETFIDF are called n-granules. The collection of keywords and n-granules can be interpreted geometrically; they form a non-closed simplicial complex. The corresponding non-closed polyhedron is called Latent Semantic Space(LSS). LSS is a geometric knowledge base that provides the semantic to search engine:
原文英語
主出版物標題Conference Proceedings - IEEE International Conference on Systems, Man and Cybernetics
頁面4763-4767
頁數5
6
DOIs
出版狀態已發佈 - 2007
事件2006 IEEE International Conference on Systems, Man and Cybernetics - Taipei, 臺灣
持續時間: 10月 8 200610月 11 2006

其他

其他2006 IEEE International Conference on Systems, Man and Cybernetics
國家/地區臺灣
城市Taipei
期間10/8/0610/11/06

ASJC Scopus subject areas

  • 工程 (全部)

指紋

深入研究「Formal concept analysis and document clustering via granular computing」主題。共同形成了獨特的指紋。

引用此