In this paper, the defects latent semantic analysis, probabilistic latent semantic analysis using methods to construct the text-the words of co-occurrence matrix, using the em algorithm to solve.
This paper presents methods of mechanical matching, feature lexicon, binding matrix, grammar analysis and semantic understanding for the Chinese language automatic word segmentation.
本文给出了为汉语自动分词而提出的机械匹配法、特征词库法、约束矩阵法、语法分析法和理解切分法。
3
In this paper, we use the co-occurrence path to explain the relationship between the index words and extract the semantic information in the term-term matrix to expand the query.