The paper gives the method based on probabilistic techniques and rules for new word discovery via analyzing the current techniques of phrase extraction and combining the specialties of Chinese.
该文分析了已有短语抽取技术,并结合汉语特点,提出了基于概率统计技术和规则方法相结合的概念抽取方法。
2
This method includes the "bi-gram" probabilistic model, the statistical algorithm, the rich rules and rule-based algorithm for word filtering.