Welcome to Journal of Beijing Institute of Technology
ZHANG Feng, FAN Xiao-zhong, XU Yun. Chinese Term Extraction Based on PAT Tree[J]. JOURNAL OF BEIJING INSTITUTE OF TECHNOLOGY, 2006, 15(2): 162-166.
Citation: ZHANG Feng, FAN Xiao-zhong, XU Yun. Chinese Term Extraction Based on PAT Tree[J]. JOURNAL OF BEIJING INSTITUTE OF TECHNOLOGY, 2006, 15(2): 162-166.

Chinese Term Extraction Based on PAT Tree

  • A new method of automatic Chinese term extraction is proposed based on Patricia (PAT) tree. Mutual information is calculated based on prefix searching in PAT tree of domain corpus to estimate the internal associative strength between Chinese characters in a string. It can improve the speed of term candidate extraction largely compared with methods based on domain corpus directly. Common collocation suffix, prefix bank are constructed and term part of speech (POS) composing rules are summarized to improve the precision of term extraction. Experiment results show that the F-measure is 74.97%.
  • loading

Catalog

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return
    Baidu
    map