Information and Communications Technology and Policy

Information and Communications Technology and Policy

Information and Communications Technology and Policy ›› 2025, Vol. 51 ›› Issue (2): 87-96.doi: 10.12267/j.issn.2096-5931.2025.02.014

Previous Articles    

Research on the extraction and ambiguity handling method of standard text keywords based on multi-algorithm fusion

FU Zhenqiu1,2, TIAN Hui1,2   

  1. 1. Information and Communication Integration Innovation Research Center, China Academy of Information and Communications Technology, Beijing 100191, China
    2. Taier Rongchuang (Beijing) Technology Co., Ltd., Beijing 100191, China
  • Received:2024-10-09 Online:2025-02-25 Published:2025-03-04

Abstract:

Firstly, the extraction and ambiguity handling method of standard text keywords based on multi-algorithm fusion combines TF-IDF and TextRank, while considering word position, part of speech, word length, and word frequency to complete the keywords extraction of standard text. Then, it uses Hanlp to process the same text and complete the contrastive ambiguity processing. Through the analysis of experimental results, this method has a significant effect on improving the efficiency and processing quality of keywords extraction and ambiguity handling in standard texts. It also provides an innovative approach for large models to conduct standard knowledge mining by combining knowledge bases with intelligent agents.

Key words: standard text, keywords, extraction, ambiguity

CLC Number: