抽象的

A Modified Frequency Based Term Weighting Approach for Information Retrieval

M. Santhanakumar and C. Christopher Columbus


Term frequency-inverse document frequency (TF-IDF) is one of the repeatedly used term weighting methods, which assigns weights based on the occurrences of a term in a document. This paper proposes an improved TF-IDF method using multi term occurrences in a document. To achieve the best performance, pre-processing methods such as tokenization, stopword removal and stemming are applied on both user query and document terms. The experimental results of the proposed work are compared with existing term weighting methods such as TF, IDF, TF-IDF and entropy. The proposed method gives better average precision, recall and F-score values than the existing methods.


免责声明: 此摘要通过人工智能工具翻译,尚未经过审核或验证

索引于

  • 中国社会科学院
  • 谷歌学术
  • 打开 J 门
  • 中国知网(CNKI)
  • 宇宙IF
  • 研究期刊索引目录 (DRJI)
  • 秘密搜索引擎实验室
  • ICMJE

查看更多

期刊国际标准号

期刊 h 指数

Flyer