中文词频统计

分类: 文章 • 2025-04-12 17:03:22

import jieba

file=open('text','r',encoding = 'utf-8')

wordList=list(jieba.cut(file.read()))
wordDict={}
for word in wordList:
    if(len(word)==1):
        continue
    wordDict[word]= wordList.count(word)

wordListSort=sorted(wordDict.items(),key=lambda d: d[1],reverse=True)

for i in range(20):
    if i>= len(wordListSort):
        break
    print(wordListSort[i])

　　中文词频统计

中文词频统计

相关推荐