9.27作业

时间：2018-09-29 10:48:28 阅读：185 评论：0 收藏：0 [点我收藏+]

英文字频统计

strHello=‘‘‘...‘‘‘.lower()
fo = open(‘hello.txt‘, ‘r‘, encoding=‘utf-8‘)
hello = fo.read()
fo.close()
print(hello)
sep = ‘‘‘,?‘‘‘
for ch in sep:
    strHello = strHello.replace(ch, ‘‘)

    strList = strHello.split()
    print(len(strList),strList)
    strSet = set(strList)
    exclude = {‘i‘, ‘in‘, ‘the‘‘anymore‘}
    strSet = strSet-exclude

    print(len(strSet),strSet)

    strDict = {}
    for hello in strSet:
        strDict[hello] = strList.count(hello)

        print(strDict.items())

wcList = list(strDict.items())
wcList.sort()
print(strDict.items())
print(wcList[:20])

运行结果

技术分享图片

中文字频统计（小说《装在套子里的人》

import jieba

fo = open (‘taozi.txt‘, ‘r‘, encoding=‘utf-8‘)
zhuang = fo.read ().lower ()
fo.close ()
print (zhuang)

sep = ‘，。？！；：“”‘’-——<_/>‘
for en in sep:
    zhuang = zhuang.replace (en, ‘‘)

zhaung = list (jieba.cut_for_search (zhuang))

strSet = set (zhuang)
# print(len(strSet), strSet)

strDict = dict ()
for word in strSet:
    strDict[word] = zhuang.count (word)
    # print(len(strDict), strDict)

wcList = list (strDict.items ())
# print(wcList)
wcList.sort (key=lambda x: x[1], reverse=True)
# print(wcList)

for i in range (20):
    print (wcList[i])

运行结果

技术分享图片

9.27作业

原文：https://www.cnblogs.com/fanfanfan/p/9712284.html

踩

(0)

评论一句话评论（0）

分享档案

更多>

2021年09月23日 (328)
2021年09月24日 (313)
2021年09月17日 (191)
2021年09月15日 (369)
2021年09月16日 (411)
2021年09月13日 (439)
2021年09月11日 (398)
2021年09月12日 (393)
2021年09月10日 (160)
2021年09月08日 (222)