Scraping the runoob sidebar table-of-contents links with Python

Date: 2019-12-19 13:47:16

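import re
import requests
from urllib.parse import urljoin

url = 'https://www.runoob.com/python3/python3.html'
response = requests.get(url)
# Set the encoding before reading .text so the page is decoded as UTF-8.
response.encoding = 'utf-8'
html = response.text
# Isolate the left-hand table-of-contents column.
dl = re.findall(r'<div class="design" id="leftcolumn">.*?</div>', html, re.S)[0]
# Collect (title, href) pairs from the links inside that column.
tree = re.findall(r'title="(.*?)".*?href="(.*?)"', dl)
lst = []

def get_data(link):
    # Record the link, fetch the page, and report progress.
    lst.append(link)
    ht = requests.get(link)
    print('Downloaded', len(lst), 'links')

for tree_info in tree:
    # urljoin resolves relative and root-relative href values against the page URL.
    link = urljoin(url, tree_info[1])
    # Write the newline to the file only; the request URL must not contain it.
    with open(r'D:\Desktop\测试\html.txt', 'a', encoding='utf-8') as f:
        f.write(link + '\n')
    get_data(link)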
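
The extraction step can be tried offline before hitting the site. The sketch below runs the same two regular expressions against a small hard-coded snippet. SAMPLE_HTML and the helper name extract_links are made up for illustration and are not taken from the real page, whose markup may differ.

```python
import re
from urllib.parse import urljoin

BASE_URL = 'https://www.runoob.com/python3/python3.html'

# Hypothetical sidebar markup, invented for this example only.
SAMPLE_HTML = '''
<div class="design" id="leftcolumn">
<a target="_top" title="Python 3 Tutorial" href="/python3/python3-tutorial.html">Python 3 Tutorial</a>
<a target="_top" title="Python 3 Introduction" href="python3-intro.html">Python 3 Introduction</a>
</div>
<div id="footer"><a title="About" href="/about">About</a></div>
'''

def extract_links(html, base_url):
    """Return (title, absolute_url) pairs found in the left-column block."""
    blocks = re.findall(r'<div class="design" id="leftcolumn">.*?</div>', html, re.S)
    if not blocks:
        # The page has no matching block; return nothing instead of raising IndexError.
        return []
    pairs = re.findall(r'title="(.*?)".*?href="(.*?)"', blocks[0])
    return [(title, urljoin(base_url, href)) for title, href in pairs]

links = extract_links(SAMPLE_HTML, BASE_URL)
for title, link in links:
    print(title, link)
```

Returning an empty list when the block is missing avoids the IndexError that indexing with [0] would raise if the page layout changes. The non-greedy match stops at the first closing div, so it only works while the column contains no nested div elements.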

Original post: https://www.cnblogs.com/zhuyu139/p/12067020.html