首页 > 编程语言 > 详细

python爬runoob目录链接栏

时间:2019-12-19 13:47:16      阅读:63      评论:0      收藏:0      [点我收藏+]
import re
import requests
url=https://www.runoob.com/python3/python3.html
response=requests.get(url)
html=response.text
response.encoding=utf-8
dl=re.findall(r<div class="design" id="leftcolumn">.*?</div>,html,re.S)[0]
tree=re.findall(rtitle="(.*?)".*?href="(.*?)",dl)
lst=[]
def get_data(link):
    lst.append(link)
    ht=requests.get(link)
    print(已下载,len(lst),)
for tree_info in tree:
    url=https://www.runoob.com/python3{}\n.format(tree_info[1])
    with open(D:\Desktop\测试\html.txt,a) as f:
        f.write(url)
    get_data(url)

python爬runoob目录链接栏

原文:https://www.cnblogs.com/zhuyu139/p/12067020.html

(0)
(0)
   
举报
评论 一句话评论(0
关于我们 - 联系我们 - 留言反馈 - 联系我们:wmxa8@hotmail.com
© 2014 bubuko.com 版权所有
打开技术之扣,分享程序人生!