首页 > 其他 > 详细

网易财经爬取

时间:2019-12-19 17:35:47      阅读:114      评论:0      收藏:0      [点我收藏+]

import requests
from lxml import etree

url = ‘http://quotes.money.163.com/old/‘
headers = {
‘User-Agent‘: ‘Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/79.0.3945.79 Safari/537.36‘
}

html = requests.get(url=url,headers=headers).text

tree = etree.HTML(html)

content = tree.xpath(‘//li[@qid="HS"]//li[@id="f0-f7"]/ul/li‘)
for con in content:
one = con.xpath(‘./a/text()‘)[0]
print(one)
two_list = con.xpath(‘./ul/li‘)
for t in two_list:
qid = t.xpath(‘./@qid‘)[0]
print(qid)
two = t.xpath(‘./a/text()‘)[0]
print(two)

网易财经爬取

原文:https://www.cnblogs.com/Iceredtea/p/12069065.html

(0)
(0)
   
举报
评论 一句话评论(0
关于我们 - 联系我们 - 留言反馈 - 联系我们:wmxa8@hotmail.com
© 2014 bubuko.com 版权所有
打开技术之扣,分享程序人生!