Beautifulsoup

时间：2020-05-17 22:56:54 阅读：87 评论：0 收藏：0 [点我收藏+]

Beautiful Soup：解析HTML页面信息标记与提取方法

获取网页源代码

import requests
from bs4 import BeautifulSoup

kv = {‘user-agent‘:‘Mozilla/5.0‘}
url = "https://python123.io/ws/demo.html"
r = requests.get(url,headers = kv)
print(r.status_code)
demo = r.text
soup = BeautifulSoup(demo,"html.parser")#解析
print(soup.prettify())

200
<html><head><title>This is a python demo page</title></head>
<body>
The demo python introduces several python courses.
Python is a wonderful general-purpose programming language. You can learn Python from novice to professional by tracking the following courses:
<a href="http://www.icourse163.org/course/BIT-268001" class="py1" id="link1">Basic Python</a> and <a href="http://www.icourse163.org/course/BIT-1001870001" class="py2" id="link2">Advanced Python</a>.
</body></html>

<html>
<head>
<title>
This is a python demo page
</title>
</head>
<body>


The demo python introduces several python courses.



Python is a wonderful general-purpose programming language. You can learn Python from novice to professional by tracking the following courses:
<a class="py1" href="http://www.icourse163.org/course/BIT-268001" id="link1">
Basic Python
</a>
and
<a class="py2" href="http://www.icourse163.org/course/BIT-1001870001" id="link2">
Advanced Python
</a>
.

</body>
</html>

BeautifulSoup的使用

技术分享图片

Beautifulsoup

原文：https://www.cnblogs.com/tingtin/p/12907452.html

踩

(0)

评论一句话评论（0）

分享档案

更多>

2021年09月23日 (328)
2021年09月24日 (313)
2021年09月17日 (191)
2021年09月15日 (369)
2021年09月16日 (411)
2021年09月13日 (439)
2021年09月11日 (398)
2021年09月12日 (393)
2021年09月10日 (160)
2021年09月08日 (222)