Python 中的网络爬虫

我的问题是是否有可能获得一个在这样的范围内的数字:


<html junk>

 <div class="test">

     <span>

     55

     </span>

 </div>

</html junk>

正如您所看到的,span 没有类或 id。


我当前的代码只是抓取工具的默认代码(删除了用户代理和 URL):


import requests

from bs4 import BeautifulSoup


URL = ''


headers = {"User-Agent": ''}


page = requests.get(URL, headers=headers)


soup = BeautifulSoup(page.content, 'html.parser')


#Here is where the "55" should be found (the number is going to change over time so im not excactly looking for it

title = soup.find('') 


print(title)


四季花海
浏览 120回答 1
1回答

UYOU

如果我正确理解你的问题,你正在尝试获取两个跨度标签之间的数字?如果是这样,你可以这样做。import requestsfrom bs4 import BeautifulSoupURL = ''headers = {"User-Agent": ''}page = requests.get(URL, headers=headers)soup = BeautifulSoup(page.text, 'html.parser')#Here is where the "55" should be found (the number is going to change over time so im not excactly looking for ittitle = soup.find('span').getText()&nbsp;print(title)
打开App,查看更多内容
随时随地看视频慕课网APP

相关分类

Python