猿问

python如何解析指定div标签中的标签

我已经尝试了这个网站上的其他解决方案,但仍然无法解决问题,我的问题是:

<div class="ds"><div title="Today" class="dh">...<div title="Pazartesi" class="dh">26 Agu Pzt

我想解析这个“ds”类中的所有 div 标题,谢谢你的帮助


宝慕林4294392
浏览 463回答 1
1回答

慕妹3146593

使用BeautifulSoup,lxml或类似模块代替regex.美丽汤:from bs4 import BeautifulSouptext = '<div class="ds"><div title="Today" class="dh">...<div title="Pazartesi" class="dh">26 Agu Pzt'soup = BeautifulSoup(text, 'html.parser')for item in soup.select('.ds div[title]'):&nbsp; &nbsp; print(item['title'])# or as list comprehensions&nbsp; &nbsp;&nbsp;titles = [item['title'] for item in soup.select('.ds div[title]')]print(titles)lxml:import lxml.htmltext = '<div class="ds"><div title="Today" class="dh">...<div title="Pazartesi" class="dh">26 Agu Pzt'soup = lxml.html.fromstring(text)for item in soup.cssselect('.ds div[title]'):&nbsp; &nbsp; print(item.attrib['title'])# or as list comprehensions&nbsp; &nbsp;&nbsp;titles = [item.attrib['title'] for item in soup.cssselect('.ds div[title]')]print(titles)查询:import pyquerytext = '<div class="ds"><div title="Today" class="dh">...<div title="Pazartesi" class="dh">26 Agu Pzt'soup = pyquery.PyQuery(text)for item in soup('.ds div[title]'):&nbsp; &nbsp; print(item.attrib['title'])# or as list comprehensions&nbsp; &nbsp;&nbsp;titles = [item.attrib['title'] for item in soup('.ds div[title]')]print(titles)parsel : (由Scrapy 的 Selectors使用)import parselsel = parsel.Selector(text)for item in sel.css('.ds div[title]'):&nbsp; &nbsp; print(item.attrib['title'])titles = [item.attrib['title'] for item in sel.css('.ds div[title]')]print(titles)
随时随地看视频慕课网APP

相关分类

Python
我要回答