为什么在使用的时候不能匹配到时间?如何解决?谢谢!

想做一个爬虫,使用BeautifulSoup假使获取以下内容:

<div class="authi"><img class="authicn vm" id="" src="static/image/common/online_member.gif" /><em id="">发表于 <span title="2013-4-2 08:30:11">4&nbsp;天前</span></em><span class="pipe">|</span><a href=>倒序浏览</a><div style="float:right;position:relative;top:-6px;"><ul><li style="float: left; width: 130px; height: 24px;"><wb:follow-button uid="" type="gray_2" width="136" height="24"></wb:follow-button></li><li style=" float:left; width: 160px; height: 24px;"></li></ul></div></div>

如何获取以上内容中的2013-4-2 08:30:11这个时间呢?
我试着写了一个

re.compile('^20\d{2}-\d+-\d+\s\d{2}:\d{2}:\d{2}')
红糖糍粑
浏览 120回答 2
2回答

弑天下

re.compile('20\d{2}-\d+-\d+\s\d{2}:\d{2}:\d{2}')

www说

#&nbsp;-*-&nbsp;coding:&nbsp;utf-8&nbsp;-*- from&nbsp;BeautifulSoup&nbsp;import&nbsp;BeautifulSoup&nbsp; import&nbsp;re def&nbsp;get_timestamp(html): &nbsp;&nbsp;&nbsp;&nbsp;soup&nbsp;=&nbsp;BeautifulSoup(html) &nbsp;&nbsp;&nbsp;&nbsp;authi_elems&nbsp;=&nbsp;soup.findAll('div',&nbsp;{&nbsp;'class':&nbsp;'authi'&nbsp;}) &nbsp;&nbsp;&nbsp;&nbsp;for&nbsp;authi_elem&nbsp;in&nbsp;authi_elems: &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;date_elem&nbsp;=&nbsp;authi_elem.find('span',&nbsp;title=re.compile(r'20\d{2}-\d+-\d+\s\d{2}:\d{2}:\d{2}')) &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;print&nbsp;date_elem.get('title') if&nbsp;__name__&nbsp;==&nbsp;'__main__': &nbsp;&nbsp;&nbsp;&nbsp;html&nbsp;=&nbsp;"""&nbsp;&nbsp;&nbsp;&nbsp;<div&nbsp;class="authi"> &nbsp;&nbsp;&nbsp;&nbsp;<img&nbsp;class="authicn&nbsp;vm"&nbsp;id=""&nbsp;src="static/image/common/online_member.gif"&nbsp;/> &nbsp;&nbsp;&nbsp;&nbsp;<em&nbsp;id="">发表于&nbsp;<span&nbsp;title="2013-4-2&nbsp;08:30:11">4&nbsp;天前</span></em> &nbsp;&nbsp;&nbsp;&nbsp;<span&nbsp;class="pipe">|</span><a&nbsp;href=>倒序浏览</a> &nbsp;&nbsp;&nbsp;&nbsp;<div&nbsp;style="float:right;position:relative;top:-6px;"><ul> &nbsp;&nbsp;&nbsp;&nbsp;<li&nbsp;style="float:&nbsp;left;&nbsp;width:&nbsp;130px;&nbsp;height:&nbsp;24px;"><wb:follow-button&nbsp;uid=""&nbsp;type="gray_2"&nbsp;width="136"&nbsp;height="24"></wb:follow-button></li> &nbsp;&nbsp;&nbsp;&nbsp;<li&nbsp;style="&nbsp;float:left;&nbsp;width:&nbsp;160px;&nbsp;height:&nbsp;24px;"></li> &nbsp;&nbsp;&nbsp;&nbsp;</ul></div></div> &nbsp;&nbsp;&nbsp;&nbsp;""" &nbsp;&nbsp;&nbsp;&nbsp;get_timestamp(html)
打开App,查看更多内容
随时随地看视频慕课网APP

相关分类

Python