soup.find('script',type='application/ld+json').text 返回空数据,为什么我无法提取文本。
>>> soup = BeautifulSoup(page.text,'lxml')
>>> soup.find('script',type='application/ld+json').text**
''
>>> soup.find('script',type='application/ld+json')
<script type="application/ld+json">{"@context":"http://schema.org","@type":"Organization","name":"Hamilton Medical Group - Dunkeld","url":"https://www.healthdirect.gov.au/australian-health-services/23000130/hamilton-medical-group-dunkeld/services/dunkeld-3294-sterling","contactPoint":{"@type":"ContactPoint","telephone":"03 5572 2422","email":"","website":"http://www.hamiltonmedicalgroup.net.au","fax":"03 5571 1606"},"address":{"@type":"PostalAddress","streetAddress":"14 Sterling Street","addressLocality":"DUNKELD","addressRegion":"VIC","postalCode":"3294","addressCountry":"AU"}}</script>
>>> json.loads(soup.find('script',type='application/ld+json'))
Traceback (most recent call last):
File "<stdin>", line 1, in <module>
NameError: name 'json' is not defined
>>> import json
>>> json.loads(soup.find('script',type='application/ld+json'))
Traceback (most recent call last):
File "<stdin>", line 1, in <module>
File "C:\Users\*******\Python38\lib\json\__init__.py", line 341, in loads
raise TypeError(f'the JSON object must be str, bytes or bytearray, '
TypeError: the JSON object must be str, bytes or bytearray, not Tag
喵喔喔
相关分类