python如何正确抓取网页标题

通过urllib将网页内容抓取下来，然后用正则表达式re模块将标题匹配出来，但是发现部分标题会出现问题，比如下面抓Apple的代码运行结果是App，测试发现匹配结果m是没有问题的，问题出现在了strip()这里。#-*-coding:utf-8-*-importurllibimportreurl='http://apple.com'html=urllib.urlopen(url).read()#printhtmlm=re.search(".*",html)printm.group()#这里输出结果Appleprintm.group().strip("")#问题应该出现在这个正则

慕桂英546537

浏览 362回答 2