报错信息:
craw 1:http://baike.baidu.com/view/21087.htm
craw failed
Traceback (most recent call last):
File "E:\python\python_practice1\baidubaike\spider_main.py", line 39, in <module>
obj_spider.craw(root_url)
File "E:\python\python_practice1\baidubaike\spider_main.py", line 34, in craw
self.outputer.output_html()
TypeError: output_html() missing 1 required positional argument: 'self'
贴上html_outputer代码:
class HtmlOutputer(object):
def __init__(self):
self.datas = []
def collect_data(self,data):
if data is None:
return
self.datas.append(data)
def output_html(self):
fout = open('output.html','w')
fout.write('<html>')
fout.write('<body>')
fout.write('<table>')
for data in self.datas:
fout.write('<tr>')
fout.write('<td>%s</td>'% data['url'])
fout.write('<td>%s</td>'% data['title'].encode('utf-8'))
fout.write('<td>%s</td>'% data['summary'].encode('utf-8'))
fout.write('</tr>')
fout.write('</table>')
fout.write('</body>')
fout.write('</html>')
fout.close()
慕粉3843115
相关分类