How do I scrape only the text?

Code:

import scrapy

class BlogSpider(scrapy.Spider):
    name = 'bijouterie'
    start_urls = ['https://www.example.com']

    def parse(self, response):
        for post in response.css('#engine-results .drs'):
            yield {'title': post.css('a.moodalbox.response').get()}

Command used to run it (Windows 10):

scrapy runspider C:\Users\DELL\Desktop\icscrape\bijouterie.py -o posts.csv

I only want to scrape the text, not the whole HTML element with its tags and attributes.

萧十郎

Smart猫小萌

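Just append ::text to the end of the CSS selector, for example {'title': post.css('a.moodalbox.response::text').get()}. Without ::text, .get() returns the serialized HTML of the matched element; with it, .get() returns the first text node directly inside that element.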
