在Scrapy中抓取元素之间的文本

首页课程实战体系课手记专栏慕课教程

在Scrapy中抓取元素之间的文本

我正在使用 Scrapy，我正在尝试抓取这样的内容：

<html>

some elements

</div>

<div>

<h3> title </h3>

<h4> another title </h4>

"some text without any tag"

some elements

</div>

some elements

</div>

some elements

</div>

</html>

如果我想从类名“hi There”的 div 下以及类名“article”的 div 之前的所有元素中提取文本，是否有任何可能的方法使用 XPath 或 CSS 选择器进行枯萎？

米脂

浏览 73回答 1

1回答

倚天杖

没用过Scrapy。不知道它有什么功能，但是//div[@class='hi there']/div/(div[@class='article'])[1]/preceding-sibling::*挑选出 div 之前具有“article”类的元素，并且，//div[@class='hi there']/div/(div[@class='article'])[1]/preceding-sibling::text()在文章 div 之前为您提供内部文本。

0 0

随时随地看视频慕课网APP