使用python在html上提取<label>标签

我想提取网页，如： https://www.glassdoor.com/Overview/Working-at-Apple-EI_IE1138.11,16.htm，所以我想以以下格式返回结果。

Website Headquarters Size Revenue Type

www.apple.com Cupertino, CA 10000+ employees $10+ billion (USD) per year Company - Public (AAPL)

然后我使用下面的代码beatifulsoup来得到这个。

all_href = com_soup.find_all('span', {'class': re.compile('value')})

all_href = list(set(all_href))

它返回带有. 此外，它没有在下面显示标签<label>

[ Computer Hardware & Software,

Company - Public (AAPL) ,

10000+ employees,

$10+ billion (USD) per year,

,

Cupertino, CA,

1976,

,

<a class="link" href="http://www.apple.com" rel="nofollow noreferrer" target="_blank">www.apple.com</a>]

呼唤远方

浏览 351回答 2