在没有类的情况下在 BeautifulSoup 中获取第一个（或特定的）td

3回答

波斯汪

您可以使用 CSS 选择器nth-of-type(n)。它适用于两个链接：import requestsfrom bs4 import BeautifulSoupurl = "https://system.gotsport.com/org_event/events/1271/schedules?age=19&gender=m"soup = BeautifulSoup(requests.get(url).content, "html.parser")for tag in soup.select(".small-margin-bottom td:nth-of-type(1)"):    print(tag.text.strip())输出：OCYSFL RushJacksonville FCAtlanta UnitedSSA......Real Salt Lake U19Real ColoradoEmpire United Soccer Academy

0 0

慕田峪4524236

每个括号对应一个“面板”，每个面板有两行，第一行包含比赛表中所有球队的第一个表。def main():    import requests    from bs4 import BeautifulSoup    url = "https://system.gotsport.com/org_event/events/1271/schedules?age=19&gender=m"    response = requests.get(url)    response.raise_for_status()        soup = BeautifulSoup(response.content, "html.parser")    for panel in soup.find_all("div", {"class": "panel-body"}):        for row in panel.find("tbody").find_all("tr"):            print(row.find("td").text.strip())        return 0if __name__ == "__main__":    import sys    sys.exit(main())输出：OCYSFL RushJacksonville FCAtlanta UnitedSSAMiami Rush Kendall SCIMGTampa Bay UnitedWeston FCChargers SCSouth Florida FASolar SCRISE SC...

0 0

炎炎设计

我认为问题出在表的标题上，它包含th元素而不是td元素。当您尝试从空列表中检索第一个元素时，它会导致范围索引错误。尝试添加长度检查td：for row in rows:    team = row.find_all('td')    if(len(team) > 0):        teamName = team[0].text.strip()            print(teamName)它应该打印出团队名称。

0 0