猿问

如何使用 Python 使用 Selenium Ubuntu EC2 解决

我正在尝试提取此页面中“Nb B”列的所有数据: https: //www.coteur.com/cotes-foot.php


当我从 Ubuntu PC 运行我的代码时,它运行得很好,但是当我尝试使用 EC2 Ubuntu 时,它并没有给我很好的回报。


这是我的服务器:ubuntu/images/hvm-ssd/ubuntu-bionic-18.04-amd64-server-20200611 (ami-0a63f96e85105c6d3)


这是 python 脚本:


#!/usr/bin/python3

# -*- coding: utf­-8 ­-*-


from selenium import webdriver

from selenium.webdriver.chrome.options import Options


options = Options()

options.headless = True

options.add_argument("window-size=1400,800")

options.add_argument("--no-sandbox")

options.add_argument("--disable-gpu")

driver = webdriver.Chrome(options=options)


driver.get('https://www.coteur.com/cotes-foot.php')


#Store url associated with the soccer games

url_links = []

for i in driver.find_elements_by_xpath('//a[contains(@href, "match/cotes-")]'):

    url_links.append(i.get_attribute('href'))


print(len(url_links), '\n')


nb_bookies = []

for i in driver.find_elements_by_xpath('//tr[@id and @role="row" ]/td[last()]'):

    nb_bookies.append(i.text)


print(nb_bookies)

这是输出:


Traceback (most recent call last):

  File "./coteurchrome.py", line 25, in <module>

    url_links.append(i.get_attribute('href'))

  File "/home/ubuntu/.local/lib/python3.6/site-packages/selenium/webdriver/remote/webelement.py", line 141, in get_attribute

    self, name)

  File "/home/ubuntu/.local/lib/python3.6/site-packages/selenium/webdriver/remote/webdriver.py", line 636, in execute_script

    'args': converted_args})['value']

  File "/home/ubuntu/.local/lib/python3.6/site-packages/selenium/webdriver/remote/webdriver.py", line 321, in execute

    self.error_handler.check_response(response)

  File "/home/ubuntu/.local/lib/python3.6/site-packages/selenium/webdriver/remote/errorhandler.py", line 242, in check_response

    raise exception_class(message, screen, stacktrace)

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: element is not attached to the page document

  (Session info: headless chrome=83.0.4103.116)


心有法竹
浏览 121回答 1
1回答

慕桂英3389331

要打印href属性的值,您必须为引入WebDriverWaitvisibility_of_all_elements_located()并且您可以使用以下任一定位器策略:使用XPATH:driver.get("https://www.coteur.com/cotes-foot.php")print([my_elem.get_attribute("href") for my_elem in WebDriverWait(driver, 10).until(EC.visibility_of_all_elements_located((By.XPATH, "//a[contains(@href, 'match/cotes-')]")))])使用CSS_SELECTOR:driver.get("https://www.coteur.com/cotes-foot.php")print([my_elem.get_attribute("href") for my_elem in WebDriverWait(driver, 10).until(EC.visibility_of_all_elements_located((By.CSS_SELECTOR, "a[href*='match/cotes-']")))])控制台输出:['https://www.coteur.com/match/cotes-millwall-blackburn-rid1109557.html', 'https://www.coteur.com/match/cotes-cardiff-derby-county-rid1109558.html', 'https://www.coteur.com/match/cotes-sheffield-w-huddersfield-rid1109559.html', 'https://www.coteur.com/match/cotes-luton-queens-park-rangers-rid1109563.html', 'https://www.coteur.com/match/cotes-chelsea-norwich-rid1109746.html', 'https://www.coteur.com/match/cotes-atalanta-brescia-rid1109694.html', 'https://www.coteur.com/match/cotes-gil-vicente-tondela-rid1109683.html', 'https://www.coteur.com/match/cotes-benfica-vitoria-guimaraes-rid1109899.html', 'https://www.coteur.com/match/cotes-orlando-city-new-york-city-fc-rid1109292.html', 'https://www.coteur.com/match/cotes-philadelphia-union-inter-miami-cf-rid1109809.html', 'https://www.coteur.com/match/cotes-ural-ekaterinburg-arsenal-tula-rid1109831.html', 'https://www.coteur.com/match/cotes-spartak-moscou-terek-akhmat-grozni-rid1109833.html', 'https://www.coteur.com/match/cotes-krylya-sovetov-fc-krasnodar-rid1109834.html', 'https://www.coteur.com/match/cotes-moreirense-pacos-ferreira-rid1109682.html', 'https://www.coteur.com/match/cotes-brentford-preston-rid1109561.html', 'https://www.coteur.com/match/cotes-bristol-c-stoke-rid1109568.html', 'https://www.coteur.com/match/cotes-fk-bodo-glimt-kristiansund-bk-rid1109735.html', 'https://www.coteur.com/match/cotes-molde-viking-rid1109736.html', 'https://www.coteur.com/match/cotes-sandefjord-aalesund-rid1109737.html', 'https://www.coteur.com/match/cotes-stromsgodset-mjondalen-rid1109738.html', 'https://www.coteur.com/match/cotes-valerenga-fk-haugesund-rid1109740.html', 'https://www.coteur.com/match/cotes-shakhtar-donetsk-kolos-kovalivka-rid1109948.html']注意:您必须添加以下导入:from selenium.webdriver.support.ui import WebDriverWaitfrom selenium.webdriver.common.by import Byfrom selenium.webdriver.support import expected_conditions as EC
随时随地看视频慕课网APP

相关分类

Python
我要回答