获取抓取日期 - Python & BeautifulSoup4

我想使用以下代码在 CSV 文件中报告抓取的确切日期:


import bs4

from bs4 import BeautifulSoup

import requests

import pandas

from pandas import DataFrame

import csv


with open('sortlist_scrap.csv', mode='w') as csv_file:

   fieldnames = ['Link', 'Rank', 'Date']

   writer = csv.DictWriter(csv_file, fieldnames=fieldnames)

   writer.writeheader()


Found_link = []

Found_Rank = []

Found_date = []


url = "https://www.sortlist.fr/identite-visuelle/paris-fr?page={}"


for page_num in range(1, 100):

   page = requests.get(url.format(page_num))


link = soup.select_one('a[title*="Dupont Lewis"]')

previous_md_headlines = link.find_all_previous("a", {"class": "md-headline"})

next_md_headlines = link.find_all_next("a", {"class": "md-headline"})


position = 50 - len(next_md_headlines )

today = date.today()

page = sortlistscrap



Found_link.append(page)

Found_Rank.append(position)

Found_date.append(today)


data = { 'Found_Rank': Rank,

'Found_link':Found_link, 'Found_date':article_date}


df = DataFrame(data, columns = ['Found_link','Found_Rank','Found_date'])

df.to_csv(r'C:\Users\MacOS\Desktop\sortlist_scrap.csv')

但我收到以下错误:NameError:名称“日期”未定义


有人可以帮忙吗?谢谢!


慕虎7371278
浏览 36回答 1
1回答

临摹微笑

您可能想使用datetime.datetime.today(). 所以首先添加from datetime import datetime到代码的开头。并使用today = datetime.today()代替today = date.today()
打开App,查看更多内容
随时随地看视频慕课网APP

相关分类

Python