与 Databricks 笔记本中的 Blob 存储文件交互的过程

如果您想使用包pandas从 Azure blob 读取 CSV 文件，对其进行处理并将此 CSV 文件写入 Azure Databricks 中的 Azure blob，我建议您将 Azure blob 存储挂载为 Databricks 文件系统，然后执行此操作。欲了解更多详情，请参阅此处。例如装载 Azure 斑点dbutils.fs.mount(  source = "wasbs://<container-name>@<storage-account-name>.blob.core.windows.net",  mount_point = "/mnt/<mount-name>",  extra_configs = {"fs.azure.account.key.<storage-account-name>.blob.core.windows.net":"<account access key>"})处理 csvimport osimport globimport pandas as pdos.chdir(r'/dbfs/mnt/<mount-name>/<>')allFiles = glob.glob("*.csv") # match your csvsfor file in allFiles:    print(f" The old content of  file {file} : ")    df= pd.read_csv(file, header=None)    print(df)    df = df.iloc[4:,]    df.to_csv(file, index=False,header=False)    print(f" The new content of  file {file} : ")    df= pd.read_csv(file,header=None)    print(df)    break

与 Databricks 笔记本中的 Blob 存储文件交互的过程

2回答