如何将 AWS Athena 的多记录多行 JSON 转换为每记录单行 JSON？

3回答

慕尼黑8549860

我最终做了一些快速而肮脏的事情import jsonwith open('data.json') as jfile:    data = json.load(jfile)    for d in data:        print(json.dumps(d) + ',')哪个打印{'id': 200, 'name': 'bob', 'data': '<other> \n <xml> \n <data>'},{"id": 200, "name": "bob", "data": "<other> \n <xml> \n <data>"},刚刚将输出保存到另一个文件：P结果失败了，因为文件太大了，但是嘿..已经很接近了！

0 0

LEATH

使用正则表达式import rehtml = '''{  "id" : 10,  "name" : "bob",  "data" : "<some> \n <xml> \n <in here>"},{  "id" : 20,  "name" : "jane",  "data" : "<other> \n <xml> \n <in here>"}'''def replaceReg(html, regex, new):    return re.sub(re.compile(regex), new, html)html = replaceReg(html,' \n ',' ')html = replaceReg(html,'{[\s]+','{ ')html = replaceReg(html,'[\s]+}',' }')html = replaceReg(html,',[\s]+',', ')html = replaceReg(html,'}, ','\n')print (html)结果：{ "id" : 10, "name" : "bob", "data" : "<some> <xml> <in here>" { "id" : 20, "name" : "jane", "data" : "<other> <xml> <in here>" }

0 0

HUH函数

您只需要在写入另一个文件时替换结束换行符（\n ）：s=''with open('input.txt','r') as f_in, open('output.txt', 'w') as f_out:    for line in f_in:                s += line.replace('\n', '')    f_out.write(s)其中 input.txt 具有以下数据：{  "id" : 10,  "name" : "bob",  "data" : "<some> \n <xml> \n <in here>"},{  "id" : 20,  "name" : "jane",  "data" : "<other> \n <xml> \n <in here>"}

0 0