How do I "isolate" the content between elements of interest while iterating over a list in Python 3?
I have a list:
list = ["<h1> question 1", "question 1 content", "question 1 more content", "<h1> answer 1", "answer 1 content", "answer 1 more content", "<h1> question 2", "question 2 content", "<h> answer 2", "answer 2 content"]
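In this list, some elements carry an <h> tag and the others have no tag. The idea is that an element with this tag is a "header", and the elements that follow it, up to the next tagged element, are its content.
How can I join the list elements that belong to each header so that I end up with two lists of equal size: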
headers = ["<h1> question 1", "<h1> answer 1", "<h1> question 2", "<h> answer 2"]
content = ["question 1 content question 1 more content", "answer 1 content answer 1 more content", "question 2 content", "answer 2 content"]
Both lists have the same length, in this case 4 elements each.
I was able to separate the parts, but I could use some help finishing it:
list = ["<h1> question 1", "question 1 content", "question 1 more content", "<h1> answer 1", "answer 1 content", "answer 1 more content", "<h1> question 2", "question 2 content", "<h> answer 2", "answer 2 content"]
headers = []
content = []
for i in list:
    if "<h1>" in i:
        headers.append(i)
    if "<h1>" not in i:
        tempContent = []
        tempContent.append(i)
        content.append(tempContent)
Any ideas on how to combine these texts so that the two lists correspond one-to-one?
Thanks!
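One possible way to get the one-to-one pairing described above is to open a new content group whenever a header is seen and to append every later item to the most recent group. The following is only a minimal sketch, not a definitive solution. The function name `split_headers_and_content` and the `marker` parameter are hypothetical names introduced for illustration. It assumes that a header is any element starting with `<h`, because the sample data contains both `<h1>` and `<h>`, and that items appearing before the first header can be skipped. The list is named `items` here to avoid shadowing the built-in `list`.

```python
def split_headers_and_content(items, marker="<h"):
    """Pair each header with the joined text of the items that follow it."""
    headers = []
    groups = []
    for item in items:
        if item.startswith(marker):
            # A header opens a new, initially empty content group.
            headers.append(item)
            groups.append([])
        elif groups:
            # A non-header item belongs to the most recent header.
            groups[-1].append(item)
        # Assumption: items before the first header are ignored.
    # Join each group with spaces so there is one string per header.
    content = [" ".join(group) for group in groups]
    return headers, content


items = ["<h1> question 1", "question 1 content", "question 1 more content",
         "<h1> answer 1", "answer 1 content", "answer 1 more content",
         "<h1> question 2", "question 2 content",
         "<h> answer 2", "answer 2 content"]

headers, content = split_headers_and_content(items)
print(headers)
print(content)
```

Because a group is created at the same moment its header is appended, the two lists always have the same length, and a header with nothing after it simply gets an empty string.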
catspeake
慕勒3428872