Python解决方案，可将HTML表转换为可读的纯文本

如何使用这个：解析HTML表到Python列表？但是，请使用collections.OrderedDict()而不是简单的字典来保留顺序。有了字典后，从字典中获取文本并设置其格式非常非常容易：使用@Colt 45的解决方案：import xml.etree.ElementTreeimport collectionss = """\<table>    <tr>        <th>Height</th>        <th>Width</th>        <th>Depth</th>    </tr>    <tr>        <td>10</td>        <td>12</td>        <td>5</td>    </tr>    <tr>        <td>0</td>        <td>3</td>        <td>678</td>    </tr>    <tr>        <td>5</td>        <td>3</td>        <td>4</td>    </tr></table>"""table = xml.etree.ElementTree.XML(s)rows = iter(table)headers = [col.text for col in next(rows)]for row in rows:    values = [col.text for col in row]    for key, value in collections.OrderedDict(zip(headers, values)).iteritems():        print key, value输出：Height 10Width 12Depth 5Height 0Width 3Depth 678Height 5Width 3Depth 4

Python解决方案，可将HTML表转换为可读的纯文本

3回答