Converting HTML to UBB in Python


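This is a Python implementation of HTML-to-UBB conversion. People usually convert in the other direction, from UBB to HTML, but I had a somewhat unusual need to turn HTML into UBB. It is implemented entirely with regular expressions and converts only a subset of tags. The Python code is as follows: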

import re


def Html2UBB(content):
    # Convert HTML tags to UBB tags.
    # [\s\S] matches any character, including newlines.
    pattern = re.compile(r'<a href="([\s\S]+?)"[^>]*>([\s\S]+?)</a>', re.I)
    content = pattern.sub(r'[url=\1]\2[/url]', content)
    pattern = re.compile(r'<img[^>]+src="([^"]+)"[^>]*>', re.I)
    content = pattern.sub(r'[img]\1[/img]', content)
    pattern = re.compile(r'<strong>([\s\S]+?)</strong>', re.I)
    content = pattern.sub(r'[b]\1[/b]', content)
    # The colour value itself becomes the tag name, e.g. [red]text[/red].
    pattern = re.compile(r'<font color="([\s\S]+?)">([\s\S]+?)</font>', re.I)
    content = pattern.sub(r'[\1]\2[/\1]', content)
    # Strip every remaining HTML tag.
    pattern = re.compile(r'<[^>]*?>', re.I)
    content = pattern.sub('', content)
    # Convert HTML entities to plain characters.
    content = content.replace('&lt;', '<')
    content = content.replace('&gt;', '>')
    content = content.replace('&rdquo;', '”')
    content = content.replace('&ldquo;', '“')
    content = content.replace('&quot;', '"')
    content = content.replace('&copy;', '©')
    content = content.replace('&reg;', '®')
    content = content.replace('&nbsp;', ' ')
    content = content.replace('&mdash;', '—')
    content = content.replace('&ndash;', '–')
    content = content.replace('&lsaquo;', '‹')
    content = content.replace('&rsaquo;', '›')
    content = content.replace('&hellip;', '…')
    # &amp; goes last so that text such as "&amp;lt;" is not decoded twice.
    content = content.replace('&amp;', '&')
    return content
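
The same idea can be sketched in a more compact, table-driven form. This is only an illustration, not the code above: the names `html_to_ubb` and `_RULES` are hypothetical, and the standard library's `html.unescape` is swapped in for the hand-written entity replacements. It covers only the link, image and bold rules, and it shares the limits of any regex approach (nested or malformed markup is not handled).

```python
import html
import re

# Hypothetical rule table: (compiled pattern, UBB replacement), applied in order.
_RULES = [
    # <a href="...">text</a>  ->  [url=...]text[/url]
    (re.compile(r'<a\s+[^>]*?href="([^"]*)"[^>]*>(.*?)</a>', re.I | re.S),
     r'[url=\1]\2[/url]'),
    # <img src="...">  ->  [img]...[/img]
    (re.compile(r'<img\s+[^>]*?src="([^"]*)"[^>]*>', re.I),
     r'[img]\1[/img]'),
    # <strong>text</strong>  ->  [b]text[/b]
    (re.compile(r'<strong>(.*?)</strong>', re.I | re.S),
     r'[b]\1[/b]'),
    # Any tag that is still left is dropped.
    (re.compile(r'<[^>]*>'), ''),
]


def html_to_ubb(content):
    """Apply each rule in order, then decode HTML entities in one pass."""
    for pattern, replacement in _RULES:
        content = pattern.sub(replacement, content)
    # Entities are decoded last so that &lt;...&gt; text is not stripped as a tag.
    return html.unescape(content)


if __name__ == '__main__':
    print(html_to_ubb('<p><a href="http://example.com">link</a> '
                      '<strong>bold</strong> &amp; &lt;ok&gt;</p>'))
```

One difference to be aware of: `html.unescape` decodes `&nbsp;` to a non-breaking space (U+00A0) rather than an ordinary space, so add a `.replace('\xa0', ' ')` if plain spaces are needed.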
