Python乱码问题又来了,访问繁体字网站出乱码怎么破?requests访问,pythonrequests,In [12]: r =


In [12]: r = requests.get('http://www.hkex.com.hk/chi/stat/smstat/dayquot/d170202c.htm')In [13]: r.text[-100:]Out[13]: '´Á®§$2.60\r\n  717 \xad^¬ÓÃÒ¨é         ¥½´Á®§$0.0128\r\n\r\n2. «ü¼Æ\xad×\xadq\r\n\r\n-\r\n\r\n</font></pre></body></html>\r\n

根据该网页的代码:

<html><head><meta name='MS.LOCALE' content='ZH-TW'><title>Hong Kong Exchanges and Clearing Limited</title><meta http-equiv='Content-Type' content='text/html; charset=big5'>

需指定字符编码为 big5:

r = requests.get('http://www.hkex.com.hk/chi/stat/smstat/dayquot/d170202c.htm')r.encoding = 'big5'

编橙之家文章,

评论关闭