Python: batch-fetching the source of highlighted code blocks from oschina


Shared for reference only.

# Python 2 script; requires the BeautifulSoup 3 package.
import re
import urllib2
from BeautifulSoup import BeautifulSoup

def readPage(Url):
    # Fetch one snippet page and print the node that follows its first <pre> tag.
    page = urllib2.urlopen(Url).read()
    pageContent = BeautifulSoup(page)
    if pageContent.find('pre'):
        preHandleCode = pageContent.find('pre').next
        print preHandleCode
    else:
        print 'No code'

# Fetch one listing page and collect every snippet URL that appears on it.
www = urllib2.urlopen('http://www.oschina.net/code/list/7/python?show=time&p=7')
msg = www.read()
find=r'(http://www.oschina.net/code/snippet_\d+_\d+)'
ak=re.findall(find,msg)
if ak is not None:
    for i in ak:
        print i
        readPage(i)
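The script above targets Python 2 (urllib2 and BeautifulSoup 3). For readers on Python 3, here is a minimal sketch of the same two steps using only the standard library. It is an illustration, not the author's code: the names FirstPreExtractor, first_pre_text, find_snippet_urls, and fetch are hypothetical, the UTF-8 default in fetch is an assumption, duplicate URLs are skipped (the original printed them all), and it collects all text inside the first <pre> rather than only the node that follows the tag.

```python
import re
from html.parser import HTMLParser
from urllib.request import urlopen

# Same URL pattern as the original script, with the dots escaped.
SNIPPET_RE = re.compile(r'http://www\.oschina\.net/code/snippet_\d+_\d+')


class FirstPreExtractor(HTMLParser):
    """Collects the text inside the first <pre> element of a page."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.depth = 0      # greater than 0 while inside the first <pre>
        self.done = False   # True once the first <pre> has been closed
        self.found = False  # True if any <pre> was seen
        self.parts = []

    def handle_starttag(self, tag, attrs):
        if tag == 'pre' and not self.done:
            self.depth += 1
            self.found = True

    def handle_endtag(self, tag):
        if tag == 'pre' and self.depth > 0:
            self.depth -= 1
            if self.depth == 0:
                self.done = True

    def handle_data(self, data):
        if self.depth > 0:
            self.parts.append(data)


def first_pre_text(html):
    """Return the text of the first <pre> block, or None if there is none."""
    parser = FirstPreExtractor()
    parser.feed(html)
    parser.close()
    return ''.join(parser.parts) if parser.found else None


def find_snippet_urls(html):
    """Return snippet URLs in order of appearance, without duplicates."""
    seen = []
    for url in SNIPPET_RE.findall(html):
        if url not in seen:
            seen.append(url)
    return seen


def fetch(url, encoding='utf-8'):
    """Download a page and decode it; the default encoding is an assumption."""
    with urlopen(url) as resp:
        return resp.read().decode(encoding, errors='replace')


# Usage (needs network access; the site layout may have changed since the post):
# listing = fetch('http://www.oschina.net/code/list/7/python?show=time&p=7')
# for url in find_snippet_urls(listing):
#     print(url)
#     print(first_pre_text(fetch(url)) or 'No code')
```

Keeping the parsing functions separate from fetch means they can be tried on a saved HTML string without touching the network.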
