python正则表达式提取网页URL,python正则表达式,python正则表达式提


python正则表达式提取网页URL

import reimport urlliburl="http://www.open-open.com"s=urllib.urlopen(url).read()ss=s.replace(" ","")urls=re.findall(r"<a.*?href=.*?<\/a>",ss,re.I)for i in urls:print ielse:print 'this is over'

评论关闭