python提取网址URL的域名部分,python提取url域名,#!/usr/bin/p
文章由Byrx.net分享于2019-03-23 08:03:44
python提取网址URL的域名部分,python提取url域名,#!/usr/bin/p
#!/usr/bin/python# Desc : fetch the domain name of websiteimport reimport optparsedef get_filename(): p = optparse.OptionParser() p.add_option('-f', action = 'store', dest = 'filename', \ help = 'the directory you want monitoring') opt, args = p.parse_args() if None == opt.filename: p.error('Need a file !') else: filename = opt.filename return filenamefilename = get_filename()rc = re.compile(r'\w+\.(?:com\.cn|net\.cn|gov\.cn|org\.cn|com|cn|edu|gov|org|cc|net|tk|biz|info|tv|pro|co|ad|ae|af|ag|ai|al|am|an|ao|aq|ar|as|at|au|aw|az|ba|bb|bd|be|bf|bg|bh|bi|bj|bm|bn|bo|br|bs|bt|bv|bw|by|bz|ca|cc|cf|cg|ch|ci|ck|cl|cm|cn|co|cq|cr|cu|cv|cx|cy|cz|de|dj|dk|dm|do|dz|ec|ee|eg|eh|es|et|ev|fi|fj|fk|fm|fo|fr|ga|gb|gd|ge|gf|gh|gi|gl|gm|gn|gp|gr|gt|gu|gw|gy|hk|hm|hn|hr|ht|hu|id|ie|il|in|io|iq|ir|is|it|jm|jo|jp|ke|kg|kh|ki|km|kn|kp|kr|kw|ky|kz|la|lb|lc|li|lk|lr|ls|lt|lu|lv|ly|ma|mc|md|mg|mh|ml|mm|mn|mo|mp|mq|mr|ms|mt|mv|mw|mx|my|mz|na|nc|ne|nf|ng|ni|nl|no|np|nr|nt|nu|nz|om|pa|pe|pf|pg|ph|pk|pl|pm|pn|pr|pt|pw|py|qa|re|ro|ru|rw|sa|sb|sc|sd|se|sg|sh|si|sj|sk|sl|sm|sn|so|sr|st|su|sy|sz|tc|td|tf|tg|th|tj|tk|tm|tn|to|tp|tr|tt|tv|tw|tz|ua|ug|uk|us|uy|va|vc|ve|vg|vn|vu|wf|ws|ye|yu|za|zm|zr|zw|me)$', re.I)line_num=1try: for line in open(filename): print rc.search(line).group() line_num += 1except AttributeError: print '\nError:\nline_num:%s\ndomain_name:%s' %(str(line_num), line) raise SystemExit
相关内容
- python入门代码:打印圣诞树,python圣诞树,# Happy Holi
- HTMLParser笔记,,HTMLParser笔记
- Python Generators 用法,pythongenerators,def fact():
- python生成不重复的随机数代码,python随机数代码,impor
- 比较一个文件夹与另一同名文件夹多出哪些文件,并复制
- 比系统自带的更加友好的makedir函数,自带makedir函数,
- Python 读取mp3 ID3信息,python读取mp3id3,from mutagen
- 批量转换Linux文件权限,批量转换linux权限,使用窗口管
- python使用自定义的user-agent抓取网页,pythonuser-agent,imp
- python根据ip地址反向查找主机名称,pythonip查找主机,i
评论关闭