Scraping Douban MM images with python3 and bs4



Python 3.3 + BeautifulSoup

1. [Code] Scraping Douban MM images with python3 and bs4 [Python]

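#!/usr/bin/env python
import os
import urllib.request

from bs4 import BeautifulSoup

SAVE_DIR = r'D:\doubanmeizi'  # raw string so the backslash is kept literally


def crawl(url):
    # Send a browser-like User-Agent with the request.
    headers = {'User-Agent': 'Mozilla/5.0 (Windows; U; Windows NT 6.1; en-US; rv:1.9.1.6) Gecko/20091201 Firefox/3.5.6'}
    req = urllib.request.Request(url, headers=headers)
    page = urllib.request.urlopen(req, timeout=20)
    contents = page.read()
    # Name the parser explicitly; bs4 warns when it has to guess.
    soup = BeautifulSoup(contents, 'html.parser')
    for girl in soup.find_all('img'):
        link = girl.get('src')
        if not link:
            continue  # skip <img> tags that have no src attribute
        print(link)
        content2 = urllib.request.urlopen(link, timeout=20).read()
        # The last 11 characters of the URL serve as the file name.
        with open(os.path.join(SAVE_DIR, link[-11:]), 'wb') as code:
            code.write(content2)


# Create the target folder if it does not exist yet.
os.makedirs(SAVE_DIR, exist_ok=True)

page_start = 0
page_stop = 10
# Request pager_offset 1 through 10, as the original loop did.
for page in range(page_start + 1, page_stop + 1):
    url = 'http://www.dbmeinv.com/?pager_offset=%s' % page
    crawl(url)
print("MM images downloaded.")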
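The script names each saved file with link[-11:], the last 11 characters of the image URL. That can cut into a query string or a path separator, and two different URLs may end in the same 11 characters. A minimal sketch of one alternative follows, using only the standard library. The helper name filename_from_url, its default argument, and the example.com URL are assumptions made for illustration and are not part of the original script.

```python
import posixpath
from urllib.parse import urlsplit, unquote


def filename_from_url(link, default='image.jpg'):
    """Hypothetical helper: return the last path segment of an image URL."""
    # urlsplit separates the path from the scheme, host, query and fragment.
    path = urlsplit(link).path
    # URL paths always use forward slashes, so posixpath is the right module
    # even when the script runs on Windows.
    name = posixpath.basename(unquote(path))
    # Fall back to a fixed name when the URL has no file component.
    return name or default


# Illustrative URL only; the query string is ignored.
print(filename_from_url('http://example.com/pics/p12345678.jpg?size=large'))
```

In the script, the result could be passed to open() in place of link[-11:]. Name collisions between different pages would still need separate handling.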

Article from 编橙之家
