爬虫状态码返回状态200,自己访问400,这是什么原因?，爬虫400,import urlli

文章由Byrx.net分享于2019-03-23 09:03:08评论（526）

爬虫状态码返回状态200,自己访问400,这是什么原因?，爬虫400,import urlli

import urllib2opener = urllib2.build_opener()html = Noneresponse = Noneresponse = opener.open('http://www.sxxrcs.com/was5/web/')html = response.codeprint html

比如这个爬虫，输出状态码是200。

可是直接访问http://www.sxxrcs.com/was5/web/是404，抓包响应的也是404，请问这是为什么？

用requests吧
import requestsr = requests.get('http://www.sxxrcs.com/was5/web/')print r.status_codeprint r.text
200正常啊，requests方便快捷。

编橙之家文章，

热门文章：

Python requests爬虫编码encoding error是什么问题，
适合Python应用的Vim缩进调试方法，pythonvim缩进
python list列表append方法的性能问题，pythonapp
Python有没有开源包处理GBK Unicode编码问题，
了解python flask.Response(generator())流内容处理的
Ubuntu火狐浏览器可以用python脚本来控制吗?，

爬虫状态码返回状态200,自己访问400,这是什么原因?，爬虫400,import urlli

爬虫状态码返回状态200,自己访问400,这是什么原因?，爬虫400,import urlli

相关内容

最新python问答

python~HOT