对比浏览器,casperjs,httpclient的Header信息

		@Override
		protected void doGet(HttpServletRequest req, HttpServletResponse res) throws ServletException, IOException
		{
			String type=req.getParameter("type");
			Enumeration es=req.getHeaderNames();
			
			System.out.println("Request type is : "+type);
			
			while(es.hasMoreElements()){
				//stem.out.println(es.nextElement());
				String header=es.nextElement().toString();
				String value=req.getHeader(header);
				System.out.println(header+":"+value);
			}
			
			System.out.println("================================================");
		}


首先是浏览器:
Request type is : broswer
Cookie:JSESSIONID=1hvmuj22s3578zndh4x9xjvii
Host:localhost:8888
Accept:text/html,application/xhtml+xml,application/xml;q=0.9,image/webp,*/*;q=0.8
Accept-Language:zh-CN,zh;q=0.8
User-Agent:Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/38.0.2125.111 Safari/537.36
Connection:keep-alive
Cache-Control:max-age=0
Accept-Encoding:gzip,deflate,sdch
================================================


httpclient:


Request type is : httpclient
Host:localhost:8888
User-Agent:Apache-HttpClient/4.3.1 (java 1.5)
Connection:keep-alive
Accept-Encoding:gzip,deflate

最后是casperjs:
Request type is : casperjs
Host:localhost:8888
Accept:text/html,application/xhtml+xml,application/xml;q=0.9,*/*;q=0.8
Accept-Language:zh-CN,en,*
Pragma:no-cache
Connection:keep-alive
User-Agent:Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/534.34 (KHTML, like Gecko) CasperJS/1.1.0-beta3+PhantomJS/1.9.7 Safari/534.34
Cache-Control:no-cache
Accept-Encoding:gzip
================================================

你可能感兴趣的:(爬虫,crawler,header)