网络爬虫发生java.net.SocketException异常:软件导致连接中止:recv的失败

问题描述:

java.net.SocketException: Software caused connection abort: recv failed 
at java.net.SocketInputStream.socketRead0(Native Method) 
at java.net.SocketInputStream.socketRead(Unknown Source) 
at java.net.SocketInputStream.read(Unknown Source) 
at java.net.SocketInputStream.read(Unknown Source) 
at java.io.BufferedInputStream.fill(Unknown Source) 
at java.io.BufferedInputStream.read1(Unknown Source) 
at java.io.BufferedInputStream.read(Unknown Source) 
at sun.net.www.http.HttpClient.parseHTTPHeader(Unknown Source) 
at sun.net.www.http.HttpClient.parseHTTP(Unknown Source) 
at sun.net.www.http.HttpClient.parseHTTP(Unknown Source) 
at sun.net.www.protocol.http.HttpURLConnection.getInputStream0(Unknown Source) 
at sun.net.www.protocol.http.HttpURLConnection.getInputStream(Unknown Source) 
at java.net.URL.openStream(Unknown Source) 
at TestHttpClient.testURL3(TestHttpClient.java:196) 
at TestHttpClient.main(TestHttpClient.java:226) 

我设置的UserAgent/cooike和其他设置喜欢的浏览器,但它仍然是这样这个exception.I测试其他网站程序是确定的,所以我想这可能是反爬行策略。 我希望你能帮我找出错误的原因或我该怎么做。 Thx !!!!!网络爬虫发生java.net.SocketException异常:软件导致连接中止:recv的失败

+0

您能否向我们解释一下您想要存档或向我们展示您的一些代码? –

好的,我知道为什么我的程序错误。因为GFW强制断开TCP连接。