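Error when using tesseract through easyOCR: There may be spaces in your image's filename.
My crawler uses easyOCR for image recognition, and all of the tesseract-related code is already wrapped by the library. My code is as follows: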
Request req = Request.create(url, Request.METHOD.GET);
req.getHeader().set("User-Agent", "Mozilla/5.0 (Windows NT 10.0; WOW64; Trident/7.0; rv:11.0) like Gecko");
req.getHeader().set("Referer","http://www.njwztx.com/");
req.getHeader().set("Cookie", "rememberPhone=0; phoneNumber="+tesseract.getNjwztx_dhhm()+"; friendlyReminder=true;JSESSIONID="+JSESSIONID+";");
Response resp = Sender.create(req).send();
if (resp.isOK()) {
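File tmp = new File("yzm.jpg"); // temporary file for the captcha image, relative to the working directory
Files.write(tmp, resp.getStream()); // write the response body to the file
String code = fromFile(tmp); // call the easyOCR interface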
Files.deleteFile(tmp);
return code;
}
return "";
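
Since the error message points at spaces in the image's filename, and new File("yzm.jpg") is resolved against the working directory, one thing that can be checked is whether the absolute path of the image contains whitespace. The following is only a minimal sketch under that assumption: it uses the JDK's java.nio.file.Files (not the Files helper in my code above), and the names PathCheck and hasWhitespace are hypothetical, not part of easyOCR. Whether a space in the path is the real cause here is not confirmed.

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;

class PathCheck {
    // Returns true if the given path string contains any whitespace character.
    static boolean hasWhitespace(String path) {
        return path.chars().anyMatch(Character::isWhitespace);
    }

    public static void main(String[] args) throws IOException {
        // Where a relative "yzm.jpg" actually ends up on disk.
        Path relative = Paths.get("yzm.jpg").toAbsolutePath();
        System.out.println(relative + " contains whitespace: "
                + hasWhitespace(relative.toString()));

        // Alternative location: a file in the system temp directory.
        // Its path can still contain spaces on some machines, so check it too.
        Path tmp = Files.createTempFile("yzm", ".jpg");
        System.out.println(tmp + " contains whitespace: "
                + hasWhitespace(tmp.toString()));
        Files.deleteIfExists(tmp);
    }
}
```

If the first line reports whitespace, writing the captcha to a directory whose full path has no spaces and passing that file to fromFile would be the next thing to try.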