PDFminer3k 将PDF转换为TXT时出错pdfminer.pdfparser.PDFSyntaxError: Invalid object number

weixin_43350191 2019-07-02 11:39:46
PDFminer3k 将PDF转换为TXT时出错,求大神解决方案!

"C:\Program Files\Python37\python.exe" D:/PYTHON/PythonWS/0702/0702.py
WARNING:root:Wrong type: <PDFStream(3): raw=278, {'Type': /Metadata, 'Subtype': /XML, 'Length': 278, 'Filter': /FlateDecode}> required: <class 'dict'>
WARNING:root:Cannot locate objid=221
Mark
Traceback (most recent call last):
File "C:\Program Files\Python37\lib\site-packages\pdfminer\pdfparser.py", line 377, in _getobj
obj = objs[i]
IndexError: list index out of range

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
File "D:/PYTHON/PythonWS/0702/0702.py", line 51, in <module>
readPDF(path, toPath)
File "D:/PYTHON/PythonWS/0702/0702.py", line 39, in readPDF
for page in pdfFile.get_pages():
File "C:\Program Files\Python37\lib\site-packages\pdfminer\pdfparser.py", line 568, in get_pages
for (pageid,tree) in search(self.catalog['Pages'], self.catalog):
File "C:\Program Files\Python37\lib\site-packages\pdfminer\pdfparser.py", line 552, in search
tree = dict_value(obj, strict=True).copy()
File "C:\Program Files\Python37\lib\site-packages\pdfminer\pdftypes.py", line 92, in typecheck_value
x = resolve1(x)
File "C:\Program Files\Python37\lib\site-packages\pdfminer\pdftypes.py", line 58, in resolve1
x = x.resolve()
File "C:\Program Files\Python37\lib\site-packages\pdfminer\pdftypes.py", line 47, in resolve
return self.doc.getobj(self.objid)
File "C:\Program Files\Python37\lib\site-packages\pdfminer\pdfparser.py", line 532, in getobj
result = self._getobj(objid)
File "C:\Program Files\Python37\lib\site-packages\pdfminer\pdfparser.py", line 379, in _getobj
raise PDFSyntaxError('Invalid object number: objid=%r' % (objid))
pdfminer.pdfparser.PDFSyntaxError: Invalid object number: objid=2

Process finished with exit code 1
...全文
1894 5 打赏 收藏 转发到动态 举报
AI 作业
写回复
用AI写文章
5 条回复
切换为时间正序
请发表友善的回复…
发表回复
是但你咯 2021-12-15
  • 打赏
  • 举报
回复

用Acrobat Reader打开PDF文件,然后另存文件,就可以了

tuerchou 2021-04-08
  • 打赏
  • 举报
回复
这个问题没人回达吗,真是棘手呀,求大神回复一下。
xiaoming_xiaoli 2021-02-12
  • 打赏
  • 举报
回复
遇到同样的问题,解决了吗
遥远方 2020-12-27
  • 打赏
  • 举报
回复
请问这个问题解决了吗,遇到同样问题?
weixin_43350191 2019-07-02
  • 打赏
  • 举报
回复

37,743

社区成员

发帖
与我相关
我的任务
社区描述
JavaScript,VBScript,AngleScript,ActionScript,Shell,Perl,Ruby,Lua,Tcl,Scala,MaxScript 等脚本语言交流。
社区管理员
  • 脚本语言(Perl/Python)社区
  • WuKongSecurity@BOB
加入社区
  • 近7日
  • 近30日
  • 至今

试试用AI创作助手写篇文章吧