Nutch在Hadoop平台运行出错

F_Guardian 2015-01-06 11:08:32
Nutch版本1.9
Hadoop版本1.2.1
Nutch的local模式运行没有问题,Hadoop解压后PATH配置好后运行deploy下的脚本,运行到Fetcher部分报错如下:
15/01/06 18:54:44 WARN mapred.LocalJobRunner: job_local357481773_0001
java.lang.RuntimeException: problem advancing post rec#0
at org.apache.hadoop.mapred.Task$ValuesIterator.next(Task.java:1217)
at org.apache.hadoop.mapred.ReduceTask$ReduceValuesIterator.moveToNext(ReduceTask.java:250)
at org.apache.hadoop.mapred.ReduceTask$ReduceValuesIterator.next(ReduceTask.java:246)
at org.apache.hadoop.mapred.lib.IdentityReducer.reduce(IdentityReducer.java:40)
at org.apache.hadoop.mapred.ReduceTask.runOldReducer(ReduceTask.java:522)
at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:421)
at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:398)
Caused by: java.io.IOException: can't find class: org.apache.nutch.protocol.ProtocolStatus because org.apache.nutch.protocol.ProtocolStatus
at org.apache.hadoop.io.AbstractMapWritable.readFields(AbstractMapWritable.java:204)
at org.apache.hadoop.io.MapWritable.readFields(MapWritable.java:146)
at org.apache.nutch.crawl.CrawlDatum.readFields(CrawlDatum.java:281)
at org.apache.nutch.util.GenericWritableConfigurable.readFields(GenericWritableConfigurable.java:55)
at org.apache.hadoop.io.serializer.WritableSerialization$WritableDeserializer.deserialize(WritableSerialization.java:67)
at org.apache.hadoop.io.serializer.WritableSerialization$WritableDeserializer.deserialize(WritableSerialization.java:40)
at org.apache.hadoop.mapred.Task$ValuesIterator.readNextValue(Task.java:1276)
at org.apache.hadoop.mapred.Task$ValuesIterator.next(Task.java:1214)
... 6 more
15/01/06 18:54:44 INFO mapred.JobClient: Job complete: job_local357481773_0001
15/01/06 18:54:44 INFO mapred.JobClient: Counters: 22
15/01/06 18:54:44 INFO mapred.JobClient: File Input Format Counters
15/01/06 18:54:44 INFO mapred.JobClient: Bytes Read=177
15/01/06 18:54:44 INFO mapred.JobClient: FileSystemCounters
15/01/06 18:54:44 INFO mapred.JobClient: FILE_BYTES_READ=66360399
15/01/06 18:54:44 INFO mapred.JobClient: FILE_BYTES_WRITTEN=66980331
15/01/06 18:54:44 INFO mapred.JobClient: FetcherStatus
15/01/06 18:54:44 INFO mapred.JobClient: bytes_downloaded=65536
15/01/06 18:54:44 INFO mapred.JobClient: success=1
15/01/06 18:54:44 INFO mapred.JobClient: Map-Reduce Framework
15/01/06 18:54:44 INFO mapred.JobClient: Map output materialized bytes=15876
15/01/06 18:54:44 INFO mapred.JobClient: Map input records=1
15/01/06 18:54:44 INFO mapred.JobClient: Reduce shuffle bytes=0
15/01/06 18:54:44 INFO mapred.JobClient: Spilled Records=2
15/01/06 18:54:44 INFO mapred.JobClient: Map output bytes=66279
15/01/06 18:54:44 INFO mapred.JobClient: Total committed heap usage (bytes)=175636480
15/01/06 18:54:44 INFO mapred.JobClient: CPU time spent (ms)=0
15/01/06 18:54:44 INFO mapred.JobClient: Map input bytes=79
15/01/06 18:54:44 INFO mapred.JobClient: SPLIT_RAW_BYTES=169
15/01/06 18:54:44 INFO mapred.JobClient: Combine input records=0
15/01/06 18:54:44 INFO mapred.JobClient: Reduce input records=0
15/01/06 18:54:44 INFO mapred.JobClient: Reduce input groups=0
15/01/06 18:54:44 INFO mapred.JobClient: Combine output records=0
15/01/06 18:54:44 INFO mapred.JobClient: Physical memory (bytes) snapshot=0
15/01/06 18:54:44 INFO mapred.JobClient: Reduce output records=0
15/01/06 18:54:44 INFO mapred.JobClient: Virtual memory (bytes) snapshot=0
15/01/06 18:54:44 INFO mapred.JobClient: Map output records=2
15/01/06 18:54:44 INFO mapred.JobClient: Job Failed: NA
15/01/06 18:54:44 ERROR fetcher.Fetcher: Fetcher: java.io.IOException: Job failed!
at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:1357)
at org.apache.nutch.fetcher.Fetcher.fetch(Fetcher.java:1432)
at org.apache.nutch.fetcher.Fetcher.run(Fetcher.java:1468)
at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65)
at org.apache.nutch.fetcher.Fetcher.main(Fetcher.java:1441)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
at java.lang.reflect.Method.invoke(Method.java:606)
at org.apache.hadoop.util.RunJar.main(RunJar.java:160)


求大神解答一下啊,困这一天了,网上各种方法尝试无果。。。
...全文
403 2 打赏 收藏 转发到动态 举报
写回复
用AI写文章
2 条回复
切换为时间正序
请发表友善的回复…
发表回复
F_Guardian 2015-01-23
  • 打赏
  • 举报
回复
应该是Hadoop没有配置好,多搜了一些教程重配了好几次才弄好。。。

2,760

社区成员

发帖
与我相关
我的任务
社区描述
搜索引擎的服务器通过网络搜索软件或网络登录等方式,将Internet上大量网站的页面信息收集到本地,经过加工处理建立信息数据库和索引数据库。
社区管理员
  • 搜索引擎技术社区
加入社区
  • 近7日
  • 近30日
  • 至今
社区公告
暂无公告

试试用AI创作助手写篇文章吧