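I have just started learning big data and set up a distributed Hadoop environment myself. After it started successfully, the web UI was reachable, as shown below:
The processes on the master node are as follows: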
hadoop@master:~/hadoop-2.7.2$ jps
2418 Jps
1879 SecondaryNameNode
2056 ResourceManager
1694 NameNode
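The jps output above lists only the master-side daemons. As a rough sketch for checking the other nodes in the same way, the helper below reads jps-style output and prints the expected daemon names that are missing. `check_daemons` is a hypothetical helper name, not part of Hadoop, and the daemon layout in the usage comments (DataNode and NodeManager on the workers) is an assumption about a typical cluster, not something stated in this post.

```shell
# check_daemons (hypothetical helper, not a Hadoop tool): read jps-style
# output on stdin and print the expected daemon names, given as arguments,
# that do not appear in it. Prints an empty line when nothing is missing.
check_daemons() {
  jps_output=$(cat)
  missing=""
  for daemon in "$@"; do
    # jps prints "<pid> <name>"; compare the second column as a whole line
    # so that "NameNode" does not match "SecondaryNameNode".
    if ! printf '%s\n' "$jps_output" | awk '{print $2}' | grep -qx -- "$daemon"; then
      missing="${missing:+$missing }$daemon"
    fi
  done
  printf '%s\n' "$missing"
}

# Usage on the master (assumed layout):
#   jps | check_daemons NameNode SecondaryNameNode ResourceManager
# Usage on each worker (assumed layout):
#   jps | check_daemons DataNode NodeManager
```

If a worker reports NodeManager as missing, YARN has nowhere to launch containers, which is one possible reason for a submitted job to make no progress.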
I ran the word-count demo from hadoop_home/share/hadoop/mapreduce/hadoop-mapreduce-examples-2.7.2.jar with the following command:
hadoop jar hadoop-mapreduce-examples-2.7.2.jar wordcount inputfiles outfiles
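The inputfiles directory had already been created on HDFS and contains a single txt file, an English news article of a little over 200 words. After running the command, the following log appeared and then the job hung without any further output.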
hadoop@master:~/hadoop-2.7.2$ hadoop jar hadoop-mapreduce-examples-2.7.2.jar wordcount inputfiles outfiles
16/06/04 10:58:18 INFO client.RMProxy: Connecting to ResourceManager at master-hadoop/192.168.100.180:8032
16/06/04 10:58:21 INFO input.FileInputFormat: Total input paths to process : 1
16/06/04 10:58:22 INFO mapreduce.JobSubmitter: number of splits:1
16/06/04 10:58:23 INFO mapreduce.JobSubmitter: Submitting tokens for job: job_1465008926631_0002
16/06/04 10:58:24 INFO impl.YarnClientImpl: Submitted application application_1465008926631_0002
16/06/04 10:58:24 INFO mapreduce.Job: The url to track the job: http://master-hadoop:8088/proxy/application_1465008926631_0002/
16/06/04 10:58:24 INFO mapreduce.Job: Running job: job_1465008926631_0002
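The application ID printed in the log above can also be used from the command line. The sketch below pulls the ID out of the "Submitted application" line and, if the `yarn` command is available, asks YARN for the application status and the list of nodes. `extract_app_id` is a hypothetical helper name. The idea that the job is waiting for containers (for example because no NodeManager is registered or too little memory is available) is only an assumption worth ruling out; the log itself does not confirm it.

```shell
# extract_app_id (hypothetical helper): print the first YARN application ID
# found in a "Submitted application ..." line of the client log on stdin.
extract_app_id() {
  sed -n 's/.*Submitted application \(application_[0-9]*_[0-9]*\).*/\1/p' | head -n 1
}

# Example using the log line from this post.
log_line='16/06/04 10:58:24 INFO impl.YarnClientImpl: Submitted application application_1465008926631_0002'
app_id=$(printf '%s\n' "$log_line" | extract_app_id)
echo "$app_id"

# With the ID in hand, the YARN CLI can report the application state and the
# nodes known to the ResourceManager. Skipped when yarn is not on the PATH.
if command -v yarn >/dev/null 2>&1; then
  yarn application -status "$app_id"
  yarn node -list -all
fi
```

An application that stays in the ACCEPTED state, or a node list with no RUNNING nodes, would point toward the resource side of the cluster rather than the job itself.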
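The job as seen in the web UI is shown below:
It has been stuck like this for a very long time and I don't know what the problem is. Any advice from experienced users would be appreciated!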