Kafka console consumer cannot consume data collected by Flume

江湖侠客 2020-03-30 11:54:29

1. After starting the ZooKeeper and Kafka cluster, the running processes are as follows:



[root@flink102 kafka-2.11]# jps

15459 QuorumPeerMain

21466 Kafka

2. The Kafka topic has already been created. Listing all topics on the current server gives:



[root@flink102 kafka-2.11]# bin/kafka-topics.sh --zookeeper flink102:2181 --list

ct

3. Next, I started a Kafka console consumer:



[root@flink102 kafka-2.11]# bin/kafka-console-consumer.sh --zookeeper flink102:2181 --from-beginning --topic ct

Using the ConsoleConsumer with old consumer is deprecated and will be removed in a future major release. Consider using the new consumer by passing [bootstrap-server] instead of [zookeeper].

4. I created the flume-kafka.conf file in the workProject directory:



[root@flink102 ~]# cd /opt/workProject/

[root@flink102 workProject]# ll

total 32

-rw-r--r-- 1 root root  4312 Mar 27 15:10 call.log

-rw-r--r-- 1 root root   543 Mar 24 12:26 contact.log

-rw-r--r-- 1 root root 14155 Mar 24 12:53 ct-producer.jar

-rw-r--r-- 1 root root   683 Mar 27 14:37 flume-kafka.conf

drwxr-xr-x 2 root root    24 Mar 25 11:11 log

[root@flink102 workProject]# vim flume-kafka.conf 





Add the following configuration parameters:

# define

 a1.sources = r1

 a1.sinks = k1

 a1.channels = c1



# # source

 a1.sources.r1.type = exec

 a1.sources.r1.command = tail -F -c +0 /opt/workProject/call.log

 a1.sources.r1.shell = /bin/bash -c



# # sink

 a1.sinks.k1.type = org.apache.flume.sink.kafka.KafkaSink

 a1.sinks.k1.kafka.bootstrap.servers =flink102:9092

 a1.sinks.k1.kafka.topic = ct

 a1.sinks.k1.kafka.flumeBatchSize = 20

 a1.sinks.k1.kafka.producer.acks = 1

 a1.sinks.k1.kafka.producer.linger.ms = 1



# # channel

 a1.channels.c1.type = memory

 a1.channels.c1.capacity = 1000

 a1.channels.c1.transactionCapacity = 100

#

# # bind

 a1.sources.r1.channels = c1

 a1.sinks.k1.channel = c1

Note that call.log does contain data, as shown below:

[root@flink102 workProject]# tail -f call.log 

15884588694	19154926260	20180721043739	1172

16574556259	19154926260	20180311120306	0942

15280214634	15647679901	20180904154615	0234

16160892861	14171709460	20181223154548	1720

15244749863	19342117869	20180404160230	2565

15647679901	14171709460	20180801213806	0758

15884588694	14397114174	20180222050955	0458

19154926260	16569963779	20180715235743	1489

14171709460	19602240179	20181120075855	2488

19683537146	16574556259	20180724031723	0652

5. Start Flume to collect the data:

[root@flink102 ~]# cd /usr/hadoop/module/flume/flume-1.7.0/

[root@flink102 flume-1.7.0]# bin/flume-ng agent -c conf/ -f /opt/workProject/flume-kafka.conf

The loading process ran as shown in the screenshot in the original post.

6. Checking the Kafka consumer shows no data; nothing can be consumed.

It stays stuck at:



[root@flink102 kafka-2.11]# bin/kafka-console-consumer.sh --zookeeper flink102:2181 --from-beginning --topic ct

Using the ConsoleConsumer with old consumer is deprecated and will be removed in a future major release. Consider using the new consumer by passing [bootstrap-server] instead of [zookeeper].

[2020-03-30 10:59:11,139] INFO [Group Metadata Manager on Broker 3]: Removed 0 expired offsets in 0 milliseconds. (kafka.coordinator.group.GroupMetadataManager)
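
The deprecation warning above suggests switching to the new consumer by passing `--bootstrap-server` instead of `--zookeeper`. As a minimal sketch, the broker address could be read from the same Flume configuration the sink uses, so that the sink and the consumer point at the same broker. The helper name `bootstrap_from_conf` is hypothetical, and nothing in this thread confirms that this change alone resolves the problem.

```shell
# Hypothetical helper: print the value of the first non-commented
# "<agent>.sinks.<sink>.kafka.bootstrap.servers" line in a Flume config file.
bootstrap_from_conf() {
  sed -n 's/^[[:space:]]*[^#[:space:]]*\.kafka\.bootstrap\.servers[[:space:]]*=[[:space:]]*\(.*[^[:space:]]\)[[:space:]]*$/\1/p' "$1" | head -n 1
}

# Usage (not run here; needs a live broker):
#   bin/kafka-console-consumer.sh \
#     --bootstrap-server "$(bootstrap_from_conf /opt/workProject/flume-kafka.conf)" \
#     --topic ct --from-beginning
```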

7. Checking the logs under the flume-1.7.0 directory shows this error:



30 Mar 2020 11:15:33,808 ERROR [main] (org.apache.flume.node.Application.main:348)  - A fatal error occurred while running. Exception follows.

org.apache.commons.cli.MissingOptionException: Missing required option: n

	at org.apache.commons.cli.Parser.checkRequiredOptions(Parser.java:299)

	at org.apache.commons.cli.Parser.parse(Parser.java:231)

	at org.apache.commons.cli.Parser.parse(Parser.java:85)

	at org.apache.flume.node.Application.main(Application.java:263)

Could anyone tell me what causes this and how to fix it? Thanks!

6 replies (newest first)


seeuido 2023-01-11


Hi, did you ever solve this? I ran into the same problem and cannot consume normally.

江湖侠客 2020-03-30


OK, I just tried it. The agent name was indeed missing, so I ran it again:

[root@flink102 flume-1.7.0]# bin/flume-ng agent -n a1 -c conf/ -f /usr/hadoop/module/flume/flume-1.7.0/conf/flume-kafka.conf

Loading process:

The Kafka consumer still does not seem to receive anything:

[root@flink102 kafka-2.11]# bin/kafka-console-consumer.sh --zookeeper flink102:2181  --topic ct
Using the ConsoleConsumer with old consumer is deprecated and will be removed in a future major release. Consider using the new consumer by passing [bootstrap-server] instead of [zookeeper].

As shown in the screenshot in the original post.

LinkSe7en 2020-03-30


I overlooked it earlier: your Flume launch command is missing the agent name (-n a1):

bin/flume-ng agent -n a1 -c conf/ -f /usr/hadoop/module/flume/flume-1.7.0/conf/flume-kafka.conf
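
The name passed with `-n` has to match the prefix used for the properties in the configuration file (`a1` in the file shown in the question). A minimal sketch that reads that prefix from the file instead of typing it by hand; the helper name `agent_name_from_conf` is hypothetical:

```shell
# Hypothetical helper: print the agent name, i.e. the prefix of the first
# non-commented "<agent>.sources = ..." line in a Flume config file.
agent_name_from_conf() {
  sed -n 's/^[[:space:]]*\([^#[:space:].]*\)\.sources[[:space:]]*=.*/\1/p' "$1" | head -n 1
}

# Usage (not run here; needs a Flume installation):
#   conf=/usr/hadoop/module/flume/flume-1.7.0/conf/flume-kafka.conf
#   bin/flume-ng agent -n "$(agent_name_from_conf "$conf")" -c conf/ -f "$conf"
```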

江湖侠客 2020-03-30


I just tried your suggestion:


# # source
 a1.sources.r1.type = exec
 a1.sources.r1.command =tail -f -c +0 /opt/workProject/call.log
 a1.sources.r1.shell = /bin/bash -c

The result is shown in the screenshot in the original post.

After that, I restarted the Kafka consumer:

[root@flink102 kafka-2.11]# bin/kafka-console-consumer.sh --zookeeper flink102:2181  --topic ct
Using the ConsoleConsumer with old consumer is deprecated and will be removed in a future major release. Consider using the new consumer by passing [bootstrap-server] instead of [zookeeper].

Then I started the Flume service:

[root@flink102 flume-1.7.0]# bin/flume-ng agent -c conf/ a1 -f /usr/hadoop/module/flume/flume-1.7.0/conf/flume-kafka.conf

The Flume service appeared to load normally.

Finally, the Kafka consumer still received no data:

[root@flink102 kafka-2.11]# bin/kafka-console-consumer.sh --zookeeper flink102:2181  --topic ct
Using the ConsoleConsumer with old consumer is deprecated and will be removed in a future major release. Consider using the new consumer by passing [bootstrap-server] instead of [zookeeper].
[2020-03-30 16:30:28,065] INFO [Group Metadata Manager on Broker 1]: Removed 0 expired offsets in 0 milliseconds. (kafka.coordinator.group.GroupMetadataManager)

Finally, I checked Flume's log file, and it still shows the error:

[root@flink102 flume-1.7.0]# tail -f logs/flume.log 
	at org.apache.commons.cli.Parser.checkRequiredOptions(Parser.java:299)
	at org.apache.commons.cli.Parser.parse(Parser.java:231)
	at org.apache.commons.cli.Parser.parse(Parser.java:85)
	at org.apache.flume.node.Application.main(Application.java:263)
30 Mar 2020 16:38:04,434 ERROR [main] (org.apache.flume.node.Application.main:348)  - A fatal error occurred while running. Exception follows.
org.apache.commons.cli.MissingOptionException: Missing required option: n
	at org.apache.commons.cli.Parser.checkRequiredOptions(Parser.java:299)
	at org.apache.commons.cli.Parser.parse(Parser.java:231)
	at org.apache.commons.cli.Parser.parse(Parser.java:85)
	at org.apache.flume.node.Application.main(Application.java:263)
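
`Missing required option: n` is the command-line parser reporting that `-n` (the agent name) was not supplied. The Flume launch command earlier in this reply passes `a1` without the `-n` flag in front of it, which is consistent with this log entry. A minimal sketch of a pre-launch check, with `has_agent_name` as a hypothetical helper:

```shell
# Hypothetical helper: succeed only if the argument list names the agent,
# i.e. contains -n or --name followed by a value that is not another flag.
has_agent_name() {
  local prev="" arg
  for arg in "$@"; do
    if [ "$prev" = "-n" ] || [ "$prev" = "--name" ]; then
      case "$arg" in
        -*) ;;          # another flag follows, so no name was given
        *) return 0 ;;
      esac
    fi
    prev="$arg"
  done
  return 1
}

# The command from this reply fails the check; the suggested one passes.
has_agent_name agent -c conf/ a1 -f flume-kafka.conf || echo "missing -n <agent name>"
has_agent_name agent -n a1 -c conf/ -f flume-kafka.conf && echo "agent name present"
```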

江湖侠客 2020-03-30


OK, thanks, I'll take a look.

LinkSe7en 2020-03-30


# # source
 a1.sources.r1.type = exec
 a1.sources.r1.command = tail -F -c +0 /opt/workProject/call.log
 a1.sources.r1.shell = /bin/bash -c

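The problem is probably here: tail -F -c +0 /opt/workProject/call.log. Copy that command out, get it working in a shell first, and then paste it back in. It should probably be -f rather than -F.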
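
One way to follow this advice is to run the source command through the same shell wrapper the exec source is configured with (`/bin/bash -c`) and see whether it prints anything. A minimal sketch, assuming GNU `timeout` is available; `probe_source_command` is a hypothetical helper:

```shell
# Hypothetical helper: run an exec-source command through /bin/bash -c for a
# few seconds and print how many lines it emitted in that time.
probe_source_command() {
  local cmd="$1" seconds="${2:-2}"
  # timeout stops the otherwise endless tail (its non-zero exit status is
  # expected, hence "|| true"); grep -c '' counts the lines that came out.
  { timeout "$seconds" /bin/bash -c "$cmd" || true; } | grep -c ''
}

# Usage with the command from the configuration:
#   probe_source_command "tail -F -c +0 /opt/workProject/call.log" 2
```

A count of 0 would suggest the command itself is the problem; a non-zero count would suggest looking elsewhere, for example at the launch options or the sink.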
