Flink-on-YARN high availability: cluster dies shortly after startup with an error

你的配jue971 2022-09-15 14:37:42

I integrated Flink into CDH and started it.

 

After that I configured high availability for flink-on-yarn:

jobmanager.rpc.address: nodeone

jobmanager.rpc.port: 6123


# The total process memory size for the JobManager.
#

jobmanager.memory.process.size: 1600m


# The total process memory size for the TaskManager.
#

taskmanager.memory.process.size: 1728m

# It is not recommended to set both 'taskmanager.memory.process.size' and Flink memory.
#
# taskmanager.memory.flink.size: 1280m

# The number of task slots that each TaskManager offers. Each slot runs one parallel pipeline.

taskmanager.numberOfTaskSlots: 1

# The parallelism used for programs that did not specify any other parallelism.

parallelism.default: 1

# The default file system scheme and authority.
# 
# By default file paths without scheme are interpreted relative to the local
# root file system 'file:///'. Use this to override the default and interpret
# relative paths relative to a different file system,
# for example 'hdfs://mynamenode:12345'
#
# fs.default-scheme

#==============================================================================
# High Availability
#==============================================================================

# The high-availability mode. Possible options are 'NONE' or 'zookeeper'.
high-availability: zookeeper
# The path where metadata for master recovery is persisted. While ZooKeeper stores
# the small ground truth for checkpoint and leader election, this location stores
# the larger objects, like persisted dataflow graphs.
# 
# Must be a durable file system that is accessible from all nodes
# (like HDFS, S3, Ceph, nfs, ...) 
#
# high-availability.storageDir: hdfs:///flink/ha/
high-availability.storageDir: hdfs:///flink/ha/
# The list of ZooKeeper quorum peers that coordinate the high-availability
# setup. This must be a list of the form:
# "host1:clientPort,host2:clientPort,..." (default clientPort: 2181)
#
# high-availability.zookeeper.quorum: localhost:2181
high-availability.zookeeper.quorum: nodeone:2181,nodetwo:2181,nodethree:2181

high-availability.zookeeper.path.root: /flink
high-availability.cluster-id: /cluster_one
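
A quick way to sanity-check settings like the ones above is to parse the flat `key: value` lines and confirm that the ZooKeeper HA keys shown in this config are actually set (not left commented out). This is only a minimal sketch, not a Flink tool: the function names `parse_flink_conf` and `missing_ha_keys` are hypothetical, and the parser assumes the simple flat format used in this post.

```python
# Keys taken from the HA section of the config quoted above.
REQUIRED_HA_KEYS = (
    "high-availability",
    "high-availability.storageDir",
    "high-availability.zookeeper.quorum",
)


def parse_flink_conf(text):
    """Parse flat 'key: value' lines, skipping blank lines and '#' comments."""
    conf = {}
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#"):
            continue
        # Split on the first ': ' so values such as 'nodeone:2181' stay intact.
        key, sep, value = line.partition(": ")
        if not sep:
            continue  # not a 'key: value' line
        conf[key.strip()] = value.strip()
    return conf


def missing_ha_keys(conf):
    """Return the required HA keys that are absent or empty."""
    return [key for key in REQUIRED_HA_KEYS if not conf.get(key)]


if __name__ == "__main__":
    sample = """
    high-availability: zookeeper
    # high-availability.storageDir: hdfs:///flink/ha/
    high-availability.storageDir: hdfs:///flink/ha/
    high-availability.zookeeper.quorum: nodeone:2181,nodetwo:2181,nodethree:2181
    """
    print("missing HA keys:", missing_ha_keys(parse_flink_conf(sample)))
```

A check like this only confirms that the keys are present. It says nothing about whether the HDFS path or the ZooKeeper quorum is reachable from the JobManager.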

I then started the cluster separately.

jps shows the process, but it dies after a short while with the following error:

2022-09-15 14:25:41,859 INFO  org.apache.flink.runtime.entrypoint.ClusterEntrypoint        [] - Shutting StandaloneSessionClusterEntrypoint down with application status FAILED. Diagnostics java.io.IOException: Failed on local exception: java.io.IOException: Couldn't set up IO streams: java.lang.NoClassDefFoundError: org/apache/flink/shaded/hadoop2/com/google/re2j/PatternSyntaxException; Host Details : local host is: "nodeone/192.168.19.191"; destination host is: "nodetwo":8020; 
    at org.apache.hadoop.net.NetUtils.wrapException(NetUtils.java:808)
    at org.apache.hadoop.ipc.Client.getRpcResponse(Client.java:1495)
    at org.apache.hadoop.ipc.Client.call(Client.java:1437)
    at org.apache.hadoop.ipc.Client.call(Client.java:1347)
    at org.apache.hadoop.ipc.ProtobufRpcEngine$Invoker.invoke(ProtobufRpcEngine.java:228)
    at org.apache.hadoop.ipc.ProtobufRpcEngine$Invoker.invoke(ProtobufRpcEngine.java:116)
    at com.sun.proxy.$Proxy27.mkdirs(Unknown Source)
    at org.apache.hadoop.hdfs.protocolPB.ClientNamenodeProtocolTranslatorPB.mkdirs(ClientNamenodeProtocolTranslatorPB.java:639)
    at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
    at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
    at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
    at java.lang.reflect.Method.invoke(Method.java:498)
    at org.apache.hadoop.io.retry.RetryInvocationHandler.invokeMethod(RetryInvocationHandler.java:422)
    at org.apache.hadoop.io.retry.RetryInvocationHandler$Call.invokeMethod(RetryInvocationHandler.java:165)
    at org.apache.hadoop.io.retry.RetryInvocationHandler$Call.invoke(RetryInvocationHandler.java:157)
    at org.apache.hadoop.io.retry.RetryInvocationHandler$Call.invokeOnce(RetryInvocationHandler.java:95)
    at org.apache.hadoop.io.retry.RetryInvocationHandler.invoke(RetryInvocationHandler.java:359)
    at com.sun.proxy.$Proxy28.mkdirs(Unknown Source)
    at org.apache.hadoop.hdfs.DFSClient.primitiveMkdir(DFSClient.java:2376)
    at org.apache.hadoop.hdfs.DFSClient.mkdirs(DFSClient.java:2352)
    at org.apache.hadoop.hdfs.DistributedFileSystem$27.doCall(DistributedFileSystem.java:1243)
    at org.apache.hadoop.hdfs.DistributedFileSystem$27.doCall(DistributedFileSystem.java:1240)
    at org.apache.hadoop.fs.FileSystemLinkResolver.resolve(FileSystemLinkResolver.java:81)
    at org.apache.hadoop.hdfs.DistributedFileSystem.mkdirsInternal(DistributedFileSystem.java:1257)
    at org.apache.hadoop.hdfs.DistributedFileSystem.mkdirs(DistributedFileSystem.java:1232)
    at org.apache.hadoop.fs.FileSystem.mkdirs(FileSystem.java:2260)
    at org.apache.flink.runtime.fs.hdfs.HadoopFileSystem.mkdirs(HadoopFileSystem.java:183)
    at org.apache.flink.runtime.blob.FileSystemBlobStore.<init>(FileSystemBlobStore.java:64)
    at org.apache.flink.runtime.blob.BlobUtils.createFileSystemBlobStore(BlobUtils.java:98)
    at org.apache.flink.runtime.blob.BlobUtils.createBlobStoreFromConfig(BlobUtils.java:76)
    at org.apache.flink.runtime.highavailability.HighAvailabilityServicesUtils.createHighAvailabilityServices(HighAvailabilityServicesUtils.java:121)
    at org.apache.flink.runtime.entrypoint.ClusterEntrypoint.createHaServices(ClusterEntrypoint.java:361)
    at org.apache.flink.runtime.entrypoint.ClusterEntrypoint.initializeServices(ClusterEntrypoint.java:318)
    at org.apache.flink.runtime.entrypoint.ClusterEntrypoint.runCluster(ClusterEntrypoint.java:243)
    at org.apache.flink.runtime.entrypoint.ClusterEntrypoint.lambda$startCluster$1(ClusterEntrypoint.java:193)
    at java.security.AccessController.doPrivileged(Native Method)
    at javax.security.auth.Subject.doAs(Subject.java:422)
    at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1962)
    at org.apache.flink.runtime.security.contexts.HadoopSecurityContext.runSecured(HadoopSecurityContext.java:41)
    at org.apache.flink.runtime.entrypoint.ClusterEntrypoint.startCluster(ClusterEntrypoint.java:190)
    at org.apache.flink.runtime.entrypoint.ClusterEntrypoint.runClusterEntrypoint(ClusterEntrypoint.java:617)
    at org.apache.flink.runtime.entrypoint.StandaloneSessionClusterEntrypoint.main(StandaloneSessionClusterEntrypoint.java:59)
Caused by: java.io.IOException: Couldn't set up IO streams: java.lang.NoClassDefFoundError: org/apache/flink/shaded/hadoop2/com/google/re2j/PatternSyntaxException
    at org.apache.hadoop.ipc.Client$Connection.setupIOstreams(Client.java:860)
    at org.apache.hadoop.ipc.Client$Connection.access$3500(Client.java:409)
    at org.apache.hadoop.ipc.Client.getConnection(Client.java:1552)
    at org.apache.hadoop.ipc.Client.call(Client.java:1383)
    ... 39 more
Caused by: java.lang.NoClassDefFoundError: org/apache/flink/shaded/hadoop2/com/google/re2j/PatternSyntaxException
    at org.apache.hadoop.security.SaslRpcClient.getServerPrincipal(SaslRpcClient.java:311)
    at org.apache.hadoop.security.SaslRpcClient.createSaslClient(SaslRpcClient.java:234)
    at org.apache.hadoop.security.SaslRpcClient.selectSaslClient(SaslRpcClient.java:160)
    at org.apache.hadoop.security.SaslRpcClient.saslConnect(SaslRpcClient.java:390)
    at org.apache.hadoop.ipc.Client$Connection.setupSaslConnection(Client.java:613)
    at org.apache.hadoop.ipc.Client$Connection.access$2200(Client.java:409)
    at org.apache.hadoop.ipc.Client$Connection$2.run(Client.java:798)
    at org.apache.hadoop.ipc.Client$Connection$2.run(Client.java:794)
    at java.security.AccessController.doPrivileged(Native Method)
    at javax.security.auth.Subject.doAs(Subject.java:422)
    at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1962)
    at org.apache.hadoop.ipc.Client$Connection.setupIOstreams(Client.java:794)
    ... 42 more
Caused by: java.lang.ClassNotFoundException: org.apache.flink.shaded.hadoop2.com.google.re2j.PatternSyntaxException
    at java.net.URLClassLoader.findClass(URLClassLoader.java:382)
    at java.lang.ClassLoader.loadClass(ClassLoader.java:424)
    at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:349)
    at java.lang.ClassLoader.loadClass(ClassLoader.java:357)
    ... 54 more
.
2022-09-15 14:25:41,877 INFO  org.apache.flink.runtime.rpc.akka.AkkaRpcService             [] - Stopping Akka RPC service.
2022-09-15 14:25:41,916 INFO  akka.remote.RemoteActorRefProvider$RemotingTerminator        [] - Shutting down remote daemon.
2022-09-15 14:25:41,917 INFO  akka.remote.RemoteActorRefProvider$RemotingTerminator        [] - Remote daemon shut down; proceeding with flushing remote transports.
2022-09-15 14:25:41,944 INFO  akka.remote.RemoteActorRefProvider$RemotingTerminator        [] - Remoting shut down.
2022-09-15 14:25:41,973 INFO  org.apache.flink.runtime.rpc.akka.AkkaRpcService             [] - Stopped Akka RPC service.
2022-09-15 14:25:41,999 ERROR org.apache.flink.runtime.entrypoint.ClusterEntrypoint        [] - Could not start cluster entrypoint StandaloneSessionClusterEntrypoint.
org.apache.flink.runtime.entrypoint.ClusterEntrypointException: Failed to initialize the cluster entrypoint StandaloneSessionClusterEntrypoint.
    at org.apache.flink.runtime.entrypoint.ClusterEntrypoint.startCluster(ClusterEntrypoint.java:216) ~[flink-dist_2.11-1.14.4.jar:1.14.4]
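
The innermost `ClassNotFoundException` in the trace above says that the JobManager's classloader could not find `org.apache.flink.shaded.hadoop2.com.google.re2j.PatternSyntaxException`. One way to narrow this down is to check which jars on the classpath, if any, contain that class. The sketch below does this with Python's standard `zipfile` module. The helper name `find_class_in_jars` and the default directory `/opt/flink/lib` are assumptions for illustration only; pass the real Flink `lib` directory as the first argument.

```python
import sys
import zipfile
from pathlib import Path

# Class name copied from the ClassNotFoundException in the log above.
MISSING = "org.apache.flink.shaded.hadoop2.com.google.re2j.PatternSyntaxException"


def class_entry(class_name):
    """Convert a dotted class name to its path inside a jar."""
    return class_name.replace(".", "/") + ".class"


def find_class_in_jars(lib_dir, class_name):
    """Return the names of jars directly under lib_dir that contain class_name."""
    entry = class_entry(class_name)
    hits = []
    for jar in sorted(Path(lib_dir).glob("*.jar")):
        try:
            with zipfile.ZipFile(jar) as zf:
                if entry in zf.namelist():
                    hits.append(jar.name)
        except zipfile.BadZipFile:
            continue  # skip files that are not valid archives
    return hits


if __name__ == "__main__":
    # '/opt/flink/lib' is a placeholder path, not taken from the post.
    lib_dir = sys.argv[1] if len(sys.argv) > 1 else "/opt/flink/lib"
    hits = find_class_in_jars(lib_dir, MISSING)
    if hits:
        print("class found in:", ", ".join(hits))
    else:
        print("class not found in any jar under", lib_dir)
```

If no jar contains the class, that would be consistent with the error in the log: something on the classpath refers to the relocated `re2j` package, but nothing provides it. The scan only covers jars directly under the given directory, so jars added through other classpath settings would need to be checked separately.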

 

qq_34224360 2023-08-30

Did you manage to solve this? Could you share the fix?
