小白刚接触Heartbeat双机热备 日志求分析!求帮助啊!

chaojinhua 2014-03-17 05:00:04
Mar 14 21:27:59 lossoe1 heartbeat: [9038]: WARN: node lossoe2: is dead
Mar 14 21:27:59 lossoe1 heartbeat: [9038]: info: Dead node lossoe2 gave up resources.
Mar 14 21:27:59 lossoe1 heartbeat: [9038]: info: Link lossoe2:eth3 dead.
Mar 14 21:27:59 lossoe1 ipfail: [9066]: info: Status update: Node lossoe2 now has status dead
Mar 14 21:28:00 lossoe1 ipfail: [9066]: info: NS: We are still alive!
Mar 14 21:28:00 lossoe1 ipfail: [9066]: info: Link Status update: Link lossoe2/eth3 now has status dead
Mar 14 21:28:01 lossoe1 heartbeat: [9038]: CRIT: Cluster node lossoe2 returning after partition.
Mar 14 21:28:01 lossoe1 heartbeat: [9038]: info: For information on cluster partitions, See URL: http://linux-ha.org/wiki/Split_Brain
Mar 14 21:28:01 lossoe1 heartbeat: [9038]: WARN: Deadtime value may be too small.
Mar 14 21:28:01 lossoe1 heartbeat: [9038]: info: See FAQ for information on tuning deadtime.
Mar 14 21:28:01 lossoe1 heartbeat: [9038]: info: URL: http://linux-ha.org/wiki/FAQ#Heavy_Load
Mar 14 21:28:01 lossoe1 heartbeat: [9038]: info: Link lossoe2:eth3 up.
Mar 14 21:28:01 lossoe1 heartbeat: [9038]: WARN: Late heartbeat: Node lossoe2: interval 32020 ms
Mar 14 21:28:01 lossoe1 heartbeat: [9038]: info: Status update for node lossoe2: status active
harc[73439]: 2014/03/14_21:28:01 info: Running /etc/ha.d//rc.d/status status
Mar 14 21:28:01 lossoe1 ipfail: [9066]: info: Asking other side for ping node count.
Mar 14 21:28:01 lossoe1 ipfail: [9066]: info: Checking remote count of ping nodes.
Mar 14 21:28:01 lossoe1 ipfail: [9066]: info: Link Status update: Link lossoe2/eth3 now has status up
Mar 14 21:28:01 lossoe1 ipfail: [9066]: info: Status update: Node lossoe2 now has status active
Mar 14 21:28:03 lossoe1 ipfail: [9066]: info: No giveup timer to abort.
Mar 14 21:28:03 lossoe1 heartbeat: [9038]: info: Heartbeat shutdown in progress. (9038)
Mar 14 21:28:03 lossoe1 heartbeat: [73456]: info: Giving up all HA resources.
.....





上面是我的日志输出文件,是什么问题导致切换到备用服务器上去了呢?

补充一下 我的ha.cf配置
logfile /var/log/ha_log/ha-log.log
logfacility local0
bcast eth3
keepalive 2
#warntime 10
deadtime 30
initdead 120
ucast eth3 192.168.3.1
udpport 694
auto_failback off
node lossoe1
node lossoe12
ping 10.160.0.126
respawn hacluster /usr/lib64/heartbeat/ipfail
...全文
95 2 打赏 收藏 转发到动态 举报
AI 作业
写回复
用AI写文章
2 条回复
切换为时间正序
请发表友善的回复…
发表回复
lucky-lucky 2014-03-18
  • 打赏
  • 举报
回复
楼主是什么控制系统上面的双机热备?日志里面没看清楚为什么切换过去了
chaojinhua 2014-03-17
  • 打赏
  • 举报
回复
自己给自己顶一下!多谢各位了!! 我自己认为 配置是不是要更改呢? 1.打开warntime 2.deadtime设置时间长点 3.把auto_failback 设置为on

19,619

社区成员

发帖
与我相关
我的任务
社区描述
系统使用、管理、维护问题。可以是Ubuntu, Fedora, Unix等等
社区管理员
  • 系统维护与使用区社区
加入社区
  • 近7日
  • 近30日
  • 至今
社区公告
暂无公告

试试用AI创作助手写篇文章吧