代码之家  ›  专栏  ›  技术社区  ›  slm yuvaeasy

Ambari代理无法再通过http://dom>:8440联系服务器?

  •  1
  • slm yuvaeasy  · 技术社区  · 6 年前

    我们目前正在运行HortonWorks 2.6.5.0:

    $ hadoop version
    Hadoop 2.7.3.2.6.5.0-292
    Subversion git@github.com:hortonworks/hadoop.git -r 3091053c59a62c82d82c9f778c48bde5ef0a89a1
    Compiled by jenkins on 2018-05-11T07:53Z
    Compiled with protoc 2.5.0
    From source with checksum abed71da5bc89062f6f6711179f2058
    This command was run using /usr/hdp/2.6.5.0-292/hadoop/hadoop-common-2.7.3.2.6.5.0-292.jar
    

    操作系统是Centos 7:

    $ cat /etc/redhat-release
    CentOS Linux release 7.5.1804 (Core)
    

    我们最近开始注意到这些问题 ambari-agent 日志文件:

    $ grep -i "error|warn" /var/log/ambari-agent/*
    /var/log/ambari-agent/ambari-agent.log:WARNING 2018-07-30 14:03:50,982 NetUtil.py:124 - Server at https://hbase26-2.mydom.com:8440 is not reachable, sleeping for 10 seconds...
    /var/log/ambari-agent/ambari-agent.log:ERROR 2018-07-30 14:04:00,986 NetUtil.py:96 - EOF occurred in violation of protocol (_ssl.c:579)
    /var/log/ambari-agent/ambari-agent.log:ERROR 2018-07-30 14:04:00,990 NetUtil.py:97 - SSLError: Failed to connect. Please check openssl library versions.
    /var/log/ambari-agent/ambari-agent.log:WARNING 2018-07-30 14:04:00,990 NetUtil.py:124 - Server at https://hbase26-2.aa.mydom.com:8440 is not reachable, sleeping for 10 seconds...
    /var/log/ambari-agent/ambari-agent.log:ERROR 2018-07-30 14:04:10,993 NetUtil.py:96 - EOF occurred in violation of protocol (_ssl.c:579)
    /var/log/ambari-agent/ambari-agent.log:ERROR 2018-07-30 14:04:10,994 NetUtil.py:97 - SSLError: Failed to connect. Please check openssl library versions.
    /var/log/ambari-agent/ambari-agent.log:WARNING 2018-07-30 14:04:10,994 NetUtil.py:124 - Server at https://hbase26-2.aa.mydom.com:8440 is not reachable, sleeping for 10 seconds...
    /var/log/ambari-agent/ambari-agent.log:ERROR 2018-07-30 14:04:20,996 NetUtil.py:96 - EOF occurred in violation of protocol (_ssl.c:579)
    /var/log/ambari-agent/ambari-agent.log:ERROR 2018-07-30 14:04:20,997 NetUtil.py:97 - SSLError: Failed to connect. Please check openssl library versions.
    

    当这些开始发生时,我们无法再通过Ambari管理Hadoop集群的任何方面。所有的服务都显示出一些黄色的问号,并说“心跳停止”。

    多次重新启动不允许我们恢复ambari,最终重新获得对集群的控制。

    1 回复  |  直到 6 年前
        1
  •  6
  •   slm yuvaeasy    6 年前

    $ netstat -tapn|grep 8440
    tcp        0      0 0.0.0.0:8440            0.0.0.0:*               LISTEN      1203/java
    

    curl --insecure

    $ cat /etc/ambari-agent/conf/ambari-agent.ini
    ...
    [security]
    ssl_verify_cert = 0
    ...
    

    $ grep -E '\[https|verify' /etc/python/cert-verification.cfg
    [https]
    #verify=platform_default
    verify=disable
    

    ambari-agent

    $ grep -E "\[security|force" /etc/ambari-agent/conf/ambari-agent.ini
    [security]
    force_https_protocol=PROTOCOL_TLSv1_2
    

    ambari-agent restart

    Disabling TLSv1 & TLS1.1 - Enabling TLSv1.2