Common Faults in Installing OMS

Latest reply: May 23, 2018 03:46:38

1. A Message Indicating a Decryption Failure Is Displayed When OMS Is Installed

Symptom:
java.security.InvalidKeyException: Illegal key size or default parameters
        at javax.crypto.Cipher.checkCryptoPerm(Cipher.java:1011)
        at javax.crypto.Cipher.implInit(Cipher.java:786)
        at javax.crypto.Cipher.chooseProvider(Cipher.java:849)
        at javax.crypto.Cipher.init(Cipher.java:1213)
        at javax.crypto.Cipher.init(Cipher.java:1153)
        at org.wcc.framework.util.encrypt.PasswordUtil.decryptByAes256(PasswordUtil.java:180)
        at org.wcc.framework.util.encrypt.PasswordUtil.decryptByAes256(PasswordUtil.java:146)
        at com.omm.encrypt.DecryptCommandMain.decryptCommandLine(DecryptCommandMain.java:71)
        at com.omm.encrypt.DecryptCommandMain.main(DecryptCommandMain.java:130)
[2014-12-09 02:58] ERROR Failed to get omm db password. [installcontroller.sh(main):788](12851)

Fault Locating:
The most common cause of this decryption failure is a JDK version mismatch. OMS installs its own JDK under user omm. If a JDK is already installed, or a JAVA_HOME environment variable is already set, under user root, the OMS installation may fail.
Solution:
a. Uninstall JDK and remove the JAVA_HOME environment variables under user root.

b. Run the env | grep JAVA_HOME command as user root to check for Java-related environment variables.

c. Delete any existing Java-related environment variables from the /root/.bash_profile file and any JAVA_HOME-related variables from the /etc/profile file.
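
The checks in steps b and c can be scripted. The following is a minimal sketch, assuming a POSIX shell; the function name find_java_refs is hypothetical and is not part of OMS. It only lists the lines that mention JAVA_HOME so that they can be reviewed and removed by hand.

```shell
# find_java_refs (hypothetical helper): print every line that mentions
# JAVA_HOME in the given files as "file:line:content".
# Files that do not exist are skipped.
find_java_refs() {
    for f in "$@"; do
        if [ -f "$f" ]; then
            # /dev/null is added so that grep always prints the file name.
            grep -n 'JAVA_HOME' "$f" /dev/null || true
        fi
    done
}

# Usage (as user root):
#   env | grep JAVA_HOME
#   find_java_refs /root/.bash_profile /etc/profile
```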

 

 

2. OMS Installation Fails Because LogMan Installation Fails

Symptom:

20171031152636388001.png
Fault Locating:
The cron configuration file has been modified. As a result, the crontab file cannot be modified and LogMan installation fails.
Solution:
Rename cron.deny.bak to cron.deny. Uninstall and then reinstall OMS.

 

 

3. GaussDB Installation Failure Due to Insufficient Semaphore Resources
Symptom:

20171031152636441002.png 
Fault Locating:

The OS semaphore resources are insufficient.

Solution:

Increase the kernel parameter that limits OS semaphore resources (kernel.sem) as required:

a. Add the following parameter to the /etc/sysctl.conf file:

kernel.sem = 2500 32000 1000 1280

b. On SUSE Linux, run the following command so that kernel parameters are read automatically at startup (skip this step on Red Hat):

/sbin/chkconfig boot.sysctl on

c. Run the following command for the kernel parameters to take effect:

/sbin/sysctl -p

d. Reinstall OMS.
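
Before reinstalling, it can be useful to confirm that the running kernel has picked up the new limits. The following is a minimal sketch, assuming a POSIX shell; sem_meets_target is a hypothetical helper name. The four kernel.sem fields are SEMMSL, SEMMNS, SEMOPM, and SEMMNI, in that order.

```shell
# sem_meets_target (hypothetical helper): succeed if each of the four
# kernel.sem fields (SEMMSL SEMMNS SEMOPM SEMMNI) in $1 is at least as
# large as the corresponding field in $2. Returns 2 on malformed input.
sem_meets_target() {
    # Split both strings into eight positional parameters.
    set -- $1 $2
    [ "$#" -eq 8 ] || return 2
    [ "$1" -ge "$5" ] && [ "$2" -ge "$6" ] && [ "$3" -ge "$7" ] && [ "$4" -ge "$8" ]
}

# Usage, comparing the live values with those from step a:
#   sem_meets_target "$(cat /proc/sys/kernel/sem)" "2500 32000 1000 1280" \
#       || echo "kernel.sem is still below the configured values"
```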

 


4. GaussDB Installation Failure Due to an Incorrect Group for User root
Symptom:
20171031152637668003.png
Fault Locating:
User root is not assigned to the correct group.

Solution:
Change the ID of the group to which user root belongs to 0, and then reinstall OMS.

 


5. GaussDB Installation Failure Because shared_buffers Exceeds kernel.shmmax
Symptom:
20171031152638013004.png
Fault Locating:
The value of shared_buffers is greater than the value of kernel.shmmax.
Solution:
a. Change the value of kernel.shmmax in the /etc/sysctl.conf file:

kernel.shmmax = value

You are advised to set kernel.shmmax to 50% of the physical memory.

Run the following command to query the physical memory: grep MemTotal /proc/meminfo

Information similar to the following is displayed:

MemTotal: 6088240 kB

In this example, set kernel.shmmax to 3117178880 (6088240 x 1024 / 2 = 3117178880):

kernel.shmmax = 3117178880

b. On SUSE Linux, run the following command so that kernel parameters are read automatically at startup:

/sbin/chkconfig boot.sysctl on

c. Run the following command for the kernel parameters to take effect:

/sbin/sysctl -p
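
The arithmetic in step a can be scripted as follows. This is a minimal sketch, assuming a shell with 64-bit arithmetic (the norm on 64-bit Linux); the helper names half_mem_bytes and mem_total_kb are hypothetical.

```shell
# half_mem_bytes (hypothetical helper): convert a MemTotal value given in kB
# to 50% of that amount in bytes.
half_mem_bytes() {
    echo $(( $1 * 1024 / 2 ))
}

# mem_total_kb (hypothetical helper): read the MemTotal value (in kB) from a
# meminfo-style file, /proc/meminfo by default.
mem_total_kb() {
    awk '/^MemTotal:/ {print $2}' "${1:-/proc/meminfo}"
}

# Usage: print the line to place in /etc/sysctl.conf
#   echo "kernel.shmmax = $(half_mem_bytes "$(mem_total_kb)")"
```

With the MemTotal value from the example (6088240), half_mem_bytes prints 3117178880.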

 


6. OMS Installation Failure Due to Incorrect LDAP Installation

Symptom:
2014-12-09 02:58 566 linux-126 SlapdServer ERROR ldap server rpm is not installed!

2014-12-09 02:58 569 linux-126 SlapdServer ERROR ldap server rpm is not installed!

Fault Locating:
The LDAP server RPM packages are not installed.
Solution:

Run the preinstall script to add the LDAP-related RPM packages.


7. OMS Installation Failure Due to an Incorrect Floating IP Address

Symptom:  

20171031152638453005.jpg
Fault Locating:  
The NIC IP address does not map to the host name.

Processing logic of the installation script: if OMS is installed in single-node mode, the default values of om_float_ip_interface and ws_float_ip_interface are extracted from the output of the ifconfig command using sed:

local defaultInterface=$(ifconfig | sed -n "/${g_server1IP1}/{x;p};h" | awk '{print $1}')

g_server1IP1 is obtained by running the hostname -i command. If the IP address returned by hostname -i does not appear in the ifconfig output, no interface is found and the error "The float_ip_interface is empty" is displayed.
Solution:
a. Run the hostname -i command to check the host IP address.

b. Run the ifconfig command to check whether the corresponding NIC IP address is the same as the host IP address. If they are different, change them to the same value.

c. Verify that the host name in /etc/hosts maps to the NIC IP address and reinstall OMS.
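
The lookup performed by the installation script can be reproduced by hand to confirm the fix before reinstalling. The following is a minimal sketch, assuming the ifconfig layout in which the interface name line comes directly before the address line; iface_for_ip is a hypothetical helper name that wraps the same sed and awk logic quoted above.

```shell
# iface_for_ip (hypothetical helper): read ifconfig output on stdin and print
# the name of the interface whose address line contains IP address $1.
# sed prints the line that precedes the matching line; awk keeps its first field.
iface_for_ip() {
    sed -n "/$1/{x;p;};h" | awk '{print $1}'
}

# Usage: empty output means the installer would report
# "The float_ip_interface is empty".
#   ifconfig | iface_for_ip "$(hostname -i)"
```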

 


8. OMS Installation Failure Due to Incorrect File Permissions
Symptom:
Config_cmd:[/opt/huawei/Bigdata/OMSV100R001C00x8664/gaussdb/bin/gs_guc reload -c "replconninfo1='localhost=10.75.201.227 localport=12211 remotehost=10.75.201.218 remoteport=12211'"]sudo: must be setuid root
[/opt/huawei/Bigdata/OMSV100R001C00x8664/tools/omsconfig.sh:488] Gauss config failure, return:[1].[/opt/huawei/Bigdata/OMSV100R001C00x8664/tools/omsconfig.sh:626] ERROR: config Database system failure!
Fault Locating:
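The owner or permissions of /usr/bin/sudo have been changed, so sudo is no longer setuid root.
Solution:
Check the owner and permissions of /usr/bin/sudo, restore them, and then reboot:
ls -l /usr/bin/sudo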
chown root:root /usr/bin/sudo
chmod 4755 /usr/bin/sudo
reboot
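
To verify the result without reading the ls output by eye, a small check can be used. This is a minimal sketch, assuming GNU stat (the -c option); has_mode is a hypothetical helper name.

```shell
# has_mode (hypothetical helper): succeed if the octal permission bits of
# file $1 equal $2. Relies on GNU stat.
has_mode() {
    [ "$(stat -c '%a' "$1")" = "$2" ]
}

# Usage:
#   has_mode /usr/bin/sudo 4755 || echo "/usr/bin/sudo does not have mode 4755"
```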


9. OMS Installation Fails Because the Installation Directory Is Not Empty

Symptom:

20171031152639386006.png
Fault Locating:
The OMS installation directory is not empty. It must not contain any other files or directories.
Solution:

a. If the installation directory contains the uninstall script, run the script to clear the directory.

b. If the installation directory contains files irrelevant to FusionInsight HD, delete the files and reinstall OMS.


10. OMS Installation Fails Because /etc/hosts Is Not Correctly Configured
Symptom:

20171031152640301007.jpg
Fault Locating:

The localhost entry (127.0.0.1 localhost) is not configured in /etc/hosts.
Solution:
Add the line 127.0.0.1 localhost to /etc/hosts. Uninstall and then reinstall OMS.

 

11. OMS Installation Failure Due to Incorrect Permission on the /etc/sudoers File
Symptom:

20171031152641401008.png
Fault Locating:

The permission on the /etc/sudoers file is incorrect. As a result, the sudo command fails to be executed and OMS fails to be installed.

Solution:

Change the permission on the /etc/sudoers file to 440. Uninstall and then reinstall OMS.

12. NTP Startup Failure During Standby OMS Installation
Symptom:
20171031152642228009.png
Solution:
a. Uninstall the standby OMS.

b. Disable the firewall.

c. Reinstall the standby OMS.


Common Faults in Installing a Cluster
1. NodeAgent Installation Failure
Symptom:
2014-12-09 02:58,164 WARN  [pool-33-thread-1] 160.138.1.19: Failed Command execution CommandResponse [success=false, completed=true, exitCode=12, stdOut=Installing node agent...
keytool error: java.lang.Exception: Alias <tomcat> does not exist
ERROR:Failed to start nodeagent Start nodeagent process.
ERROR:Failed to install node agent.
, stdErr=chmod: cannot access `/opt/huawei/Bigdata/nodeagent/setup/web/*': No such file or directory, command=Command [executableScript=su - omm -c "/opt/huawei/Bigdata/nodeagent/setup/installnodeagent.sh {OM_NODEAGENT_NODEID} 160.157.0.99 20025 /var/log/Bigdata {IS_START_AGENT}", capture=EXECUTE_ONLY, script=su - {OMM_USER} -c "{NODEAGENT_HOME}/setup/installnodeagent.sh {OM_NODEAGENT_NODEID} {OM_NODEAGENT_CONTROLLER_IP} {OM_NODEAGENT_CONTROLLER_PORT} {BIGDATA_LOG_HOME} {IS_START_AGENT}", success=null, ifFail=null]]  com.huawei.hadoop.om.controller.agent.nodesetup.SshActionChannelHandler.processFail(SshActionChannelHandler.java:515)
Fault Locating:

The permission on /etc/hosts is 600. As a result, user omm cannot read the file and the IP address cannot be resolved when user omm runs the hostname -i command.
Solution:
Change the permission on /etc/hosts to 644 (or another value that lets all users read the file) and click Retry.
2. Cluster Installation Fails Due to a Package Download Failure
Symptom:

20171031152642663010.png
Possible Causes:

Cause 1: The disk space is exhausted.

Cause 2: The mutual trust fails between the node that fails to download packages and the controller node.

Check whether the settings for user omm (wheel group) in DenyUsers, DenyGroups, AllowUsers, and AllowGroups in the /etc/ssh/sshd_config file are correct.

If security hardening has been performed, the /etc/ssh/sshd_config file may contain restrictive settings such as ListenAddress and AllowGroups. These settings affect SSH connections, resulting in a package download failure.

Log in to the active OMS node as user omm and connect to the other agent nodes using SSH. If a password is required, mutual trust is broken. Check the permissions on /home/omm/.ssh and /home/omm/.ssh/*. The normal values are 700 and 600 respectively. In this case both were 750, which shows that the permissions on /home/omm/.ssh had been changed, breaking mutual trust and causing the installation package download to fail.

Run the expect command on the involved node. If the command cannot be found, the package download fails because expect is missing.

Solution:

a. For cause 1

Delete unused files in the environment to release space.

b. For cause 2

Comment out ListenAddress and AllowGroups in the /etc/ssh/sshd_config file.

Change the permissions on /home/omm/.ssh and /home/omm/.ssh/* to 700 and 600 respectively.

Run the preinstall script on the involved node or install the expect software.
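
The permission repair for cause 2 can be sketched as a small function. This is a minimal sketch, assuming a find that supports -maxdepth (GNU and BSD find do); fix_ssh_perms is a hypothetical helper name, and it should be run as the owner of the directory or as root.

```shell
# fix_ssh_perms (hypothetical helper): set directory $1 to mode 700 and
# every regular file directly inside it to mode 600.
fix_ssh_perms() {
    chmod 700 "$1" || return 1
    find "$1" -maxdepth 1 -type f -exec chmod 600 {} +
}

# Usage:
#   fix_ssh_perms /home/omm/.ssh
```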

4. Node Discovery Failure
Symptom:

20171031152643058011.png
Fault Locating:

Unsupported encryption algorithms are configured in /etc/ssh/sshd_config.

Solution:
a. Modify /etc/ssh/sshd_config to remove the unsupported encryption algorithms.

b. Restart the SSH service.

service sshd restart

c. Re-discover nodes.

 


5. Multiple Service IP Addresses Are Displayed in the Node Discovery Result

Symptom:
20171031152644437012.png

Possible Cause:

The same host name corresponds to multiple IP addresses in the /etc/hosts file.

Solution:

Delete the unnecessary IP addresses from the /etc/hosts file and re-discover nodes.
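
Duplicate mappings can be hard to spot in a long hosts file. The following is a minimal sketch, assuming a POSIX awk; dup_hosts is a hypothetical helper name. It prints every host name that appears with more than one distinct IP address.

```shell
# dup_hosts (hypothetical helper): print each host name that is mapped to
# more than one distinct IP address in hosts file $1 (default /etc/hosts).
dup_hosts() {
    awk '
        { sub(/#.*/, "") }                    # strip comments
        NF >= 2 {
            for (i = 2; i <= NF; i++)
                if (!seen[$i SUBSEP $1]++)    # count each (name, IP) pair once
                    count[$i]++
        }
        END { for (h in count) if (count[h] > 1) print h }
    ' "${1:-/etc/hosts}" | sort
}

# Usage:
#   dup_hosts /etc/hosts
```

Names such as localhost that are listed for both 127.0.0.1 and ::1 are also reported, so the output needs to be reviewed before any line is deleted.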

 


This post was last edited by z00381792 at 2014-12-09 02:58.
