环境:11g单点的asm,在进行停库时发生问题无法启动
数据库集群start报错crs-4639/4000
检查后台集群进程
ps -ef|grep d.bin
发现只有ohash进程
进行排查ohash集群日志
2023-06-26 21:10:32.855: [UiServer][452949760]{0:0:9127} Done for ctx=0x7fc3ec011030
2023-06-26 21:10:32.855: [ INIT][452949760]{0:0:9127} Exiting on request of the Policy Engine...
2023-06-26 21:10:32.855: [ INIT][452949760]{0:0:9127} Done.
2023-06-26 21:10:32.867: [UiServer][450848512] CS(0x7fc3f0006bb0)set Properties ( root,0x7fc41c09fae0)
2023-06-26 22:59:39.817: [ default][4004874048] Created alert : (:OHAS00117:) : TIMED OUT WAITING FOR OHASD MONITOR
2023-06-26 23:07:47.023: [ default][4241286976] Created alert : (:OHAS00117:) : TIMED OUT WAITING FOR OHASD MONITOR
2023-06-26 23:10:51.773: [ default][3683718976] Created alert : (:OHAS00117:) : TIMED OUT WAITING FOR OHASD MONITOR
2023-06-26 23:39:49.445: [ default][1268569920] Created alert : (:OHAS00117:) : TIMED OUT WAITING FOR OHASD MONITOR
2023-06-27 00:01:54.466: [ default][4034611008] Created alert : (:OHAS00117:) : TIMED OUT WAITING FOR OHASD MONITOR
通过网络资源搜索解决办法,需要对文件进行dd操作,才能启动,找到相应文件后,本次使用mv备份并重启
重启后依然报错,但检查进程发现agent进程启动,检查集群状态,发现集群启动
prw-r--r-- 1 grid oinstall 0 Jun 27 00:02 npohasd
srwxrwxrwx 1 grid oinstall 0 May 26 15:29 sAevm
srwxrwxrwx 1 grid oinstall 0 May 26 15:29 sCevm
srwxrwxrwx 1 grid oinstall 0 May 26 15:27 sCRSD_UI_SOCKET
srwxrwxrwx 1 grid oinstall 0 May 26 15:29 sgylglDBG_CSSD
srwxrwxrwx 1 grid oinstall 0 May 26 15:29 sgylglDBG_EVMD
srwxrwxrwx 1 grid oinstall 0 May 26 15:27 sgylglDBG_OHASD
srwxrwxrwx 1 grid oinstall 0 May 26 15:29 sOCSSD_LL_gylgl_
srwxrwxrwx 1 grid oinstall 0 May 26 15:29 sOCSSD_LL_gylgl_localhost
-rw-r--r-- 1 grid oinstall 0 Apr 27 16:23 sOCSSD_LL_gylgl_localhost_lock
-rw-r--r-- 1 grid oinstall 0 Apr 27 16:23 sOCSSD_LL_gylgl__lock
srwxrwxrwx 1 grid oinstall 0 May 26 15:27 sOHASD_IPC_SOCKET_11
-rw-r--r-- 1 grid oinstall 0 Apr 27 16:21 sOHASD_IPC_SOCKET_11_lock
srwxrwxrwx 1 grid oinstall 0 May 26 15:27 sOHASD_UI_SOCKET
srwxrwxrwx 1 grid oinstall 0 May 26 15:29 sOracle_CSS_LclLstnr_localhost_1
-rw-r--r-- 1 grid oinstall 0 Apr 27 16:23 sOracle_CSS_LclLstnr_localhost_1_lock
srwxrwxrwx 1 grid oinstall 0 May 26 15:27 sprocr_local_conn_0_PROL
-rw-r--r-- 1 grid oinstall 0 Apr 27 16:21 sprocr_local_conn_0_PROL_lock
srwxrwxrwx 1 grid oinstall 0 May 26 15:29 sSYSTEM.evm.acceptor.auth
通用解决:
/bin/dd if=/var/tmp/.oracle/npohasd of=/dev/null bs=1024 count=1
此问题不可永久解决,为11g版本bug,如遇到进行此法操作即可