Oracle-從一次打補丁后出現的故障中總結數據庫啟停的流程

原創張玉龍 2022-01-13

6907

一次打Oracle小補丁引發的事故，環境是Linux 7上的19C兩節點RAC，打完小補丁過了幾個小時，其中一個節點發生重啟現象。

DB alert日志無報錯信息，05:17:22 開始啟動數據庫

2022-01-11T04:55:07.629428+08:00
TT02 (PID:67750): SRL selected for T-2.S-26402 for LAD:2
2022-01-11T04:55:16.834464+08:00
ARC2 (PID:67638): Archived Log entry 100056 added for T-2.S-26401 ID 0xecb9a320 LAD:1
# 重啟前無報錯信息
2022-01-11T05:17:22.625738+08:00
Starting ORACLE instance (normal) (OS id: 15301)
2022-01-11T05:17:22.777374+08:00
************************************************************
Instance SGA_TARGET = 307200 MB and SGA_MAX_SIZE = 307200 MB
************************************************************

CRS alter日志(故障節點2)

2022-01-11 00:32:49.113 [OCSSD(72170)]CRS-7503: The Oracle Grid Infrastructure process 'ocssd' observed communication issues between node 'rac2' and node 'rac1', interface list of local node 'rac2' is '192.168.0.2:54393;', interface list of remote node 'rac1' is '192.168.0.1:18424;'.
2022-01-11 01:06:58.153 [EVMD(68443)]CRS-7503: The Oracle Grid Infrastructure process 'evmd' observed communication issues between node 'rac2' and node 'rac1', interface list of local node 'rac2' is '192.168.0.2:40437;', interface list of remote node 'rac1' is '192.168.0.1:43662;'.
# 重啟前無報錯信息
2022-01-11 05:15:52.838 [OHASD(61516)]CRS-8500: Oracle Clusterware OHASD process is starting with operating system process ID 61516
2022-01-11 05:15:52.909 [OHASD(61516)]CRS-0714: Oracle Clusterware Release 19.0.0.0.0.
2022-01-11 05:15:52.924 [OHASD(61516)]CRS-2112: The OLR service started on node rac2.
2022-01-11 05:15:53.251 [OHASD(61516)]CRS-1301: Oracle High Availability Service started on node rac2.
# 此處顯示 05:08:56 節點2 由于大多數表決磁盤沒有完成 I/O，oracssdagent 即將重新啟動此節點。
2022-01-11 05:15:53.254 [OHASD(61516)]CRS-8011: reboot advisory message from host: rac2, component: cssagent, with time stamp: L-2022-01-11-05:08:56.490
2022-01-11 05:15:53.255 [OHASD(61516)]CRS-8013: reboot advisory message text: oracssdagent is about to reboot this node due to no I/O completions with majority of voting disks.
2022-01-11 05:15:53.256 [OHASD(61516)]CRS-8017: location: /etc/oracle/lastgasp has 2 reboot advisory log files, 1 were announced and 0 errors occurred
2022-01-11 05:15:53.809 [ORAROOTAGENT(69333)]CRS-8500: Oracle Clusterware ORAROOTAGENT process is starting with operating system process ID 69333
2022-01-11 05:15:53.814 [CSSDAGENT(69347)]CRS-8500: Oracle Clusterware CSSDAGENT process is starting with operating system process ID 69347
2022-01-11 05:15:53.815 [CSSDMONITOR(69354)]CRS-8500: Oracle Clusterware CSSDMONITOR process is starting with operating system process ID 69354
2022-01-11 05:15:53.832 [ORAAGENT(69343)]CRS-8500: Oracle Clusterware ORAAGENT process is starting with operating system process ID 69343
2022-01-11 05:15:54.485 [ORAAGENT(69655)]CRS-8500: Oracle Clusterware ORAAGENT process is starting with operating system process ID 69655

CRS alter日志(遠程節點1)

2022-01-11 03:54:42.911 [CVUD(117742)]CRS-10051: CVU found following errors with Clusterware setup : PRVG-13159 : On node "rac2" the file "/etc/resolv.conf" could not be parsed because the file is empty.
PRVE-0421 : No entry exists in /etc/fstab for mounting /dev/shm
PRVF-4664 : Found inconsistent name resolution entries for SCAN name "rac-scan"
PRVG-11368 : A SCAN is recommended to resolve to "3" or more IP addresses, but SCAN "rac-scan" resolves to only "/192.168.0.13"

2022-01-11 05:06:04.438 [CRSD(110493)]CRS-7503: The Oracle Grid Infrastructure process 'crsd' observed communication issues between node 'rac1' and node 'rac2', interface list of local node 'rac1' is '192.168.0.1:15060;', interface list of remote node 'rac2' is '192.168.0.2:33424;'.
2022-01-11 05:07:54.319 [ORAAGENT(120835)]CRS-5818: Aborted command 'check' for resource 'ora.LISTENER_SCAN1.lsnr'. Details at (:CRSAGF00113:) {1:16836:2} in /oracle/app/grid/diag/crs/rac1/crs/trace/crsd_oraagent_grid.trc.
2022-01-11 05:08:04.332 [EVMD(67934)]CRS-7503: The Oracle Grid Infrastructure process 'evmd' observed communication issues between node 'rac1' and node 'rac2', interface list of local node 'rac1' is '192.168.0.1:43662;', interface list of remote node 'rac2' is '192.168.0.2:40437;'.
2022-01-11 05:08:14.620 [CRSD(110493)]CRS-7503: The Oracle Grid Infrastructure process 'crsd' observed communication issues between node 'rac1' and node 'rac2', interface list of local node 'rac1' is '192.168.0.1:15060;', interface list of remote node 'rac2' is '192.168.0.2:33424;'.
2022-01-11 05:08:55.476 [OCSSD(71998)]CRS-1663: Member kill issued by PID 15278-387376 for 1 members, group DB+ASM. Details at (:CSSGM00044:) in /oracle/app/grid/diag/crs/rac1/crs/trace/ocssd.trc.
2022-01-11 05:08:55.579 [OCSSD(71998)]CRS-1607: Node rac2 is being evicted in cluster incarnation 516104351; details at (:CSSNM00007:) in /oracle/app/grid/diag/crs/rac1/crs/trace/ocssd.trc.
# 節點2 由于大多數表決磁盤沒有完成 I/O 被 reboot。
2022-01-11 05:08:56.492 [OHASD(58992)]CRS-8011: reboot advisory message from host: rac2, component: cssagent, with time stamp: L-2022-01-11-05:08:56.490
2022-01-11 05:08:56.494 [OHASD(58992)]CRS-8013: reboot advisory message text: oracssdagent is about to reboot this node due to no I/O completions with majority of voting disks.
2022-01-11 05:09:03.844 [OCSSD(71998)]CRS-7503: The Oracle Grid Infrastructure process 'ocssd' observed communication issues between node 'rac1' and node 'rac2', interface list of local node 'rac1' is '192.168.0.1:54820;', interface list of remote node 'rac2' is '192.168.0.2:58358;'.
2022-01-11 05:09:25.713 [OCSSD(71998)]CRS-1601: CSSD Reconfiguration complete. Active nodes are rac1 .
2022-01-11 05:09:27.305 [CRSD(110493)]CRS-5504: Node down event reported for node 'rac2'.
2022-01-11 05:09:28.523 [CRSD(110493)]CRS-2773: Server 'rac2' has been removed from pool 'Generic'.
2022-01-11 05:09:28.525 [CRSD(110493)]CRS-2773: Server 'rac2' has been removed from pool 'ora.orcl'.

通過OSW分析系統資源使用情況
分析iostat，以下是5個 voting disk

所有磁盤的繁忙程度

可見磁盤的問題不大，下面分析vmstat可以看出當時內存使用非常嚴重

使用了SWAP

cache使用較多，不知道為啥

CPU使用率不高
知識點

oracle@rac2:/home/oracle> cat /proc/sys/vm/min_free_kbytes   # 頁最小閾值
4194304
頁低閾值 pages_low = pages_min*5/4 = 5242880k 
頁高閾值 pages_high = pages_min*3/2 = 6291456k

# 剩余內存小于頁最小閾值， 說明進程可?內存都耗盡了， 只有內核才可以分配內存。
# 剩余內存落在頁最小閾值和頁低閾值中間， 說明內存壓力比較大， 剩余內存不多了。 這時 kswapd0 會執?內存回收，直到剩余內存大于頁高閾值為止。
# 剩余內存落在頁低閾值和頁高閾值中間， 說明內存有一定壓力， 但還可以滿?新內存請求。
# 剩余內存大于頁高閾值， 說明剩余內存比較多， 沒有內存壓力。

節點2重啟前，剩余內存較少(5190700k), 小于 pages_low(5242880K)，內存壓力大。

zzz ***Tue Jan 11 05:05:32 CST 2022
procs -----------memory---------- ---swap-- -----io---- -system-- ------cpu-----
 r  b   swpd   free   buff  cache   si   so    bi    bo   in   cs us sy id wa st
16  1 4490752 5207092  84692 253125568    0    0   478   273    0    0  2  2 95  0  0
 6  1 4494592 5189996  84708 253123104    0 3324 30951 34025 197572 111405  1  3 96  0  0
 4  0 4494848 5190700  84732 253122400    0  188 24189  3557 146797 88372  1  2 98  0  0
Linux OSWbb v8.3.2 rac2
SNAP_INTERVAL 15
CPU_CORES 80
VCPUS 160
OSWBB_ARCHIVE_DEST /oracle/app/grid/oracle.ahf/data/repository/suptools/rac2/oswbb/grid/archive
zzz ***Tue Jan 11 05:16:06 CST 2022
procs -----------memory---------- ---swap-- -----io---- -system-- ------cpu-----
 r  b   swpd   free   buff  cache   si   so    bi    bo   in   cs us sy id wa st
12  1      0 380316096   6660 1503980    0    0   159     8  372  219  1  4 95  0  0
 7  0      0 378362304   7852 1584300    0    0 24612  4577 90896 98582  6  2 92  0  0
15  0      0 377275776   9536 1601152    0    0 21580 21502 102854 123845  4  2 94  0  0

數據庫 SGA(300G) + PGA(80G) + 進程內存(3000個進程左右，預估每個3M，合計9G) < 400G

[root@pg13 ~]# cat oswps.sh
fname=$1
data_array=(`grep "zzz" $fname |awk -F"***" '{print $2}' |sed 's/ /_/g'`)

for i in ${!data_array[@]};
do
        sed_start_1=${data_array[$i]}
        sed_start_2=`echo $sed_start_1 |sed 's/_/ /g'`
        i=$i+1
        sed_end=${data_array[$i]}
        sed_end=`echo $sed_end |sed 's/_/ /g'`

        num=`sed -n "/$sed_start_2/,/$sed_end/p" $fname |egrep ^oracle |wc -l`
        echo $sed_start_1 $num
done

[root@pg13 ~]# sh oswps.sh rac2_ps_22.01.11.0500.dat
Tue_Jan_11_05:03:47_CST_2022 3022
Tue_Jan_11_05:04:02_CST_2022 3031
Tue_Jan_11_05:04:17_CST_2022 0
Tue_Jan_11_05:16:06_CST_2022 6
Tue_Jan_11_05:16:21_CST_2022 0
Tue_Jan_11_05:16:36_CST_2022 0
Tue_Jan_11_05:16:51_CST_2022 0
Tue_Jan_11_05:17:06_CST_2022 6
Tue_Jan_11_05:17:21_CST_2022 2
Tue_Jan_11_05:17:36_CST_2022 74

分析 meminfo
從 meminfo 看，CACHE使用較多(250G), 匿名頁使用也較多（163G），CACHE一般為文件系統頁緩存(ORACLE ASM 不使用 CACHE)，從PS的監控數據看，未找到異常占用內存的進程。
但后面分現 meminfo 中還有一個異常點，就是 pagetables 占的內存較多（PageTables: 91600048 kB），剩余的大頁較多（HugePages_Free: 153805），不正常。

zzz ***Tue Jan 11 05:05:32 CST 2022
MemTotal:       790552132 kB
MemFree:         5201744 kB
MemAvailable:          0 kB
Buffers:           84692 kB
Cached:         250267324 kB  # <<<<< 250G
SwapCached:        17512 kB
Active:         163760368 kB  # <<<<< 163G
Inactive:       113563000 kB
Active(anon):   163400368 kB
Inactive(anon): 113033788 kB
Active(file):     360000 kB
Inactive(file):   529212 kB
Unevictable:     4251944 kB
Mlocked:         4252088 kB
SwapTotal:      20971516 kB
SwapFree:       16480764 kB
Dirty:               204 kB
Writeback:             0 kB
AnonPages:      31370864 kB
Mapped:         207003588 kB
Shmem:          249168216 kB
Slab:            2858256 kB
SReclaimable:     962168 kB
SUnreclaim:      1896088 kB
KernelStack:      119856 kB
PageTables:     91600048 kB  # <<<<< 91G
NFS_Unstable:          0 kB
Bounce:                0 kB
WritebackTmp:          0 kB
CommitLimit:    218627868 kB
Committed_AS:   313497152 kB
VmallocTotal:   34359738367 kB
VmallocUsed:     2418400 kB
VmallocChunk:   34357114360 kB
HardwareCorrupted:     0 kB
AnonHugePages:         0 kB
CmaTotal:              0 kB
CmaFree:               0 kB
HugePages_Total:   192988
HugePages_Free:    153805  # <<<<< 300G
HugePages_Rsvd:        0
HugePages_Surp:        0
Hugepagesize:       2048 kB
DirectMap4k:    34501632 kB
DirectMap2M:    242862080 kB
DirectMap1G:    528482304 kB

繼續分析，查看DB ALERT LOG，可以看到在打完補丁，起庫的日志中看到大頁只分配了39183(76G)，剩余共享內存使用的4k的頁，分配了58581835(223G)。

2022-01-11T00:28:34.776117+08:00
 Domain name: user.slice
2022-01-11T00:28:34.776192+08:00
 Per process system memlock (soft) limit = UNLIMITED
2022-01-11T00:28:34.776257+08:00
 Expected per process system memlock (soft) limit to lock
 instance MAX SHARED GLOBAL AREA (SGA) into memory: 300G
2022-01-11T00:28:34.776383+08:00
 Available system pagesizes:
  4K, 2048K
2022-01-11T00:28:34.776504+08:00
 Supported system pagesize(s):
2022-01-11T00:28:34.776567+08:00
  PAGESIZE  AVAILABLE_PAGES  EXPECTED_PAGES  ALLOCATED_PAGES  ERROR(s)
2022-01-11T00:28:34.776633+08:00
        4K       Configured              14        58581835        NONE
2022-01-11T00:28:34.777018+08:00
     2048K            39387          153601           39183        NONE

這里為什么沒有全用上大頁呢，懷疑當時起庫時可用的大頁不足。

查看打補丁時的停庫(00:13:23)和起庫時間(00:28:34)

2022-01-11T00:04:57.263597+08:00
Thread 2 advanced to log sequence 26368 (LGWR switch),  current SCN: 17283142928376
  Current log# 7 seq# 26368 mem# 0: +DATA/ORCL/ONLINELOG/group_7.398.1080744843
2022-01-11T00:04:57.492759+08:00
TT03 (PID:103953): SRL selected for T-2.S-26368 for LAD:2
2022-01-11T00:05:02.189653+08:00
ARC1 (PID:47485): Archived Log entry 99931 added for T-2.S-26367 ID 0xecb9a320 LAD:1
# 停庫時間
2022-01-11T00:13:23.046802+08:00
Shutting down ORACLE instance (immediate) (OS id: 43111)

2022-01-11T00:14:10.945385+08:00
Stopping background process RBAL
2022-01-11T00:14:14.725352+08:00
freeing rdom 0
freeing the fusion rht of pdb 0
2022-01-11T00:14:28.584765+08:00
Warning: 2 processes are still attacheded to shmid 753673:
 (size: 57344 bytes, creator pid: 18969, last attach/detach pid: 28293)
Instance shutdown complete (OS id: 43111)
# 起庫時間
2022-01-11T00:28:34.620322+08:00
Starting ORACLE instance (normal) (OS id: 54670)
2022-01-11T00:28:34.773499+08:00
************************************************************
Instance SGA_TARGET = 307200 MB and SGA_MAX_SIZE = 307200 MB
************************************************************

起庫前(00:28:34)大頁內存的使用情況，剩余39808(77G)，果然起庫時可用的大頁不足。

zzz ***Tue Jan 11 00:28:24 CST 2022
MemTotal:       790552132 kB
MemFree:        359520492 kB
MemAvailable:   354082620 kB
Buffers:          142740 kB
Cached:         10248144 kB
...
HugePages_Total:   192988
HugePages_Free:    39808
HugePages_Rsvd:      421
HugePages_Surp:        0
Hugepagesize:       2048 kB

所以再次懷疑打補丁關庫(00:13:23)后，大頁沒有釋放。

# 停庫時間
2022-01-11T00:13:23.046802+08:00
Shutting down ORACLE instance (immediate) (OS id: 43111)

2022-01-11T00:14:10.945385+08:00
Stopping background process RBAL
2022-01-11T00:14:14.725352+08:00
freeing rdom 0
freeing the fusion rht of pdb 0
2022-01-11T00:14:28.584765+08:00
Warning: 2 processes are still attacheded to shmid 753673:
 (size: 57344 bytes, creator pid: 18969, last attach/detach pid: 28293)
Instance shutdown complete (OS id: 43111)

可以看到數據庫是在00:14:28分關閉的，但大00:22:09時，大頁還沒有釋放。

zzz ***Tue Jan 11 00:22:09 CST 2022
MemTotal:       790552132 kB
MemFree:        361520900 kB
MemAvailable:   355090684 kB
Buffers:          130284 kB
Cached:          8306876 kB
...
HugePages_Total:   192988
HugePages_Free:    39808
HugePages_Rsvd:      421
HugePages_Surp:        0
Hugepagesize:       2048 kB

同時細看日志，發現關閉時，仍有進程在使用共享內存。

2022-01-11T00:14:28.584765+08:00
Warning: 2 processes are still attacheded to shmid 753673:
 (size: 57344 bytes, creator pid: 18969, last attach/detach pid: 28293)

通過ps監控數據，定位到關庫后，還沒有關閉的server進程(82595)，在00:20:33已經關完數據庫的情況下又產生一個進程(73140)。正常數據庫關閉后，SERVER進程會自動關掉，這次挺奇怪。

grid@rac2:/home/grid> cat /oracle/app/grid/.../oswps/rac2_ps_22.01.11.0000.dat |sed -n '/00:14:38/,/00:28:24/p'|egrep "^oracle" |grep oracleorcl2

oracle    82595  82535  19  0.0  0.0 315033232 15416 pipe_wait  S 00:13:37 00:00:00 oracleorcl2 (DESCRIPTION=(LOCAL=YES)(ADDRESS=(PROTOCOL=beq)))
oracle    82595  82535  19  0.0  0.0 315033232 15416 pipe_wait  S 00:13:37 00:00:00 oracleorcl2 (DESCRIPTION=(LOCAL=YES)(ADDRESS=(PROTOCOL=beq)))
oracle    82595  82535  19  0.0  0.0 315033232 15416 pipe_wait  S 00:13:37 00:00:00 oracleorcl2 (DESCRIPTION=(LOCAL=YES)(ADDRESS=(PROTOCOL=beq)))
oracle    73140  73132  19  0.0  0.0 458072 14960 pipe_wait     S 00:20:33 00:00:00 oracleorcl2 (DESCRIPTION=(LOCAL=YES)(ADDRESS=(PROTOCOL=beq)))
oracle    82595  82535  19  0.0  0.0 315033232 15416 pipe_wait  S 00:13:37 00:00:00 oracleorcl2 (DESCRIPTION=(LOCAL=YES)(ADDRESS=(PROTOCOL=beq)))
oracle    73140  73132  19  0.0  0.0 458072 14960 pipe_wait     S 00:20:33 00:00:00 oracleorcl2 (DESCRIPTION=(LOCAL=YES)(ADDRESS=(PROTOCOL=beq)))
oracle    82595  82535  19  0.0  0.0 315033232 15416 pipe_wait  S 00:13:37 00:00:00 oracleorcl2 (DESCRIPTION=(LOCAL=YES)(ADDRESS=(PROTOCOL=beq)))
oracle    73140  73132  19  0.0  0.0 458072 14960 pipe_wait     S 00:20:33 00:00:00 oracleorcl2 (DESCRIPTION=(LOCAL=YES)(ADDRESS=(PROTOCOL=beq)))
oracle    82595  82535  19  0.0  0.0 315033232 15416 pipe_wait  S 00:13:37 00:00:00 oracleorcl2 (DESCRIPTION=(LOCAL=YES)(ADDRESS=(PROTOCOL=beq)))
oracle    73140  73132  19  0.0  0.0 458072 14960 pipe_wait     S 00:20:33 00:00:00 oracleorcl2 (DESCRIPTION=(LOCAL=YES)(ADDRESS=(PROTOCOL=beq)))
oracle    82595  82535  19  0.0  0.0 315033232 15416 pipe_wait  S 00:13:37 00:00:00 oracleorcl2 (DESCRIPTION=(LOCAL=YES)(ADDRESS=(PROTOCOL=beq)))

追蹤一下進程(82595)

grid@rac2:/home/grid> cat /oracle/app/grid/oracle.ahf/data/repository/suptools/rac2/oswbb/grid/archive/oswps/rac2_ps_22.01.11.0000.dat |sed -n '/00:15:23/,/00:15:38/p'|egrep "^oracle" |grep oracleorcl2
oracle    82595  82535  19  0.0  0.0 315033232 15416 pipe_wait                               S 00:13:37 00:00:00 oracleorcl2 (DESCRIPTION=(LOCAL=YES)(ADDRESS=(PROTOCOL=beq)))

grid@rac2:/home/grid> cat /oracle/app/grid/oracle.ahf/data/repository/suptools/rac2/oswbb/grid/archive/oswps/rac2_ps_22.01.11.0000.dat |sed -n '/00:15:23/,/00:15:38/p'|egrep 82535
oracle    82595  82535  19  0.0  0.0 315033232 15416 pipe_wait                               S 00:13:37 00:00:00 oracleorcl2 (DESCRIPTION=(LOCAL=YES)(ADDRESS=(PROTOCOL=beq)))
oracle    82535  72407  19  0.0  0.0 114040 13312 n_tty_read                                 S 00:13:37 00:00:00 sqlplus   as sysdba

grid@rac2:/home/grid> cat /oracle/app/grid/oracle.ahf/data/repository/suptools/rac2/oswbb/grid/archive/oswps/rac2_ps_22.01.11.0000.dat |sed -n '/00:15:23/,/00:15:38/p'|egrep 72407
oracle    82535  72407  19  0.0  0.0 114040 13312 n_tty_read                                 S 00:13:37 00:00:00 sqlplus   as sysdba
oracle    72407  72281  19  0.0  0.0 116788  3320 do_wait                                    S 00:13:32 00:00:00 -bash

grid@rac2:/home/grid> cat /oracle/app/grid/oracle.ahf/data/repository/suptools/rac2/oswbb/grid/archive/oswps/rac2_ps_22.01.11.0000.dat |sed -n '/00:15:23/,/00:15:38/p'|egrep 72281
oracle    72407  72281  19  0.0  0.0 116788  3320 do_wait                                    S 00:13:32 00:00:00 -bash
oracle    72281  69874  19  0.0  0.0 163376  2420 poll_schedule_timeout                      S 00:13:32 00:00:00 sshd: oracle@pts/2

grid@rac2:/home/grid> cat /oracle/app/grid/oracle.ahf/data/repository/suptools/rac2/oswbb/grid/archive/oswps/rac2_ps_22.01.11.0000.dat |sed -n '/00:15:23/,/00:15:38/p'|egrep 69874
root      69874  58356  19  0.0  0.0 163376  6036 poll_schedule_timeout                      S 00:13:31 00:00:00 sshd: oracle [priv]
oracle    72281  69874  19  0.0  0.0 163376  2420 poll_schedule_timeout                      S 00:13:32 00:00:00 sshd: oracle@pts/2

grid@rac2:/home/grid> cat /oracle/app/grid/oracle.ahf/data/repository/suptools/rac2/oswbb/grid/archive/oswps/rac2_ps_22.01.11.0000.dat |sed -n '/00:15:23/,/00:15:38/p'|egrep 58356
root     122271  58356  19  0.0  0.0 163376  6032 poll_schedule_timeout                      S 00:04:16 00:00:00 sshd: oracle [priv]
root     108972  58356  19  0.0  0.0 165196  5928 poll_schedule_timeout                      S 09:10:36 00:00:00 sshd: oracle [priv]
root      69874  58356  19  0.0  0.0 163376  6036 poll_schedule_timeout                      S 00:13:31 00:00:00 sshd: oracle [priv]
root      58356      1  19  0.0  0.0 112756  4352 poll_schedule_timeout                      S   Aug 19 00:04:31 /usr/sbin/sshd -D
root      47981  58356  19  0.0  0.0 167728  6424 poll_schedule_timeout                      S   Nov 06 00:00:00 sshd: hgaqjc [priv]
root      47614  58356  19  0.0  0.0 167716  6424 poll_schedule_timeout                      S   Nov 04 00:00:00 sshd: hgaqjc [priv]
root      31839  58356  19  0.0  0.0 163388  6028 poll_schedule_timeout                      S   Jan 05 00:00:00 sshd: oracle [priv]

通過補丁的日志也能發現這個進程(82595)的父進程(82535)，在安裝補丁時顯示仍在占用庫文件libclntsh.so.19.1，同事kill了以后安裝的補丁。
more /oracle/app/oracle/product/19c/db_1/cfgtoollogs/opatch/opatch2022-01-11_00-26-54AM_1.log

由于進程仍然在使用共享內存，導致無法回收共享內存段，HugePage仍在占用，導致重啟后無足夠的HugePage使用，重新分配4k的內存頁。

遺留一個疑問點，進程(82595)未正常退出的原因(Oracle SR的回復)。

當時這個 server process 在執行一個 OS kernel 函數 pipe_wait, 看起來它在執行OS的 systemcall 并進入異常狀態，
要進一步分析這個問題，必須當時就檢查這個 process 執行的具體的 kernel level 的完整的 callstack:

使用root執行
# cat /proc/<OSPID>/stack <==替換這里的<OSPID>為當時出問題的進程

請您在下次發生類似問題時，收集未退出進程的 /proc/<OSPID>/stack

BTW: 我們查了一下，很多 hang 在 pipe_wait 上的的進程都是在做 OS vfs IO 操作。

總結

根據HugePage的變化總結本次故障

1. 數據庫打補丁關庫前(停庫時間00:13:23)，大頁內存使用正常
zzz ***Tue Jan 11 00:12:38 CST 2022
HugePages_Total:   192988
HugePages_Free:    39808
HugePages_Rsvd:      421

2. 數據庫打補丁關庫后，起庫前(起庫時間00:28:34)，共享內存沒有釋放，導致 HugePage 沒有釋放
zzz ***Tue Jan 11 00:22:09 CST 2022
HugePages_Total:   192988
HugePages_Free:    39808
HugePages_Rsvd:      421

3. 數據庫打補丁起庫后(起庫時間00:28:34)，剩余的大頁內存正在被使用，因為剩余的大頁不足以存放 SGA，所以 SGA 沒有完全用上 HugePage，大部分使用的4K的頁。
zzz ***Tue Jan 11 00:28:39 CST 2022
HugePages_Total:   192988
HugePages_Free:    38026
HugePages_Rsvd:    37822

zzz ***Tue Jan 11 00:28:54 CST 2022
HugePages_Total:   192988
HugePages_Free:    16400
HugePages_Rsvd:    16196

zzz ***Tue Jan 11 00:29:09 CST 2022
HugePages_Total:   192988
HugePages_Free:      869
HugePages_Rsvd:      665

4. 在 01:13:58 左右，HugePage 被釋放，但是 Oracle 用不上。
zzz ***Tue Jan 11 01:13:58 CST 2022
HugePages_Total:   192988
HugePages_Free:    154049
HugePages_Rsvd:      244

同時82595進程也是在這個時間點消失的，腳本 grep 82595
Tue_Jan_11_01:13:43_CST_2022 1
Tue_Jan_11_01:13:58_CST_2022 0

5. 當數據庫連接數高時頁表占用過多內存，同時頁表過大影響性能，最后內存不足，導致IO無法完成，節點驅逐。
zzz ***Tue Jan 11 05:16:06 CST 2022
HugePages_Total:   192988
HugePages_Free:    192988
HugePages_Rsvd:        0

6. 2021-01-11 05:17:22 數據庫再次重啟時，幾乎全部分配的2m內存頁(Hugepage)，恢復了正常
zzz ***Tue Jan 11 05:19:37 CST 2022
HugePages_Total:   192988
HugePages_Free:    39810
HugePages_Rsvd:      423

2022-01-11T05:17:22.778865+08:00
 Available system pagesizes:
  4K, 2048K 
2022-01-11T05:17:22.778981+08:00
 Supported system pagesize(s):
2022-01-11T05:17:22.779041+08:00
  PAGESIZE  AVAILABLE_PAGES  EXPECTED_PAGES  ALLOCATED_PAGES  ERROR(s)
2022-01-11T05:17:22.779104+08:00
        4K       Configured              14              14        NONE
2022-01-11T05:17:22.779226+08:00
     2048K           192988          153601          153601        NONE

總結下數據庫啟停流程

1. 確認主機
ip a
uname -a

2. 確認數據庫和spfile
$ export ORACLE_SID=<SID of the instance>
$ export ORACLE_HOME=<location of ORACLE_HOME>
$ sqlplus / as sysdba
SQL> show parameter name
SQL> show parameter spfile

3. 注釋 crontab ，注意root用戶下是否也存在數據庫相關的crontab

$ crontab –l > /home/oracle/crontab_bak_xxxx
$ crontab –e  --清空

4. 檢查活動會話，是否存在類似備份的任務，提前處理掉
SQL> @ase.sql

5. 關閉監聽
su - grid
srvctl stop listener -node rac2

6. 殺會話
su - oracle
ps -ef|grep "LOCAL=NO"|grep $ORACLE_SID |grep -v grep |wc -l
ps -ef|grep "LOCAL=NO"|grep $ORACLE_SID |grep -v grep |awk '{print "kill -9 " $2}'|sh

7. 檢查是否有未提交事物
select
  s.username,
  t.xidusn,
  t.xidslot,
  t.xidsqn,
  x.ktuxesiz
from
  sys.x$ktuxe  x,
  sys.v_$transaction  t,
  sys.v_$session  s
where
  x.inst_id = userenv('Instance') and
  x.ktuxesta = 'ACTIVE' and
  x.ktuxesiz =1 and
  t.xidusn = x.ktuxeusn and
  t.xidslot = x.ktuxeslt and
  t.xidsqn = x.ktuxesqn and
  s.saddr = t.ses_addr;

8. 檢查session，切日志做checkpoint
select inst_id, sessions_current,sessions_highwater from gv$license;
@kill TYPE='USER'
alter system switch logfile;
alter system archive log current;
alter system checkpoint;

9. 關閉數據庫，跟蹤 alert log，是否異常輸出
shutdown immediate

如果使用srvctl關閉數據庫，注意加參數，默認是abort
srvctl stop instance -db orcl -instance orcl1 -stopoption IMMEDIATE
srvctl stop database -db orcl -stopoption IMMEDIATE

10. 確認共享內存段已釋放，是否存在殘留進程
cat /proc/meminfo    --檢查大頁內存已釋放
ipcs -m
對于一個主機上多個實例時，無法從共享段區分屬于哪個實例使用，需要增加一步使用oracle的sysresv工具:
$ORACLE_HOME/bin/sysresv

如果實例關閉后共享內存在等待1-2分鐘未自動釋放，可以使用ipcrm手動釋放共享內存段 
ipcrm -m shmid    //刪除共享內存段

檢查是否存在殘留進程
ps -ef|grep LOCAL |grep $ORACLE_SID |grep -v grep

11. 啟動數據庫，跟蹤 alert log
SQL> startup
或者使用srvctl
srvctl start instance -db orcl -instance orcl1
srvctl start database -db orcl


2022-01-10T23:54:29.121877+08:00
 Available system pagesizes:
  4K, 2048K
2022-01-10T23:54:29.121989+08:00
 Supported system pagesize(s):
2022-01-10T23:54:29.122047+08:00
  PAGESIZE  AVAILABLE_PAGES  EXPECTED_PAGES  ALLOCATED_PAGES  ERROR(s)
2022-01-10T23:54:29.122105+08:00
        4K       Configured              14              14        NONE
2022-01-10T23:54:29.122474+08:00
     2048K           192475          153601          153601        NONE

12. 起庫后的檢查
檢查是否可以切換日志
alter system switch logfile;

檢查監聽
lsnrctl status

檢查OGG，檢查Dataguard

墨力計劃 oracle

最后修改時間：2022-01-14 10:21:52

「喜歡這篇文章，您的關注和贊賞是給作者最好的鼓勵」

關注作者

文章被以下合輯收錄

oracle運維筆記（共6篇）

命令記錄

特色一级强游戏,海奥华预言免费阅读,51漫画兑换码,美女裸体无遮挡永久免费观看网站,lubuntu线路检测入口

Oracle-從一次打補丁后出現的故障中總結數據庫啟停的流程

總結

文章被以下合輯收錄

評論