一OGG 常用命令
进入到 OGG 的 home 目录 一般为 /odc, 运行 ./ggsci
1. 查看全部进程状态 info all
Program: 进程类型
Status: 进程状态
Group: 进程名
Time since chkpt :进程不一致时间
2. 查看某个进程的详细信息: info ext1
进程的 seqno 是 : 426 RBA 23065088
3. 进程启停命令
start mgr 开启管理进程(必须先于其他进程启动)
start * 开启其他所有进程
start ext1 开启某一指定进程 ( group 列 即为进程名)
stop mgr 停止管理进程
stop * 停止所有进程
stop ext1 停止某一指定进程
4. 查看 OGG 系统日志
view report ggserr
View report ext1 指定某一进程名
故障排查步骤
1. 查看 OGG 状态,把终止的进程启动。
2. 启动命令发出以后,多查看一会进程状态,例如 info all 查看进程状态。!重复执行上次操作。
3. 如果进程状态仍然为 STOPPED 或 ABENDED ,则可查看进程日志或 ggserr.log
此为 OGG 网络问题引起,网络被拒绝。很可能为目标端 mgr 进程没有启动。
即使有时候进程状态为 RUNNING ,但是 Time since chkpt 时间没有减少,或者没有到 Lag at chkpt 上去, OGG 都是有问题的。
常见错误
1. 查看源端进程日志,报错信息为:
|
2015-04-30 12:31:27 ERROR OGG-01031 Oracle GoldenGate Capture for Oracle, p_emr.prm: There is a problem in network communication, a remote file problem, encryption keys for target and source do not match (if using ENCRYPT) or an unknown error. (Reply received is Unable to open file "./dirdat/trace/hzss/p_emr/la012279" (error 13, Permission denied)). 2015-04-30 12:31:27 ERROR OGG-01668 Oracle GoldenGate Capture for Oracle, p_emr.prm: PROCESS ABENDING. |
目标端对应的报错信息:
|
2015-04-30 19:34:52 WARNING OGG-01519 Waiting at EOF on input trail file ./dirdat/trace/hzss/p_emr/la012279, which is not marked as complete; but succeeding trail file ./dirdat/trace/hzss/p_emr/la012280 exists. If ALTER ETROLLOVER has been performed on source extract, ALTER EXTSEQNO must be performed on each corresponding downstream reader. |
原因是由于源端到目标端的网络中断,导致 datapump 投递进程 abended
解决方法:
|
源端的投递进程产生新的文件点: GGSCI (oggdb01) 3> alter extract p_emr etrollover 2015-04-30 12:47:24 INFO OGG-01520 Rollover performed. For each affected output trail of Version 10 or higher format, after starting the source extract, issue ALTER EXTSEQNO for that trail's reader (either pump EXTRACT or REPLICAT) to move the reader's scan to the new trail file; it will not happen automatically. EXTRACT altered. GGSCI (oggdb01) 4> start p_emr 目标端的 replicat 复制进程从新的序列号开始( la012280 ) GGSCI (localhost) 3> stop hzss-r2 GGSCI (localhost) 4> alter replicat hzss-r2 extseqno 12280 extrba 0 GGSCI (localhost) 5> start hzss-r2 |
2. 处理不同步的表 先停掉目标端的 replicat 复制进程(确保 scn 号一致) 从源端按 SCN 号导出不同步的表:
|
expdp system/oracle directory=dmpdir dumpfile=zjhis_tab20150522.dmp logfile=zjhis_tab20150522.log tables=zjhis.CF_DUOMEITI,zjhis.CF_XIANGMUZD,zjhis.YZ_INPATIENT_ORDER flashback_scn = 12585744134590
将 dmp 文件导入到目标端 : impdp system/oracle directory=dmpdir dumpfile=zjhis_tab20150522.dmp logfile=impdp20150522.log remap_schema=zjhis:sq_zjhis table_exists_action=replace 在 replicat 复制进程里添加如下参数: map zjhis.CF_DUOMEITI, target sq_zjhis.CF_DUOMEITI, filter (@GETENV ("transaction", "csn") > 12585744134590); map zjhis.CF_XIANGMUZD, target sq_zjhis.CF_XIANGMUZD, filter (@GETENV ("transaction", "csn") > 12585744134590); map zjhis.YZ_INPATIENT_ORDER, target sq_zjhis.YZ_INPATIENT_ORDER, filter (@GETENV ("transaction", "csn") > 12585744134590); |
3. 源端启动 datapump 投递进程时,报错如下:
|
2015-11-23 20:28:58 ERROR OGG-01031 There is a problem in network communication, a remote file problem, encryption keys for target and source do not match (if using ENCRYPT) or an unknown error. (Reply received is Unable to open file "/opt/app/ogg/dirdat/trace/his/la000951" (error 11, Resource temporarily unavailable)). |
大概意思如下: 网络通信中存在一个问题,一个远程文件问题,目标和源的加密密钥不匹配(如果使用加密)或一个未知的错误。(收到的回复是无法打开的文件 “/opt/app/ogg/dirdat/trace/his/la000951” ( error 11, 资源暂时不可用))
查看目标端的 ggserr.log 文件,在出错的时间段内 检索 la000951 文件,发现如下比较明显的信息:
|
2015-11-23 13:30:49 WARNING OGG-01223 Oracle GoldenGate Collector for Oracle: Unable to lock file "/opt/app/ogg/dirdat/trace/his/la000951" (error 11, Resource temporarily unavailable). Lock currently held by process id (PID) 26115. 2015-11-23 13:34:46 INFO OGG-01669 Oracle GoldenGate Collector for Oracle: Opening /opt/app/ogg/dirdat/trace/his/la000951 (byte -1, current EOF 1194). 2015-11-23 13:34:56 WARNING OGG-01223 Oracle GoldenGate Collector for Oracle: Unable to lock file "/opt/app/ogg/dirdat/trace/his/la000951" (error 11, Resource temporarily unavailable). Lock currently held by process id (PID) 26115. 2015-11-23 13:34:56 WARNING OGG-01223 Oracle GoldenGate Collector for Oracle: Unable to open file "/opt/app/ogg/dirdat/trace/his/la000951" (error 11, Resource temporarily unavailable). |
原因是源端的 datapump 进程要往目标端的第 951 号 trail 文件写数据,而此时目标端的该文件被编号为 26115 的系统进程所锁定,造成源端无法打开文件而 abended 。
解决: 查找到该系统进程, kill 掉。最终重新启动 pump 进程成功。
