Data Guard :: Failover Standby Rac / Share Steps For Failover
Aug 26, 2010
In my production -- we have 3 nodes Primary RAC with 2 Nodes Physical secondary Standby RAC. Due to some reason we will have to failover means 3 nodes Primary Standby RAC to 2 Nodes secondary physical Standby RAC. share Steps for Failover ( not switchover) from 3 Nodes Primary RAC to 2 Nodes Physical Standby RAC. I know this is generic and can be find from google but issue is how can failover from 3 nodes primary standby rac to 2 nodes secondary physical standby rac.
i have Oracle 10g data guard set up on windows environment.....i need to know the what are the right steps to perform failover in case the primary database gets fail.
If flashback is enable in physical standby database 1. If we failover at 11AM can I flash back NEW primary database to 6 AM ? 2. if I convert physically standby database to snapshot standby database at 11AM , Can I flashback snapshot standby database to 6 AM and do some works on it (DML operations) then converting the snapshot standby database into physical standby database ?
I am very new to data guard...I have a primary and a physical standby database...If the primary got crashed, are the users directly switched to physical standyby (is it automated) or the DBA has to manually do the fail over or switch over....is it the same concept as in RAC like TAF....How can the DBA know that the users are disconnected from the database.....explain with some steps...
I am running some small tests here in my test env, using dataguard. I have configured the Primary and Standby with Maximum Availability... they're running just fine. Now i want to execute a failover test (i have already ran a Switchover test with the Broker successfully).
My question is very simple, at my point of view: What are the required steps to execute a successful manual failover? For example, i have my env as follows:
- Primary: prim1 - Standby: stdb1
Suppose that the primary database crashes in an unrecoverable way... is this case a manual failover would be necessary.
To do so, i would have to execute the following command, in my Standby database:
-- stdb1 is the standby database... DGMGRL> failover to stdb1
The above command is correct? Are there any required configurations after the failover? I read the Oracle Docs, and it says
what is the configuration dba has do for enabling the "fast-start failover concept" in the event of loss of primary database without requiring any manual steps to invoke the failover.whether the configuration are to be made in parameter file if yes what are those ?
My Data Guard 's Enviroment: Primary: 11.1.0.7 RAC 2 Nodes Standby: 11.1.0.7 Single Instance
I want to implement Fast-Start Failover at my enviroment. Reading requirements, I found that Oracle Flashback is needed (I don't use Flashback on my databases).
Best Practices recommends 60 min (1 hr) for DB_FLASHBACK_RETENTION_TARGET if I just want to use Flashback to fast-start failover.
My questions:
What 's better, setup up flash recovery area on ASM or O.S. filesystem? What is the recommended size for flash recovery area?
I have a situation where I want to configure primary database (11.2) with 2 remote destinations. dest_2 is the default and points to a standby on host_2. However, I also want the primary db to continue transporting redo to dest_3 on node_3 when node_2 is taken down (planned or unplanned).
1) Configure the ALTERNATE attribute of dest_2 to point to dest_3.
2) Configure tnsnames client-side failover on primary host to point to 2 nodes (node_2 and node_3).
Got Active DataGuard on a primary database, quite nicely sending its archive logs to its secondary. I can quite happily use the Broker and switchover between them.
Now if I take RMAN backups of the primary database, if I have to failover to the secondary, I'm gonna loose all those backups.
Well, I can restore the whole database to the backup, cos I can restore the control file, from the backuip and therefore I can restore the whole db.
But if I want to restore to a tablespace, I wont be able to , cos the db_unique_names names are different, and the DB ID's will be different.
Same goes if I use a recovery catalog....
so how do I failover/switchover without loosing my previous rman backups ??
if it is possible to create a failover setup without RAC and DG...For example:
I have 11.2.0.2 database (with EBS 12.1.3) on dbnode1...I would like to create another node to failover my primary datbas in case of any failure.
Steps i will follow: 1: creat dbnode2 2: install same os as dbnode1 3: install same oracle as dbnode1 4: share dbnode2 database between dbnode1 and dbnode2
Now, if hardware fails on dbnode1, can i manually failover and start my database on dbnode2?
I know we can do with RAC and DG,BUT without RAC and DG If not possible..
What is expected behavior for VIP on Linux servers? ( 2-node RAC)
1.When entire RAC stack is running, and one node is rebooted or crashed? Will VIP fail over to second node? 2.When one node RAC stack is contained (crsctl stop crs), shutting down RAC stack on remaining node? What happens to the VIP and Scan IP? 3.When RAC stacks are down on both nodes, one node has VIP on (eth0:1), reboot this node. What happens to the VIP?
We see some behavior in scenario 3, but don't know why the VIP was moved to second node as all Oracle processes are down.
configure correctly my application, Oracle is version 11R1
My physical configuration is :
Site 1 (main building) database on cluster : server1, server2 database : BASE1 available
Site2 (Backup Site) database on cluster : server3, server4 database : DGBASE1 not available for users (dataguard)
I want to acces BASE1 first in load balancing server1,2 (normal usage) and DGBASE1 in load balancing server3,4 in case of failover after 5 retries of 10 secondes (in case of main buiding out of order, DGBASE1 will be available for users)
My trouble is that with this configuration, access it randomly BASE1 or DGBASE1 but DGBASE1 is not available for users then I have failure
What could be the right configuration to obtain BASE1 first in load balancing (normal usage) and DGBASE1 in load balancing in case of failover of the BASE1 after 5 retries of 10 secondes ?
I can see that some failover events etc aren't supported (Differences between the ODP.NET Managed Driver and Unmanaged Driver), but is failover supported at all? I don't need to get notified, just to get the failover (clustered node switching) working. Is it supported in the managed driver?
I have installed Oracle RAC 10g on Redhat Linux 4.0. Till yesterday failover was happening that is when i stopped one instance on node01 the vip of node01 was transferred to node02.This was shown using ifconfig -a but now that is now happening.
Below information is given:
[oracle@node01 ~]$ crs_stat -t Name Type Target State Host ------------------------------------------------------------ ora.hitesh.db application ONLINE ONLINE node02 ora....h1.inst application ONLINE ONLINE node01
I am following lab instruction from my Oracle Admin Workshop book. The exercise is to create a 2nd listener to support connect-time failover. I have two listeners up and running. I connected the second listener using Static Database Registration in OEM, and then I started it.
When I looked in the listener.ora file, I was expecting to find the entries for FAILOVER=TRUE but it is not there. Both listeners are designed to connect to the same instance 'orcl'.
My questions:
1) Is the FAILOVER entry supposed to be done automatically when the second listener is created ? or is it always a manual entry ? 2) How can I test that it is working ?
I have this box that runs a dataguard observer in a test lab.
Is there a way to configure connect time failover and connection timeout in the connect identifier used by DGMGRL? It does not seem to respect the parameters I supply.
This is what I used for TNSNAMES connection string:
DB= (DESCRIPTION = (ADDRESS_LIST = (FAILOVER=ON)
[Code].....
When the hosts are up, if the instance is not running, I really get a quick reply from DGMGRL. However, when some host is not available, it can take quite a while to get an answer, minutes..
= I though I could use (TYPE=select) (METHOD=basic) (RETRIES=5) #(DELAY=1) # defaults to 1 when retries is set
in the connect_data section, but this does not seem to have an effect = I also tried ' (CONNECT_TIMEOUT=5)' in the ADDRESS_LIST section =
So assuming that Oracle NET services are not available on the hosts. What can I use to make 'dgmgrl' report that it can't connect to the host quickly ? Should I look into the TCP timeout parameters ?
I want to apply patch for the 3 node rac database. I am going to use rolling upgrade, one node at a time. For this i need to stop all the instances or any processes running out of $ORACLE_HOME. I will use srvctl stop home command. My concern is:
1) with the srvctl stop home command how the instances shutdowns, either shutdown immediate or shutdown abort. 2) if applications are connected to the shutdown node, will the sessions fail over to surviving nodes? 3) If it is shutdown abort, how session failover?how about long running queries ? will they also fail-over?
Oracle version 11.2.0.2 on RHEL 5.6 , planning to apply Db PSU 11.2.0.2.7
I am trying to design a database consolidation high-availability cluster for Oracle 11g R2 64-bit Enterprise Edition (X86-64) on Oracle Linux 6.x UEK, using Oracle 11.2.0.3 (the latest as of Aug 2012).
We don't need RAC Multi-Node now or in foreseeable future because none of the databases we run break the capacity of a single node. Likewise, we don't need to use Oracle VM to virtualise the database instances.
We plan to use SGA and PGA memory management to run multiple instances on same hardware operating on a single Linux 64-bit O/S image.Does it sound ok so far?
Two or three of 4-socket, 40-core Intel 64-bit servers with 512GB of RAM each (relatively cheap at today's HW commodity prices) will be sufficient to run all Oracle databases we have on Linux 64-bit.So the two HA options that I know of are:
(1) use Oracle Clusterware/Grid/ASM to provide for instance failover (2) use Oracle RAC One Node on top of Clusterware/Grid/ASM
As I understand it RAC One Node is significantly more expensive than the "free" Oracle Clusterware/ASM/Grid (since we own Oracle 11.2.0.3 Enterprise Licences already). So why should my employer pay for RAC One Node licence given they already own Single Instance Fail-Over and Restart protection from Clusterware/Grid/ASM ?
I also read that Data Guard 11.2 may not be supported with RAC One Node on 11.2? True? Will same Data Guard 11.2 work with a Single-Instance Failover running on Clusterware/Grid/ASM ?
-Who is running RAC One Node? Why? -Who is running Single Instance Failover with Clusterware? Why? -Who is using Data Guard with either of the above?
I have dataguard configuration operating in maximum availability mode with a local standby db (A - lgwr sync not using real time apply) and a remote standby db (B - lgwr async). I then simualted a crash of my primary database with batch jobs running. Since the stby db A is in lgwr sync option ,all the commited data in the current online redo log has been transmitted to stby A and is present in its stby redo log (Group 2).How do I apply this stby redo log to the remote stby db.
Tried the following methods.
1.ftp the stby redo log to the remote db and tried to regiter it, got an error that it is not completely archived.
2.issued the recover standby database command and supplied the stby redo log when it asked for the sequence in the stby redo, got an error saying there is corruption in a block(tried this option multiple times ended up with the same result.)
1.create a primary database 2.duplicate a physical standby database; 3.turn on flashback on both databases. 4.record SCN xxx on physical standby database. 5.convert physical standby to logical standby (using keep identity statement) 6.flashback to logical standby to xxx 7.convert logical standby to physical standby 8.using real time apply I got errors: Fast Parallel Media Recovery enabledManaged Standby Recovery starting Real Time ApplyMRP0:
Background Media Recovery waiting for new incarnation during transient logical upgrade procedure
Errors in file /home ora/ app/ oracle/ diag/ rdbms/ ora11gr1dg/ora11gr1dg/trace/ora11gr1dg_mrp0_10120.trc:ORA-19906: recovery target incarnation changed during recoveryManaged Standby Recovery not using Real Time ApplyErrors in file /home/ ora/app/ oracle/diag/ rdbms/ ora11gr1dg/ ora11gr1dg/ trace/ora11gr 1dg_mrp0_ 10120.trc:ORA-19906: recovery target incarnation changed during recovery
Errors appears every 10 seconds. Seems MPR0 is waiting for new incarnation for a long time. So am I.Standby database incarnation:
List of Database IncarnationsDB Key Inc Key DB Name DB ID STATUS Reset SCN Reset Time-------1 1 ORA11GR1 3853851354 CURRENT 1 08/09/2013 01:02:182 2 ORA11GR1 3853851354 ORPHAN 2127877 08/28/2013 19:22:01 BGV
GROUP# THREAD# SEQUENCE# sizeMB ARC STATUS T ---------- ---------- ---------- ---------- --- ---------------- - 1 1 115 512 YES CLEARING O 2 1 116 512 YES CURRENT O 6 1 114 512 YES CLEARING O 4 1 112 512 YES CLEARING O 5 1 113 512 YES CLEARING O 3 1 111 512 YES CLEARING O
6 rows selected. [code]...
Above result is from standby database.I am trying to understand the concept of standby redo log.My stadby database protection_mode is MAXIMUM PERFORMANCE.
working on setting up connection between a Windows 2008 server and a pair of Oracle 11g DBs in a RAC Cluster. One Database (let's say DatabaseA) is in one data center, and the other (DatabaseB) is an a secondary, backup database. The RAC Cluster is all set up, working fine, etc. However, I Need to set up the machine.config file on my Windows Server, to go only connect to DatabaseA, unless it fails, in which case, we want it to connect to DatabaseB. Think we could do this if the host app server was Linux/Unix, but it is windows, and I just don't have the background as to the parameters to set up in the machine.config file. They are similar, but different, and we want a very specific behavior (use DatabaseA, unless fails, then DatabaseB). Application is .NET 4.0 app.
i have configured physical standby in my local system, to check logshipping i created a table at primary db, wen i tried to check in standby, it says table does not exist..below are primary & standby alert entries..
Primary alert log
Fatal NI connect error 12514, connecting to: (DESCRIPTION=(ADDRESS=(PROTOCOL=TCP)(HOST=172.16.0.98)(PORT=1522))(CONNECT_DATA=(SERVER=DEDICATED)(SERVICE_NAME=STAND)(SERVER=dedicat ed)(CID=(PROGRAM=d:oracle11gappadministratorproduct11.1.0db_1inORACLE.EXE)(HOST=A960M)(USER=SYSTEM))(SERVER=dedicated))) VERSION INFORMATION: TNS for 64-bit Windows: Version 11.1.0.6.0 - Production
1) scn differs wrt primary in standby (i checked, 1day difference), how to make scn same?
2)i created a table in primary, its not refelecting in standby, (below i ve pasted alertlog entries)
ORA-27041: unable to open file OSD-04002: unable to open file O/S-Error: (OS 2) The system cannot find the file specified. Errors in file d:oracle11gappadministratordiag dbmsstandstand racestand_dbw0_6916.trc: ORA-01157: cannot identify/lock data file 2 - see DBWR trace file ORA-01110: data file 2: 'D:ORACLE11GAPPADMINISTRATORORADATASTANDSYSAUX01.DBF'
[code]....
3)wen i try to open standby database in read only mode gives below error..
ERROR at line 1: ORA-16004: backup database requires recovery ORA-01157: cannot identify/lock data file 1 - see DBWR trace file ORA-01110: data file 1: 'D:ORACLE11GAPPADMINISTRATORORADATASTANDSYSTEM01.DBF'
I have to implement Physical standby using same SID. parameters required to set on Primary and standby. Also what entries are required to do in TNS file. Recently we have faced hardware failure.
i have attended an interview there i got a question like--->There are 100 archives generated in standby in that a series of five archives are missing and those are not available in the primary then how will you do.....