Cav - these are exactly my thoughts, too!
...but...
With a DR scenario, the (business) intention is to have something which can be run in exactly the same way from either of 2 locations. So, if the Primary Site fails, I need to find an automated way to switch to the Secondary Site. My big concern is how to STOP any associated Java processes from running. I can see problems if I simply STOP the Primary ETL Service and START the Secondary ETL Service, since there are a number of Java processes (Extraction, Transformation, Load, Aggregation, etc.) "in progress" (but I am not sure if I am worrying unnecessarily?). I think I need to find a way to gracefully exit those Java processes before starting the Secondary ETL Service. Otherwise there may be exact duplicate Java processes attempting the same ETL tasks on the same ODS, even though there would be only one ETL Service running.
I need to draw up a switchover "process" for various scenarios:
1. The ETL Service's processes fail and data is written to the emergency file. Switchover could be triggered by an OS alarm monitoring the emergency file location (SNMP).
2. The Site fails (DR). This would be obvious in SCI and could be automated through an Alarm Reaction.
3. The ETL Service fails ("HA"). Again, this would be obvious in SCI and could be automated through an Alarm Reaction.
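For the first scenario, this is roughly the check I imagine the OS alarm performing. It is only a sketch in Python for illustration: the function name is made up, the directory would be whatever the emergency file location really is, and in practice the alarm (SNMP) would raise the event rather than a script like this.

```python
import os


def emergency_files(directory):
    """Return the sorted names of non-empty regular files in `directory`.

    Assumption: "data was written to the emergency file" shows up as at
    least one non-empty file in the monitored location.
    """
    found = []
    for name in os.listdir(directory):
        path = os.path.join(directory, name)
        # Ignore sub-directories and empty placeholder files.
        if os.path.isfile(path) and os.path.getsize(path) > 0:
            found.append(name)
    return sorted(found)
```

A non-empty result would be the condition that raises the alarm and starts the switchover for that scenario.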
BUT this is [i]just[/i] the ETL Services. What about all of the spawned ETL Java processes which are still running? I mentioned we are on Solaris, and I think I am looking for a way to ensure that the Java processes exit naturally/gracefully [i]before[/i] starting the Secondary ETL Service. Unless, in some way, the Secondary ETL Service is "clever": it detects which Java processes are already running, simply waits/loops until the Primary ETL Service's Java processes exit naturally, and then takes over the task automatically?
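To show what I mean by "exit gracefully, then start the Secondary", here is a rough sketch of the stop-and-wait step. The assumptions are big ones: it is Python purely for illustration, it has to run on the host where the Java processes live (so it fits the "HA" case, not a dead site), the list of PIDs comes from somewhere I have not worked out yet, and I am assuming the ETL Java processes shut down cleanly on SIGTERM. A JVM does run its shutdown hooks on SIGTERM, but whether the ETL jobs finish their work properly is something I would need to confirm. The function names are my own.

```python
import os
import signal
import time


def is_running(pid):
    """Return True if a process with this PID still exists."""
    try:
        os.kill(pid, 0)  # signal 0 delivers nothing, it only checks the PID
    except ProcessLookupError:
        return False
    except PermissionError:
        return True  # the process exists but belongs to another user
    return True


def stop_and_wait(pids, timeout=300.0, poll=2.0):
    """Ask each PID to exit with SIGTERM, then poll until all are gone.

    Returns the PIDs that are still alive when the timeout expires;
    an empty list means it is safe to carry on.
    """
    for pid in pids:
        try:
            os.kill(pid, signal.SIGTERM)  # polite request, not kill -9
        except ProcessLookupError:
            pass  # already gone
    deadline = time.monotonic() + timeout
    remaining = [p for p in pids if is_running(p)]
    while remaining and time.monotonic() < deadline:
        time.sleep(poll)
        remaining = [p for p in remaining if is_running(p)]
    return remaining
```

The switchover would then only START the Secondary ETL Service if stop_and_wait(...) returns an empty list. Anything left over after the timeout would need a person (or a harder kill) to decide what happens next.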
I hope I explained that OK..?

Thanks cav!
TT