A highly available application is one that can be resilient to
a system outage in a clustered environment.
Several levels of application availability are possible:
- If an application error occurs, the application restarts itself
on the same node and corrects any potential cause for error (such as corrupt
control data). You can view the application as though it had started for
the first time.
- The application performs some amount of checkpoint-restart processing.
You can view the application as if it were close to the point of failure.
- If a system outage occurs, the application is restarted on a backup
server. You can view the application as though it had started for the first
time.
- If a system outage occurs, the application is restarted on a backup
server and performs some amount of checkpoint-restart processing across the
servers. You can view the application as if it were close to the point of
failure.
- If a system outage occurs, a coordinated failover of both the application
and its associated data to another node or nodes in the cluster occurs. You can view the application as though it had started for the first
time.
- If a system outage occurs, a coordinated failover of both the application
and its associated data to another node or nodes in the cluster occurs. The application performs some amount of checkpoint-restart processing
across the servers. You can view the application as if it were close to
the point of failure.
Note: In cases 1 through 4 above, you are
responsible for recovering the data.