TOO_MANY_ACTIVE
Advisory
Subject Name Syntax
_RV.WARN.RVFT.TOO_MANY_ACTIVE.group
Purpose
A fault tolerance member presents this warning advisory message when it detects that too many group members are active. This situation is usually transient, and resolves itself quickly without intervention. However, if the situation persists, it might indicate problems that require attention.
Remarks
Rendezvous fault tolerance software detects the situation when it receives a heartbeat message from a member that was not already known to be active.
This warning indicates that the following conditions all hold simultaneously:
| • | The number of members broadcasting heartbeat messages is greater than the active goal parameter. |
| • | This member is active. |
| • | This member will not deactivate (that is, its rank indicates it should remain active). |
The conclusion is that one or more other members are active that should not be active. In most cases those members quickly detect the anomaly, and deactivate. Normally the situation resolves itself within one activation interval.
Notice that a member does not receive this advisory if it is either inactive or about to deactivate.
Diagnosis
This warning can indicate any of several situations:
| • | A network separates into two or more disconnected parts, and then reconnects. |
Rendezvous fault tolerance software arranges for the correct number of active members on each disconnected part of the network. When the parts reconnect, the active members with the lowest rank become extraneous, and quickly deactivate. This warning indicates that a network problem occurred.
| • | Members have different active goal parameters. |
If member A has an active goal of one member, and B has an active goal of two members, then A and B will both become active, and A receives this complaint that too many members are active. (Both A and B also receive the PARAM_MISMATCH error advisory, with Active goal differs in the RVADV_DESC field.)
| • | Interval parameters to Rendezvous fault tolerance software are too short compared to the speed of the hardware clock and operating system services. |
See Step 4: Choose the Intervals.
| • | The active member did not send timely heartbeat messages. For example, a callback function blocked, or did not return promptly, delaying the heartbeat messages. |
See Ensure Timely Event Processing.
Message Fields
|
Field Name |
Description |
|
|
A string describing the specific situation of this advisory. This field has datatype |