Highly Available Runtime Configuration with Primary & Secondary Nodes
Here is an explanation for supporting a highly available configuration of the Thru Node runtime:
Highly Available Configuration:
Primary Node:
The node is assigned unique Node ID
Configure Thru Node Health Check port.
Secondary Node:
Deploy identical node on secondary machine (either via installing it again via the same script as the primary or copying the installation folder from primary to secondary.)
Remember the secondary is required to have same Node ID as primary
Set to passive state initially
Heartbeat Monitoring:
Monitor primary node's heartbeat via:
a. Local HealthCheck Port
b. Cloud control plane
Failover Mechanism:
Detect primary node failure (heartbeat down)
Activate secondary node
Ensure only one node active at a time to avoid race conditions
Best Practices:
Regular testing of failover process
Implement proper logging for troubleshooting