Federated Clusters

Federated Clusters displays present high-level and detailed cache performance metrics for the cluster. Performance statistics are derived from the cluster Destination and Origin MBeans. Destination information shows how efficiently each node in the local cluster participant is sending data to each destination cluster participant. Origin information shows how efficiently each node in the local cluster participant is receiving data from destination cluster participants.

Use these displays to quickly assess total utilization and throughput metrics for all caches in the cluster.

Federated Destination Detail

Table shows performance and utilization data, such as bandwidth usage and bytes sent, for Federated Destinations on the selected cluster. Use this display to do high level utilization analysis. Each row is a different Destination MBean. Click a row to see details in the Federated Destination Summary display. Sort data by the highest and lowest values of interest by clicking on the column heading.

ocm_fed_desDetail.gif

title_bar_shortNew00104.gif

 

Filter By:

 

Cluster:

Select a cluster from the drop-down menu.

 

Host:

Select a host from the drop-down menu.

Federated Destination Detail by Node

 

Location

A unique identifier for each node. It is defined as: member_name.machine.rack.site.

 

BytesSentSecs

The number of bytes sent per second.

 

ConnectRetryTimeoutMillis

The configured connect retry timeout.

 

Connection

The name of the JMX connection used to access the cluster data.

 

CurrentBandwidth

The current amount of bandwidth being used, in megabits per second, for sending replicate message.

 

DeltaReplicateAllTotalTime

The difference in the total amount of time the replicateAll request took since the last data sample.

 

DeltaTIME_STAMP

The amount of time since the last data sample.

 

DeltaTotalBytesSent

The difference in the total number of bytes sent since the last data sample.

 

DeltaTotalEntriesSent

The difference in the total number of entries sent since the last data sample.

 

DeltaTotalErrorResponses

The difference in the total number of error responses since the last data sample.

 

DeltaTotalMsgSent

The difference in the total number of messages sent since the last data sample.

 

DeltaTotalMsgUnacked

The difference in the total number of unacknowledged messages since the last data sample.

 

DeltaTotalRecordsSent

The difference in the total number of records sent since the last data sample.

 

ErrorDescription

A description of the error. A value exists only if the sender is in an error state.

 

EstimatedReplicateAllRemainingTime

The estimated remaining time, in milliseconds, to complete the replicateAll request.

 

Expired

When checked, this connection is expired due to inactivity.

 

GeoIp

The Geo-IP metadata

 

HostName

The name of the host.

 

MaxBandwidth

The maximum amount of bandwidth per second, in megabits, for sending replicate message, where -1.0 means the maximum bandwidth is not specified.

 

Member

The member information of the destination node.

 

MemberName

The name of the member.

 

MsgApplyTimePercentileMillis

The 90-percentile value, in milliseconds, of the time taken to apply the replication messages on the destination.

 

MsgNetworkRoundTripTimePercentileMillis

The 90-percentile value, in milliseconds, of the time taken by transmission of replication messages and the corresponding ack messages over the network.

 

MsgSentSecs

The number of messages sent per second.

 

Name

The sender name.

 

ParticipantType

The participant type. Valid types are cluster and interceptor.

 

RateReplicateAllTotalTime

The number of replicateAll requests per second.

 

RateTotalBytesSent

The total number of bytes sent per second.

 

RateTotalEntriesSent

The total number of entries sent per second.

 

RateTotalErrorResponses

The total number of error responses per second.

 

RateTotalMsgSent

The total number of messages sent per second.

 

RateTotalMsgUnacked

The total number of unacknowledged messages per second.

 

RateTotalRecordsSent

The total number of records sent per second.

 

RecordBacklogDelayTimePercentileMillis

The 90-percentile value , in milliseconds, of the time the journal records are in the cache waiting to be replicated.

 

ReplicateAllPercentComplete

The percent of work completed for a replicateAll request.

 

ReplicateAllTotalTime

The total amount of time the replicateAll request took, in milliseconds.

 

SendTimeoutMillis

The configured send timeout.

 

State

The participant state, where:

0 is Ok

1 is Warning

2 is Error

 

Status

The participant status.

 

TIME_STAMP

The date and time of the data update.

 

TotalBytesSent

The total number of bytes sent.

 

TotalEntriesSent

The total number of cache entries sent.

 

TotalErrorResponses

The total number of responses with an error.

 

TotalMsgSent

The total number of replication messages sent. A replication message might contain multiple journal records

 

TotalMsgUnacked

The total number of unacknowledged replication messages.

 

TotalRecordsSent

The total number of journal records sent. A journal record might consist of multiple cache entries that are part of the same transaction.

 

name

The destination cluster name.

 

nodeid

The unique identifier for the node.

 

service

The Federated Service name.

 

subType

The Federated Service sub-type.

 

type

The Coherence MBean type (Federation, in this case).

Federated Destination Summary

Detailed performance and utilization data, such as bandwidth usage and bytes sent per second, for a Federated Destinations location. Use this display to do low level utilization analysis. Check the metrics for to determine whether more capacity is needed.

ocm_fed_desSumm.gif

title_bar_shortNew00105.gif

 

Filter By:

 

Cluster:

Select a cluster from the drop-down menu.

 

Host:

Select a host from the drop-down menu.

 

Location:

Select a location from the drop-down menu. Location is a unique identifier for each node and defined as: member_name.machine.rack.site.

 

Id:

The unique identifier for the node.

 

Participant Type

The participant type. Valid types are cluster and interceptor.

 

State

The participant state, where:

0 is Ok

1 is Warning

2 is Error

 

Bytes Sent Secs

The number of bytes sent per second.

 

Connect Retry Timeout (ms)

The configured connect retry timeout.

 

Current Bandwidth

The current amount of bandwidth being used, in megabits per second, for sending replicate message.

 

Estimated Replicate All Remaining Time

The estimated remaining time, in milliseconds, to complete the replicateAll request.

 

Geo IP

The Geo-IP metadata

 

Max Bandwidth

The maximum amount of bandwidth per second, in megabits, for sending replicate message, where -1.0 means the maximum bandwidth is not specified.

 

Status

The participant status.

 

Name

The sender name.

 

Msg Apply Time Percentile (ms)

The 90-percentile value, in milliseconds, of the time taken to apply the replication messages on the destination.

 

Msgs Sent Secs

The number of messages sent per second.

 

Record Backlog Delay Time Percentile (ms)

The 90-percentile value, in milliseconds, of the time the journal records are in the cache waiting to be replicated.

 

Replicate All Percentile Complete

The percent of work completed for a replicateAll request.

 

Replicate All Total Time

The total amount of time the replicateAll request took, in milliseconds.

 

Send Timeout (ms)

The configured send timeout.

 

Error Description

A description of the error. A value exists only if the sender is in an error state.

Trend Graph

Select a location from the drop-down menu to populate the trend graph. Location is a unique identifier for each node and defined as: member_name.machine.rack.site.

 

RateReplicateAllTotalTime: Traces the total number of replicateAll requests per second.

RateTotalBytesSent: Traces the total number of bytes sent per second.

RateTotalEntriesSent: Traces the total number of entries sent per second.

RateTotalErrorResponses: Traces the total number of error responses per second.

RateTotalMsgSent: Traces the total number of messages sent per second.

RateTotalMsgUnacked: Traces the total number of unacknowledged messages per second.

RateTotalRecordsSent: Traces the total number of records sent per second.

ReplicateAllPercentComplete: Traces the percent of completed replicateAll requests.

 

Start Time

The date and time the location was started. Location is a unique identifier for each node and defined as: member_name.machine.rack.site.

 

Base at Zero

Use zero for the Y axis minimum for all graphs.

 

Time Range

Select a time range from the drop down menu varying from 2 Minutes to Last 7 Days, or display All Data. To specify a time range, click Calendar button_calendar00106.gif.

trend_timerange00107.gif

By default, the time range end point is the current time. To change the time range end point, click Calendar button_calendar00108.gif and select a date and time from the calendar or enter the date and time in the text field using the following format: MMM dd, YYYY HH:MM. For example, Aug 21, 2011 12:24 PM.

Use the navigation arrows button_forwardback00109.gif to move forward or backward one time period. NOTE: The time period is determined by your selection from the Time Range drop-down menu.

Click Restore to Now to reset the time range end point to the current time.

 

Federated Origin Detail

Table shows performance and utilization data, such as bandwidth usage and bytes sent, for Federated Origins on the selected cluster. Use this display to do high level utilization analysis. Each row is a different Origin MBean. Click a row to see details in the Federated Origin Summary display. Sort data by the highest and lowest values of interest by clicking on the column heading.

ocm_fed_origDetail.gif

 

title_bar_shortNew00110.gif

 

Filter By:

 

Cluster:

Select a cluster from the drop-down menu.

 

Host:

Select a host from the drop-down menu.

Federated Origin Detail by Node

 

Location

A unique identifier for each node. It is defined as: member_name.machine.rack.site.

 

BytesReceivedSecs

The number of bytes received per second.

 

Connection

The name of the JMX connection used to access the cluster data.

 

DeltaTIME_STAMP

The amount of time since the last data sample.

 

DeltaTotalBytesReceived

The difference in the total number of bytes received since the last data sample.

 

DeltaTotalEntriesReceived

The difference in the total number of entries received since the last data sample.

 

DeltaTotalMsgReceived

The difference in the total number of messages received since the last data sample.

 

DeltaTotalMsgUnacked

The difference in the total number of unacknowledged messages since the last data sample.

 

DeltaTotalRecordsReceived

The difference in the total number of records received since the last data sample.

 

Expired

When checked, this connection is expired due to inactivity.

 

HostName

The name of the host.

 

Member

The member information of the destination node.

 

MemberName

The name of the member.

 

MsgApplyTimePercentileMillis

The 90-percentile value, in milliseconds, of the time taken to apply the replication messages on the origin.

 

MsgReceivedSecs

The number of messages received per second.

 

RateReplicateAllTotalTime

The number of replicateAll requests per second.

 

RateTotalBytesReceived

The total number of bytes received per second.

 

RateTotalEntriesReceived

The total number of entries received per second.

 

RateTotalMsgReceived

The total number of messages received per second.

 

RateTotalMsgUnacked

The total number of unacknowledged messages per second.

 

RateTotalRecordsReceived

The total number of records received per second.

 

RecordBacklogDelayTimePercentileMillis

The 90-percentile value, in milliseconds, of the time the journal records are in the cache waiting to be replicated.

 

TIME_STAMP

The date and time of the data update.

 

TotalBytesReceived

The total number of bytes received.

 

TotalEntriesReceived

The total number of cache entries received.

 

TotalErrorResponses

The total number of responses with an error.

 

TotalMsgReceived

The total number of replication messages received. A replication message might contain multiple journal records

 

TotalMsgUnacked

The total number of unacknowledged unacknowledged messages.

 

TotalRecordsReceived

The total number of journal records received. A journal record might consist of multiple cache entries that are part of the same transaction.

 

name

The destination cluster name.

 

nodeid

The unique identifier for the node.

 

service

The Federated Service name.

 

subType

The Federated Service sub-type.

 

type

The Coherence MBean type (Federation, in this case).

Federated Origin Summary

Detailed performance and utilization data, such as bandwidth usage and received per second, for a Federated Origin location. Use this display to do low level utilization analysis. Check the metrics for to determine whether more capacity is needed.

ocm_fed_origSumm.gif

title_bar_shortNew00111.gif

 

Filter By:

The display might include these filtering options:

 

Cluster:

Select a cluster from the drop-down menu.

 

Host:

Select a host from the drop-down menu.

 

Location:

Select a location from the drop-down menu. Location is a unique identifier for each node and defined as: member_name.machine.rack.site.

 

Bytes Received Secs

The number of bytes received per second.

 

Msg Apply Time Percentile (ms)

The 90-percentile value, in milliseconds, of the time taken to apply the replication messages on the origin.

 

Msgs Received Secs

The number of messages received per second.

 

Record Backlog Delay Time Percentile (ms)

The 90-percentile value, in milliseconds, of the time the journal records are in the cache waiting to be replicated.

 

Total Bytes Received

The total number of bytes received.

 

Total Entries Received

The total number of cache entries received.

 

Total Msg Received

The total number of replication messages received. A replication message might contain multiple journal records.

 

 

Total Msg Unacked

The total number of unacknowledged replication messages.

 

Total Records Received

The total number of journal records received. A journal record might consist of multiple cache entries that are part of the same transaction.

Trend Graph

Select a location from the drop-down menu to populate the trend graph. Location is a unique identifier for each node and defined as: member_name.machine.rack.site.

 

RateReplicateAllTotalTime: Traces the total number of replicateAll requests per second.

RateTotalBytesReceived: Traces the total number of bytes received per second.

RateTotalEntriesReceived: Traces the total number of entries received per second.

RateTotalErrorResponses: Traces the total number of error responses per second.

RateTotalMsgReceived: Traces the total number of messages received per second.

RateTotalMsgUnacked: Traces the total number of unacknowledged messages per second.

RateTotalRecordsReceived: Traces the total number of records received per second.

ReplicateAllPercentComplete: Traces the percent of completed replicateAll requests.

 

Start Time

The start date and time.

 

Base at Zero

Use zero for the Y axis minimum for all graphs.

 

Time Range

Select a time range from the drop down menu varying from 2 Minutes to Last 7 Days, or display All Data. To specify a time range, click Calendar button_calendar00112.gif.

trend_timerange00113.gif

By default, the time range end point is the current time. To change the time range end point, click Calendar button_calendar00114.gif and select a date and time from the calendar or enter the date and time in the text field using the following format: MMM dd, YYYY HH:MM. For example, Aug 21, 2011 12:24 PM.

Use the navigation arrows button_forwardback00115.gif to move forward or backward one time period. NOTE: The time period is determined by your selection from the Time Range drop-down menu.

Click Restore to Now to reset the time range end point to the current time.