Chat now with support
Chat with Support

Foglight 6.3.0 - Performance Tuning Field Guide

Overview Hardware and Operating System Tuning Management Server Tuning Java Virtual Machine Tuning Backend Database Tuning High Availability (HA) Tuning Agent Tuning Store and Forward Processing Appendix: Performance Health Check Appendix: Analyzing a Support Bundle

Viewing the Settings that Control the Size of Stored Data

The default time window during which the host collects and stores data that will be accepted by the server after the connection is restored is one hour. This setting can be changed to a desired value using a VM option in a configuration file on the server side. The Foglight® Agent Manager keeps track of this time window and discards any messages that are collected and stored on the host outside of this time window. For more information about changing the length of the time window for store and forward processing, see Store and Forward Processing and the Foglight Management Server .

The size of the stored data is controlled with a set of parameters in the file <foglight_agent_manager_home>/state/default/config/fglam.config.xml. In that file, the <config:queue-sizes> element controls the amount of the stored data using the following attributes of its <config:upstream> and <config:downstream> sub-elements:

max-disk-space specifies the amount of disk space in KB that can be used to store messages. Zero ‘0’ means no messages are written to disk, minus one, ‘-1’, indicates no limit. The default is 1024, meaning that the amount of messages written to disk cannot exceed 1024 KB or 1 MB, including upstream and downstream connections.
max-batch-size specifies the number of messages that are sent from the queue to the server when a connection is established. Minus one, ‘-1’, indicates that every available message is included in each batch. The default is 500, meaning that a batch of 500 messages is sent to the server when the connection is restored, including upstream and downstream connections.

For example, the default configuration instructs the Foglight Agent Manager to reserve 1 MB of disk space for storing messages and to limit the number of messages in a batch to 500, for upstream connections:

NOTE: The max-queue-size attribute is used internally and should not be changed.
1
On the monitored host, using a file browser, navigate to the directory <fglam_home>/state/default/config, where fglam_home refers to the location of the Foglight Agent Manager installation.
2
Open the file fglam.config.xml.
The file contains the settings used by the Foglight Agent Manager, such as the monitored host name, the URL and port number for connecting to the Foglight Management Server, and many others. These settings are specified at the installation time. For more information about installing the Foglight Agent Manager, see the Agent Manager Guide. For additional information about a specific element, view the contents of its <config:documentation> sub-element. For more information see the contents of <fglam_home>/state/default/config/fglam.config.xml.
3
In the fglam.config.xml file, locate the <config:queue-sizes> element and view the attribute values of its <config:upstream> and <config:downstream> sub-elements.
<config:queue-sizes>
<config:upstream max-queue-size="-1" max-disk-space="1024" max-batch-
<config:downstream max-queue-size="-1" max-disk-space="1024" max-batch-
</config:queue-sizes>

Examples

This contains scenarios, each illustrating a set of common results typically found in store and forward processing:

Example 1: Reporting on Disk Usage

A Foglight® Management Server monitors a group of hosts, each using the Foglight Agent Manager to communicate with the server. The network between a monitored host and the server becomes unavailable at 4:55 PM, and is restored at 5:00 PM. Prior to the connection disruption, the monitored host’s disk usage was within a normal range.

At 4:55 PM, the connection with the server is lost, and the Agent Manager process continues to collect the data from the host, and to store it on disk.

At 4:57 PM, a high disk usage is reported. The Agent Manager stores this information on disk.

At 5:00 PM, the network connection between the Management Server and the Agent Manager is restored. The Agent Manager forwards the data stored between 4:55 and 5:00 PM to the Management Server. The increase of disk usage is reported at 5:00 PM, when the data arrived, not at 4:57, when it actually occurred.

Example 2: Calculating Service Availability Metrics

The SLA is calculated every minute for the current minute, and is stored as a time-driven derived metric. When the Foglight® Management Server accepts forwarded data, the SLA is not re-calculated using the forwarded data.

For example, the SLA calculations take into consideration the expected average response time. The Foglight Management Server monitors a host that becomes disconnected for three hours due to a network outage. The host uses the Foglight Agent Manager to communicate with the server. At the time the connection is lost, the Foglight Agent Manager continues to collect the data from the host, storing the collected data on disk, including the collected data, which now contains the average response metrics.

The Foglight Management Server is configured to accept data that is older than one hour, with the store-and-forward time widow set to eight hours. The server’s configuration file, <foglight_home>/config/server.config, was edited prior to the server startup, and the option foglight.data_service.max_past_timestamp_delta is set to 28,800,000 milliseconds (which equals to eight hours) as follows:

The Agent Manager is aware of this configuration and continues to collect and store the data past the one-hour default, without deleting any new messages that come in.

During the connection disruption, the host’s average response metrics change over time. After three hours, the connection between the host and the server is restored, and the Foglight Agent Manager forwards the stored metrics to the server, including the average response metrics. The Foglight Management Server persists the forwarded data into the repository. The Foglight Management Server does not make use of the forwarded data to re-calculate the SLA.

 

Related Documents

The document was helpful.

Select Rating

I easily found the information I needed.

Select Rating