var/log/metrics/metrics.log
file, then indexes them asynchronously in the system_monitor
system collection:
fusion.properties
file:
default.collectMetrics = false
default.collectMetricsIntervalSecs = 30
log-shipper.solrZk.connect
and log-shipper.metricsSolrCollection
properties.delete-old-system-metrics
Fusion task job, available in all apps.
id
Unique autogenerated document identifier.
node_s
Unique identifier of a Fusion node / server (autogenerated).
timestamp_tdt
Timestamp of a metric.
view_s
Type of a metric, either “host” or “service_instance”.
type_s
Store type of a metric, either “latest” or “history”.
cpu_load_d
Normalized CPU load, such as a floating number value in the range of [0.0,1.0]
.
cpu_sys_d
, cpu_user_d
, cpu_wait_d
, cpu_combined_d
and cpu_idle_d
Break down of CPU load per type. Those are also floating number values in the range of [0.0,1.0]
.
load_average_d
System load average for the last minute, not normalized.
processors_l
Number of CPU cores according to JVM.
memory_total_l
and memory_free_l
Total and free amounts of physical memory in bytes.
swap_total_l
and swap_free_l
Total and free swap in bytes.
disk_total_l
and disk_free_l
Disk sizes of a partition where Fusion is installed (where var
and data
folders reside).
host_uptime_l
Total uptime of a host operating system (in milliseconds).
agent_uptime_l
Uptime of Fusion agent service (in milliseconds).
os_name_s
, os_arch_s
and os_version_s
OS details according to JVM.
addresses_ss
List of IP addresses according to network configuration.
hostname_s
Main hostname or IP address of a server.
service_s
The name of the service (that is, api
, solr
, and so on) to which this metric pertains.
status_s
Status of the service according to Agent (that is, RUNNING
, STARTING
, and so on).
pid_i
Process ID.
address_s
IP address or hostname that is configured for this service to run on (or the default).
java_process_cpu_load_d
Normalized CPU load used by this service.
java_heap_max_l
, java_heap_used_l
and java_non_heap_used_l
JVM memory metrics.
java_open_file_descriptors_i
Number of open files according to JVM.
java_loaded_classes_i
and java_unloaded_classes_i
JVM class loading metrics, useful for spotting problems with dynamic redeployment of Web applications.
java_threads_i
Total JVM threads.
gc_collection_count_l
and gc_collection_time_l
GC metrics like number of invocations and total time spent.
jetty_request_time_mean_f
Mean request time according to Jetty.
jetty_threads_i
Number of Jetty threads
jetty_responses_5xx_l
, jetty_responses_4xx_l
, and so on
Number of responses per status.
solr_index_size_l
Total Solr index size in bytes hosted on a Solr node.
solr_docs_l
Total number of Solr documents hosted on a Solr node.
solr_requests_l
Total number of Solr requests to all cores on a Solr node.
zk_connections_i
Number of ZooKeeper connections to ZooKeeper node.
zk_znodes_l
Number of ZooKeeper nodes.
zk_watches_i
Number of Zk watches.
zk_ephemerals_i
Number of ephemeral ZooKeeper nodes.
zk_size_l
ZooKeeper size in bytes.
api_query_pipelines_http_one_minute_rate_f
, api_query_pipelines_http_mean_f
, and so on
Query pipeline metrics like rate of query requests to the HTTP endpoint or to Solr and mean response times.
api_index_pipelines_http_one_minute_rate_f
, api_index_pipelines_http_mean_f
, and so on
Index pipeline metrics like rate of index requests to the HTTP endpoint or to Solr and mean response times.
proxy_active_sessions_l
Number of active auth sessions.
proxy_sessions_one_minute_rate_f
Rate of new auth sessions per minute (per node). This metric is captured once per second, then presented as a moving average over the last minute.
system_metrics
. It is empty until you manually enable system_metrics
indexing; see below for instructions.
In version 4.2 and later, Fusion creates the collection when you enable system_metrics
indexing.
com.lucidworks.apollo.resources.serviceName.methodName.weighted.slow.examples
, where serviceName is the name of the service and methodName is the name of a valid method for that service.
This information might be helpful when diagnosing performance issues. Here is an example of the 5 slowest calls to the getCollectionMetrics method of the CollectionResource service:
mem.heap.used
: the current amount of heap memory, in bytes, used by the system.mem.heap.max
: the maximum amount of heap memory, in bytes, that the system could use.mem.heap.usage
: the percentage (0 - 1.0) of available heap memory that the system is currently using (this is equal to mem.heap.used
/ mem.heap.max
).mem.non-heap.used
: the current amount of non-heap memory (also called “off-heap memory”), in bytes, used by the system.mem.non-heap.max
: the maximum amount of non-heap memory, in bytes, that the system could use.mem.non-heap.usage
: the percentage (0 - 1.0) of available non-heap memory that the system is currently using (this is equal to mem.non-heap.used
/ mem.non-heap.max
).mem.total.used
: the current total amount of memory (heap plus non-heap), in bytes, used by the system.mem.total.max
: the maximum amount of total memory (heap plus non-heap), in bytes, that the system could use.mem.heap.used
:
stages.stageType.stageName.process
, where stageType is the type of stage, and stageName is the name of a specific stage.
Here is an example of a request to get the performance metrics for an index pipeline stage named ‘solr-default’ (stages.solr-index.solr-default.process
), which is included with Fusion:
stages.index.stageType.stage.stageName.ok
or stages.query.stageType.stage.stageName.ok
, depending on if the stage is part of an index pipeline or a query pipeline. Here is an example of the mean rates for successful runs of the ‘solr-default’ index pipeline stage (stages.index.solr-index.stage.solr-default.ok
):
stages.index.stageType.ok
(for an index pipeline) or stages.query.stageName.ok
(for a query pipeline). Here is an example, using the solr-index stage type:
com.lucidworks.apollo.resources.serviceName.methodName.weighted.timer
, or for a specific example, com.lucidworks.apollo.resources.CollectionResource.getCollectionMetrics.weighted.timer
:
solr.solrIdentifier.requestType
. The solrIdentifier is the address of the Solr instance, and the requestType can be ‘get-requests’, ‘post-requests’ or ‘put-requests’.
This example shows get-requests to a Solr instance that is found on ‘10.0.1.8’ and port 8983: