Metrics dictionary

This dictionary defines metrics created by and specific to Chronosphere. These metrics are often included in default dashboards, but you can search for them anywhere you use metrics. The Chronosphere curated metrics can help track basic important information about your Chronosphere service.

The Chronosphere Health Check dashboard includes links to the Collectors, Usage Dashboard, and Licensing Information dashboards.

Query these metrics as their respective Prometheus type.

Licensing information

The Licensing Information dashboard includes the following metrics. Use this dashboard to monitor quotas for your environment.

Metric nameMetric typeDescriptionTags provided during dashboard creation
limit_service_cardinality_countCounterCurrent cardinality count across all Collectors.chronosphere_service
limit_service_licensed_cardinality_limitCounterCurrent cardinality limit across all Collectors.chronosphere_service
limit_service_licensed_persist_limitCounterCurrent limit for data points persisted in the database across all
Collectors, as defined in the contract.
chronosphere_service
limit_service_capacity_limitCounterCurrent capacity limit for data points persisted in the database across all Collectors, based on grant by Chronosphere.chronosphere_service
limit_service_licensed_processing_limitCounterCurrent limit for processed data points across all Collectors.chronosphere_service
limit_service_persisted_countCounterTotal number of data points persisted in database.chronosphere_service
limit_service_processed_countCounterCurrent count of processed data points across all Collectors.chronosphere_service

Collectors

The Collectors dashboard includes the following metrics that Collectors generate. Use this dashboard to monitor the health of your Collectors.

Metric nameMetric typeDescriptionTags provided during dashboard creation
chronocollector_build_informationGaugeMetrics relating to current build of Collectors.branch
build_date
build_version
chronosphere_k8s_cluster
chronosphere_k8s_container_port
chronosphere_k8s_namespace
cluster
go_version
hostname
instance
job
k8s_cluster_id
pod_name
namespace
region
revision
service
chronocollector_gateway_push_errorsCounterCurrent total number of push errors from Collector.chronosphere_k8s_cluster
chronosphere_k8s_container_port
chronosphere_k8s_namespace
component
environment
hostname
instance
job
k8s_cluster_id
namespace
region
service
chronocollector_gateway_push_latencySummaryLatency of pushed writes by Collector.chronosphere_k8s_cluster
chronosphere_k8s_container_port
chronosphere_k8s_namespace
component
environment
instance
job
k8s_cluster_id
namespace
quantile
region
service
chronocollector_gateway_push_successCounterTotal number of metrics successfully pushed to
the Chronosphere gateway.
annotationsPrefix
cluster
component
env
environment
instance
job
node
region
service
service_account
team
version
chronocollector_gateway_write_successCounterTotal number of metrics successfully written to
the Chronosphere gateway.
annotationsPrefix
cluster
component
env
environment
instance
job
node
region
service
service_account
team
version
chronocollector_k8s_gatherer_processor_targets_activeGaugeCurrent number of active targets Collector is scraping.environment
instance
job
k8s_cluster_id
namespace
region
service
process_cpu_seconds_totalCounterCurrent total number of seconds of CPU processing time.environment
instance
job
k8s_cluster_id
namespace
node
region
service

Query overview

The Chronosphere Query Overview dashboard includes the following metrics. Use this dashboard to identify resource-intensive alert or recording groups.

Metric nameMetric typeDescriptionTags provided during dashboard creation
permits_quotaCounterAmount of resources used associated to querying time series.chronosphere_k8s_namespace
endpoint
instance
job
permit
pod_name
source
permits_throttledCounterAmount of throttling applied to queries.chronosphere_k8s_namespace
endpoint
instance
job
permit
pod_name
source
permits_wait_totalCounterAmount of time spent waiting to access querying resources.chronosphere_k8s_cluster
chronosphere_k8s_namespace
endpoint
instance
job
permit
pod_name
source
prometheus_rule_group_last_duration_secondsHistogramThe total time the group took to complete its last iteration, in seconds.chronosphere_k8s_cluster
chronosphere_k8s_namespace
instance
job
pod_name
rule_group

Policy statistics

The following usage metrics apply to policy statistics.

Metric nameMetric typeDescriptionTags provided during dashboard creation
chrono_policies_countCounterTracks actions for ingestion policies, grouped by the name of the policy.dropped
policy_name
type
chrono_policies_totalCounterTracks actions for ingestion policies with any naming policy.dropped
policy_name
type

Shaping usage statistics

The following usage metrics apply to shaping statistics.

Metric nameMetric typeDescriptionTags provided during dashboard creation
chrono_poolstats_countCounterShaping statistics that include pool information.drop_reason
dropped
type
chrono_poolstats_totalCounterTotal shaping statistics without any tag information.drop_reason
dropped
type
chrono_poolstats_samplingCounterEmitted only when the number of unique usage statistics values
exceeds the configured maximum allowed tags.
node
type

Usage statistics

The Usage Dashboard includes the following usage statistics metrics. Use this dashboard to identify who is contributing most to your Chronosphere usage and manage your overall usage.

Metric nameMetric typeDescriptionTags provided during dashboard creation
chrono_usagestats_countCounterUsage statistics grouped by tags.drop_reason
dropped
type
chrono_usagestats_totalCounterTotal usage statistics without any grouping.drop_reason
dropped
type
chrono_usagestats_count_samplingCounterEmitted only when the number of unique usage statistics values
exceeds the configured maximum allowed tags.
node
type

Other usage statistics count label and metric name usage.

Metric nameMetric typeDescription
chrono_datapoints_by_metric_per_secondGaugeContains the metric_name label. Emits the average data points per second over the last two minutes by metric name.
chrono_datapoints_by_label_per_secondGaugeContains the label_name label. Emits the average data points per second over the last two minutes by label name.
chrono_unique_label_values_countGaugeContains the label_name label. Emits the unique values seen over the last two minutes, by label name.