Skip to content

Metrics

This topic will describe the monitoring metrics in Nebula Dashboard.

Machine

Note

  • All the machine metrics listed below are for the Linux operating system.
  • The default unit for Disk and Network is byte. The unit changes with the data magnitude as the page displays. For example, when the flow is less than 1 KB/s, the unit is Bytes/s.
  • For all versions of Dashboard Enterprise Edition, the memory occupied by Buff and Cache will not be counted in the memory usage.

CPU

Parameter Description
cpu_utilization The percentage of used CPU.
cpu_idle The percentage of idled CPU.
cpu_wait The percentage of CPU waiting for IO operations.
cpu_user The percentage of CPU used by users.
cpu_system The percentage of CPU used by the system.

Memory

Parameter Description
memory_utilization The percentage of used memory.
memory_used The memory space used (not including caches).
memory_free The memory space available.

Load

Parameter Description
load_1m The average load of the system in the last 1 minute.
load_5m The average load of the system in the last 5 minutes.
load_15m The average load of the system in the last 15 minutes.

Disk

Parameter Description
disk_used The disk space used.
disk_free The disk space available.
disk_readbytes The number of bytes that the system reads in the disk per second.
disk_writebytes The number of bytes that the system writes in the disk per second.
disk_readiops The number of read queries that the disk receives per second.
disk_writeiops The number of write queries that the disk receives per second.
inode_utilization The percentage of used inode.

Network

Parameter Description
network_in_rate The number of bytes that the network card receives per second.
network_out_rate The number of bytes that the network card sends out per second.
network_in_errs The number of wrong bytes that the network card receives per second.
network_out_errs The number of wrong bytes that the network card sends out per second.
network_in_packets The number of data packages that the network card receives per second.
network_out_packets The number of data packages that the network card sends out per second.

Service

Period

The period is the time range of counting metrics. It currently supports 5 seconds, 60 seconds, 600 seconds, and 3600 seconds, which respectively represent the last 5 seconds, the last 1 minute, the last 10 minutes, and the last 1 hour.

Metric methods

Parameter Description
rate The average rate of operations per second in a period.
sum The sum of operations in the period.
avg The average latency in the cycle.
P75 The 75th percentile latency.
P95 The 95th percentile latency.
P99 The 99th percentile latency.
P999 The 99.9th percentile latency.

Graph

Parameter Description
num_active_queries The number of queries currently being executed.
num_active_sessions The number of currently active sessions.
num_aggregate_executors The number of executions for the Aggregation operator.
num_auth_failed_sessions_bad_username_password The number of sessions where authentication failed due to incorrect username and password.
num_auth_failed_sessions The number of sessions in which login authentication failed.
num_indexscan_executors The number of executions for index scan operators.
num_killed_queries The number of killed queries.
num_opened_sessions The number of sessions connected to the server.
num_queries The number of queries.
num_query_errors_leader_changes The number of raft leader changes due to query errors.
num_query_errors The number of query errors.
num_reclaimed_expired_sessions The number of expired sessions actively reclaimed by the server.
num_rpc_sent_to_metad_failed The number of failed RPC requests that the Graphd service sends to the Metad service.
num_rpc_sent_to_metad The number of RPC requests that the Graphd service sent to the Metad service.
num_rpc_sent_to_storaged_failed The number of failed RPC requests that the Graphd service sent to the Storaged service.
num_rpc_sent_to_storaged The number of RPC requests that the Graphd service sent to the Storaged service.
num_sentences The number of statements received by the Graphd service.
num_slow_queries The number of slow queries.
num_sort_executors The number of executions for the Sort operator.
optimizer_latency_us The latency of executing optimizer statements.
query_latency_us The average latency of queries.
slow_query_latency_us The average latency of slow queries.

Meta

Parameter Description
commit_log_latency_us The latency of committing logs in Raft.
commit_snapshot_latency_us The latency of committing snapshots in Raft.
heartbeat_latency_us The latency of heartbeats.
num_heartbeats The number of heartbeats.
num_raft_votes The number of votes in Raft.
transfer_leader_latency_us The latency of transferring the raft leader.

Storage

Parameter Description
add_edges_latency_us The average latency of adding edges.
add_vertices_latency_us The average latency of adding vertices.
commit_log_latency_us The latency of committing logs in Raft.
commit_snapshot_latency_us The latency of committing snapshots in Raft.
delete_edges_latency_us The average latency of deleting edges.
delete_vertices_latency_us The average latency of deleting vertices.
get_neighbors_latency_us The average latency of querying neighbor vertices.
num_edges_deleted The number of deleted edges.
num_edges_inserted The number of inserted edges.
num_raft_votes The number of votes in Raft.
num_rpc_sent_to_metad_failed The number of failed RPC requests that the Storaged service sent to the Metad service.
num_rpc_sent_to_metad The number of RPC requests that the Storage service sent to the Meta service.
num_tags_deleted The number of deleted tags.
num_vertices_deleted The number of deleted vertices.
num_vertices_inserted The number of inserted vertices.
transfer_leader_latency_us The latency of transferring the raft leader.

Graph space

Parameter Description
num_active_queries The number of queries currently being executed.
num_aggregate_executors The number of executions for the Aggregation operator.
num_indexscan_executors The number of executions for index scan operators.
num_killed_queries The number of killed queries.
num_queries The number of queries.
num_query_errors_leader_changes The number of raft leader changes due to query errors.
num_query_errors The number of query errors.
num_sentences The number of statements received by the Graphd service.
num_slow_queries The number of slow queries.
num_sort_executors The number of executions for the Sort operator.
optimizer_latency_us The latency of executing optimizer statements.
query_latency_us The average latency of queries.

Last update: October 18, 2022
Back to top