Describe the bug
When the loki-backend instance that runs the compactor shuts down and comes back up again, we see the failure count increase in the loki_boltdb_shipper_compact_tables_operation_total metric.
To Reproduce
Steps to reproduce the behavior:
1. Start Loki (3.3.2) in scalable mode
2. Restart the pod running the compactor
3. Observe delta(loki_boltdb_shipper_compact_tables_operation_total{status="failure"}[5m]) via Prometheus (or query the metrics endpoint)
Expected behavior
When the loki-backend instance that runs the compactor restarts, we expect a graceful failover of the compactor. We expect loki_boltdb_shipper_compact_tables_operation_total to not count any failure in that case.
Environment:
Infrastructure: Kubernetes 1.30
Deployment tool: helm
Screenshots, Promtail config, or terminal output
Log lines when this happens:
loki-backend-2 - info: finished compacting table
loki-backend-2 - info: compacting table
loki-backend-2 - info: finished compacting table
loki-backend-1 - info: this instance has been chosen to run the compactor, starting compactor
loki-backend-1 - info: waiting 10m0s for ring to stay stable and previous compactions to finish before starting compactor
loki-backend-2 - info: compactor exiting
loki-backend-2 - info: waiting until compactor is JOINING in the ring
loki-backend-2 - info: compactor is JOINING in the ring
loki-backend-2 - info: waiting until compactor is ACTIVE in the ring
loki-backend-2 - info: compactor is ACTIVE in the ring
loki-backend-1 - info: this instance should no longer run the compactor, stopping compactor
loki-backend-1 - info: compactor stopped
loki-backend-1 - error: failed to run compaction - failed to list tables: RequestCanceled: request context canceled
caused by: context canceled
loki-backend-2 - info: this instance has been chosen to run the compactor, starting compactor
loki-backend-1 - info: compactor started
loki-backend-2 - info: waiting 10m0s for ring to stay stable and previous compactions to finish before starting compactor