Question 1

Why does Prometheus use so much RAM?

Accepted Answer

Prometheus keeps all active time series in an in-memory TSDB head block. Each series uses ~2.5 KB for the index, chunk references, and WAL buffer. 100,000 series = ~250 MB just for the head. Add query cache, chunk cache, and WAL replay buffer and you can easily hit 2–4 GB on a busy cluster.

Question 2

How do I find my actual series count?

Accepted Answer

Run: `curl -s http://localhost:9090/metrics | grep prometheus_tsdb_head_series`. Or in Grafana: query `prometheus_tsdb_head_series` as a metric. kube-prometheus-stack exposes this by default.

Question 3

What is WAL and how much disk does it use?

Accepted Answer

The Write-Ahead Log (WAL) buffers the last ~2 hours of samples before compacting them into TSDB blocks. WAL size ≈ 2h of ingestion. At 10,000 series / 15s scrape = 667 samples/sec × 2 bytes × 7,200 sec ≈ 10 MB. WAL is on the same PVC as TSDB data.

Question 4

Should I use Victoria Metrics instead of Prometheus?

Accepted Answer

For clusters with >500,000 series or retention >90 days, Victoria Metrics is 3–5× more memory-efficient and has better query performance. For standard clusters (10K–200K series), Prometheus with the kube-prometheus-stack is simpler to operate.

Prometheus Storage Calculator

How Prometheus Storage Works

Memory Model

Disk Model

Retention vs Remote Storage

Reducing Series Count

Key Terms

Frequently Asked Questions

Why does Prometheus use so much RAM?

How do I find my actual series count?

What is WAL and how much disk does it use?

Should I use Victoria Metrics instead of Prometheus?

Related Tools

Related Generators

Related Comparisons

Related Guides

Kubernetes Monitoring Stack Guide: Prometheus, Loki, Grafana, and Tempo