
Recover from OOM

OutOfMemoryError (OOM) crashes occur when the JVM exhausts its available heap; the Linux OOM killer can also terminate the process when the host runs out of physical memory. This playbook covers immediate recovery and prevention.


Symptoms

  • Cassandra process terminated unexpectedly
  • OutOfMemoryError in system.log
  • Linux OOM killer messages in dmesg
  • Node shown as DN (down) in nodetool status from other nodes

Immediate Recovery

Step 1: Confirm OOM Was the Cause

# Check dmesg for OOM killer
dmesg | grep -i "killed process\|oom"

# Check Cassandra logs
grep -i "outofmemory\|heap space\|gc overhead" /var/log/cassandra/system.log | tail -20

Step 2: Check Node Status

# Is process running?
ps aux | grep cassandra

# Service status
systemctl status cassandra

Step 3: Restart Cassandra

sudo systemctl start cassandra

# Monitor startup
tail -f /var/log/cassandra/system.log

Step 4: Verify Node Rejoins

# From another node
nodetool status

# Wait for UN status
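
One way to wait for the UN (Up/Normal) state is a simple poll; the IP below is a placeholder for the restarted node's address:

# Poll until the restarted node reports UN (replace 10.0.0.1 with its address)
until nodetool status | grep "10.0.0.1" | grep -q "^UN"; do
  echo "Waiting for node to rejoin..."
  sleep 10
done
echo "Node is UN"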

Diagnosis

Identify OOM Cause

Check the logs for what was happening just before the OOM:

# Look at activity before crash
grep -B 50 "OutOfMemory" /var/log/cassandra/system.log | head -100

# Common patterns:
# - Large partition reads
# - Compaction
# - Repair
# - Batch operations

Check Heap Configuration

# Current settings (on 4.x the heap flags may live in the jvm*-server.options files)
grep -E "^-Xm|^-XX:.*Heap" /etc/cassandra/jvm*.options
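
The files on disk may not match what the JVM was actually started with (for example when MAX_HEAP_SIZE is exported from cassandra-env.sh), so it is worth confirming the flags on the running process. A minimal sketch, assuming a single Cassandra process:

# Heap flags of the running JVM (only works while the process is up)
ps -o args= -p "$(pgrep -f CassandraDaemon)" | tr ' ' '\n' | grep -E '^-Xm[sx]'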

Check What Consumed Memory

If a heap dump exists:

# Location depends on configuration
ls -la /var/lib/cassandra/*.hprof

# Analyze with Eclipse MAT or similar tool
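
If no dump was written, a class histogram from the live JVM can still hint at what dominates the heap. A sketch, assuming the JDK tools are installed and the command is run as the Cassandra user:

# Top heap consumers by class (":live" forces a full GC first)
jmap -histo:live "$(pgrep -f CassandraDaemon)" | head -25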

Common OOM Causes and Fixes

Cause 1: Large Partition Read

Symptom: OOM during read operation

grep -i "large partition" /var/log/cassandra/system.log

Fix: See Large Partition Issues

Concurrent Reads

Concurrent reads are configured via concurrent_reads in cassandra.yaml and require a restart to change.
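
As a sketch, lowering the value in cassandra.yaml caps how many reads can hold heap at once (32 is the usual default; the value below is only an example, and the node must be restarted):

# cassandra.yaml — fewer simultaneous reads means less heap held by in-flight reads
concurrent_reads: 16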

Cause 2: Compaction

Symptom: OOM during compaction

grep -i "compacting" /var/log/cassandra/system.log | tail -20

Fix:

# Warn when compaction hits large partitions (pre-4.1 syntax; see the 4.1+ form below)
# In cassandra.yaml
compaction_large_partition_warning_threshold_mb: 100

# Reduce concurrent compactors
nodetool setconcurrentcompactors 1
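
It can also help to watch and throttle compaction while the node recovers; both commands below are standard nodetool operations, and the 16 MB/s figure is only an example:

# What is compacting now, and how much is pending
nodetool compactionstats

# Throttle compaction throughput in MB/s (0 disables throttling)
nodetool setcompactionthroughput 16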

Cause 3: Repair

Symptom: OOM during repair (Merkle tree building)

Fix:

# Reduce repair scope
nodetool repair -pr my_keyspace my_table  # One table at a time

# Reduce repair memory usage
# In cassandra.yaml (4.1+ syntax)
repair_session_space: 256MiB
concurrent_merkle_tree_requests: 2
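
If a per-table repair still OOMs, subrange repair further limits how much data each Merkle tree covers. The token values below are placeholders; derive real ranges from nodetool ring:

# Repair a single token subrange of one table (placeholder tokens)
nodetool repair -st 0 -et 3074457345618258602 my_keyspace my_table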

Cause 4: Batch Operations

Symptom: OOM during large batch writes

Fix:

  • Reduce batch sizes in the application
  • Use unlogged batches for non-atomic operations (see the sketch after this list)
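
A minimal sketch of the second point, wrapped in cqlsh and using a hypothetical metrics.readings table; an unlogged batch skips the batchlog and should only group rows for the same partition:

# Group writes to one partition in a small unlogged batch (hypothetical table)
cqlsh -e "
BEGIN UNLOGGED BATCH
  INSERT INTO metrics.readings (device_id, ts, value) VALUES ('dev-1', toTimestamp(now()), 1.0);
  INSERT INTO metrics.readings (device_id, ts, value) VALUES ('dev-1', toTimestamp(now()), 1.2);
APPLY BATCH;"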

Cause 5: Heap Too Small

Symptom: Frequent OOMs under normal load

Fix:

# In jvm.options
-Xms8G
-Xmx8G

Cause 6: Heap Too Large

Symptom: OOM after long GC pauses

Fix: Counterintuitively, reducing heap can help:

# In jvm.options
-Xms8G
-Xmx8G
# Generally don't exceed 16GB

Prevention Configuration

jvm.options

# Heap sizing (8GB is often optimal)
-Xms8G
-Xmx8G

# Heap dump on OOM (for analysis)
-XX:+HeapDumpOnOutOfMemoryError
-XX:HeapDumpPath=/var/lib/cassandra/

# Exit on OOM (allows systemd to restart)
-XX:+ExitOnOutOfMemoryError

# Or crash on OOM (creates core dump)
# -XX:+CrashOnOutOfMemoryError

cassandra.yaml

# Limit compaction memory
compaction_large_partition_warning_threshold: 100MiB

# Limit repair memory (4.1+ syntax)
repair_session_space: 256MiB
concurrent_merkle_tree_requests: 2

# Limit memtable size (4.1+ syntax)
memtable_heap_space: 2048MiB
memtable_offheap_space: 2048MiB

Setting Name Changes

In Cassandra 4.1+, settings use size literals (e.g., 256MiB) instead of _in_mb suffixes.
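
On 4.0+ the effective value of a setting can be confirmed at runtime through the settings virtual table; the example below assumes the 4.1+ setting name:

# Check the effective value of a setting (4.0+ virtual table)
cqlsh -e "SELECT name, value FROM system_views.settings WHERE name = 'repair_session_space';"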


Monitoring for OOM Prevention

Key Metrics to Watch

| Metric        | Warning           | Critical |
|---------------|-------------------|----------|
| Heap usage    | > 70%             | > 85%    |
| GC pause time | > 500ms           | > 2000ms |
| GC frequency  | Every few seconds | Constant |

Monitoring Commands

# Watch heap usage
watch -n 10 'nodetool info | grep -i heap'

# Watch GC
watch -n 30 'nodetool gcstats'

Alert Configuration

Set alerts for:

  • Heap usage > 80%
  • GC time > 1 second
  • OOM errors in logs
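
A minimal cron-able sketch of the first alert, parsing nodetool info; the 80% threshold and the logger destination are placeholders for your alerting hook:

#!/usr/bin/env bash
# Alert when heap usage exceeds 80% of the configured maximum
read -r used max < <(nodetool info | awk -F: '/^Heap Memory/ {print $2}' | awk -F/ '{print $1, $2}')
pct=$(awk -v u="$used" -v m="$max" 'BEGIN {printf "%d", u / m * 100}')
if [ "$pct" -gt 80 ]; then
  echo "Cassandra heap usage at ${pct}%" | logger -t cassandra-heap-alert
fi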


Recovery Verification

Verify Node Health

# Node status
nodetool status

# Heap usage after restart
nodetool info | grep Heap

# GC behavior
nodetool gcstats

Verify Data Consistency

# After OOM, consider running repair
nodetool repair -pr my_keyspace

Emergency Procedures

If Node Won't Start After OOM

# Check for corrupted commitlog
ls -la /var/lib/cassandra/commitlog/

# If suspected corruption, can skip commitlog (DATA LOSS!)
# Only as last resort:
# sudo rm /var/lib/cassandra/commitlog/*
# Then start Cassandra
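
A slightly safer variant of that last resort is to move the segments aside rather than delete them, so they can be inspected or restored later (paths assume the default layout; this still loses any unflushed writes):

# Move, don't delete, the commitlog segments before restarting
sudo mkdir -p /var/lib/cassandra/commitlog.bak
sudo mv /var/lib/cassandra/commitlog/*.log /var/lib/cassandra/commitlog.bak/
sudo systemctl start cassandra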

If Multiple Nodes OOM

# Start nodes one at a time
# Allow each to fully join before starting next
# May indicate cluster-wide issue (bad query, data model)
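
A rolling-start sketch of the first two points; the host names are placeholders, SSH access is assumed, and each node is given time to come fully up before the next is started:

# Hypothetical host list — start one node at a time
for host in cassandra-1 cassandra-2 cassandra-3; do
  ssh "$host" sudo systemctl start cassandra
  # Wait until the node reports native transport (CQL) as active
  until ssh "$host" nodetool info 2>/dev/null | grep -q "Native Transport active.*true"; do
    sleep 15
  done
done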

Best Practices

| Practice               | Implementation                  |
|------------------------|---------------------------------|
| Set heap appropriately | 8GB for most workloads          |
| Enable OOM handling    | -XX:+ExitOnOutOfMemoryError     |
| Generate heap dumps    | -XX:+HeapDumpOnOutOfMemoryError |
| Monitor heap usage     | Alert at 75%                    |
| Limit partition sizes  | Design for < 100MB              |
| Regular compaction     | Keep SSTable count low          |

| Problem          | Playbook               |
|------------------|------------------------|
| Large partitions | Large Partition Issues |
| GC problems      | GC Pause Issues        |
| General memory   | High Memory Usage      |
| Disk full        | Handle Full Disk       |

| Command                          | Purpose                          |
|----------------------------------|----------------------------------|
| nodetool info                    | Check heap usage                 |
| nodetool gcstats                 | GC statistics                    |
| nodetool flush                   | Flush memtables to reduce memory |
| nodetool setconcurrentcompactors | Reduce compaction parallelism    |