Your cloud cost tool says an instance is "underutilized." It recommends downsizing. You follow the advice. Two days later, your nightly ETL pipeline fails because the smaller instance can't handle the processing load.
This scenario plays out at organizations of every size. The problem isn't that the tool is broken — it's that average utilization tells you almost nothing about what a resource actually does.
Why Patterns, Not Averages
Consider two EC2 instances, both showing 15% average CPU over 30 days:
- Instance A: Flat 15% all day, every day. Genuinely over-provisioned.
- Instance B: 3% for 23 hours, then 92% for 1 hour during a nightly batch job.
Same average. Completely different optimization strategies. Instance A should be downsized. Instance B should not — or needs its batch workload moved to a different compute model.
This is why CLARITY classifies peak patterns, not just utilization levels. After analyzing resource behavior across AWS, Azure, and GCP, we've identified 6 distinct patterns that cover virtually every workload type.
1. Idle Clusters
Signature: Low average (<10%), low peaks (<20%), no periodic spikes.
These are genuinely unused resources. Dev environments left running after a sprint. Staging clusters nobody decommissioned. Test instances from a proof-of-concept six months ago.
Action: Terminate or downsize aggressively. But first, check one thing: if it's a burstable instance (t3, t4g on AWS), verify it's not accumulating CPU credits that another process depends on.
Cost impact: These are typically the easiest savings to capture — often 30-40% of an organization's total waste.
2. Maintenance Spikes
Signature: Low average (<15%), periodic high peaks (70-100%) at predictable times, duration under 30 minutes.
Cache flushes. Log rotation. Database vacuuming. Backup compression. Cron jobs that run on a schedule.
Action: Do not downsize. The low average is irrelevant — the maintenance task requires the full capacity. The correct optimization is to schedule these tasks during off-peak pricing windows if possible, or to move the workload to a spot instance that spins up just for the maintenance window.
Why tools get this wrong: A 30-day average completely smooths out a 20-minute spike. The peak becomes invisible in aggregated metrics.
3. Genuine Pressure
Signature: High average (>60%), frequent high peaks (>85%), sustained over multiple days.
This is real demand growth. The workload has outgrown its instance size.
Action: Upsize — but validate which dimension is the bottleneck first. CLARITY cross-references CPU with memory, network throughput, and IOPS to identify the actual constraint. Upsizing CPU when the bottleneck is memory is a common and expensive mistake.
4. Burstable Credit Risk
Signature: Low-to-moderate average (<30%), peaks that exceed baseline CPU allocation on burstable instances (t3, t4g, B-series on Azure).
This is the most insidious pattern. The instance appears healthy on dashboards. Average CPU looks "fine." But during peak hours, it's consuming CPU credits faster than it accumulates them. When credits run out, the instance throttles to baseline — causing latency spikes, timeouts, and degraded user experience.
Action: Either switch to a non-burstable instance type (m5, m6i) or enable unlimited burst mode (and accept the per-vCPU cost during burst periods). The cost of throttling usually exceeds the cost of a larger instance.
Why this is expensive to miss: The cost isn't on the compute bill. It's in SLA violations, degraded performance, and incident response hours.
Detect burstable credit risk automatically
CLARITY correlates CPU credits with peak patterns across all burstable instance families.
Start Free Trial5. Scheduled Batch Processing
Signature: Low average, very high peaks (>90%) at regular intervals (hourly, nightly, weekly).
ETL pipelines. Report generation. Data warehouse refreshes. ML training jobs. Any workload that runs on a schedule and demands full resources during its window.
Action: Right-sizing must target the peak, not the average. Better optimizations include moving batch workloads to spot instances, scheduling during reserved capacity windows, or using auto-scaling groups that scale up before the known batch window.
6. Deployment Spikes
Signature: Normal utilization with brief, irregular spikes to 100% that correlate with CI/CD deployment timestamps.
Container image pulls. Application restarts. Health check storms. Blue-green deployment switchovers. These are completely normal operational events.
Action: Ignore. These spikes are transient, expected, and non-indicative of resource needs. CLARITY filters them by correlating spike timestamps with CloudTrail deployment events, preventing false positive alerts.
Why this matters for alert fatigue: If your tool flags every deployment spike as anomalous, your team learns to ignore all alerts — including the real ones. This is one of the core reasons most FinOps dashboards produce unreliable data. For Kubernetes workloads specifically, peak misclassification directly impacts cost allocation accuracy.
How CLARITY Detects Them
CLARITY's peak analyzer runs after every cloud sync and classifies each resource into one of these 6 patterns using:
- Statistical analysis: Z-score deviation from baseline, week-over-week comparison
- Temporal correlation: Are peaks periodic? What's the interval? What's the duration?
- Cross-metric validation: Does CPU peak correlate with IOPS? Memory? Network?
- Event correlation: Do peaks align with known deployment or maintenance events?
- Credit monitoring: For burstable instances, are credits being consumed faster than accrued?
The classification feeds directly into CLARITY's recommendation engine. Every suggestion — downsize, upsize, terminate, or leave alone — is informed by the peak pattern, not just the average.
Putting It Into Practice
Next time you're reviewing right-sizing recommendations, ask one question for each resource:
"What does this resource's peak pattern look like?"
If your tool can't answer that, every recommendation it makes is based on incomplete data. And incomplete data leads to either wasted spend or broken production — sometimes both.
Peak intelligence built in
CLARITY classifies all 6 patterns automatically. Every recommendation is peak-aware and AI-validated.
Try CLARITY Free Or request a free cloud cost auditDid you find this article useful?