fix(k8s): raise schemas CPU request from 20m to 100m #840
The HPA was pinning the deployment at maxReplicas (4) even though aggregate CPU usage was low.
Diagnosis
The service is event-driven: pods sit at ~0-1m idle but spike to 100-300m per supergraph query. With requests.cpu=20m those bursts read as 500-1500% utilization, so the HPA's 60% target was constantly exceeded on whichever pod handled the request.
Observed per-pod samples (15s apart):
HPA status confirmed:
ScalingLimited TooManyReplicas + ScaleDownStabilized kept replicas pinned at 4.
Fix
Raise requests.cpu to 100m. Bursts now read as 100-300% instead of 500-1500%. Combined with the HPA's downscale stabilization window, this lets replicas settle back to minReplicas (2) between bursts.
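
For reviewers who want to check the arithmetic, here is a rough sketch of how the numbers above fall out. It assumes the documented HPA rule, desiredReplicas = ceil(currentReplicas * currentUtilization / targetUtilization), with utilization taken as usage divided by request and averaged across pods. It ignores the controller's tolerance band and the stabilization windows, and the function names and per-pod usage values are hypothetical illustrations, not the observed samples.

```python
import math


def utilization_pct(usage_m: float, request_m: float) -> float:
    """CPU usage as a percentage of the CPU request (both in millicores)."""
    return 100.0 * usage_m / request_m


def desired_replicas(current: int, usages_m: list[float], request_m: float,
                     target_pct: float, min_r: int, max_r: int) -> int:
    """Simplified HPA rule: ceil(current * avg_utilization / target), clamped.

    Assumption: every pod has the same request, so average utilization is
    total usage over total requests. Tolerance and stabilization are ignored.
    """
    avg_pct = utilization_pct(sum(usages_m), request_m * len(usages_m))
    raw = math.ceil(current * avg_pct / target_pct)
    return max(min_r, min(max_r, raw))


if __name__ == "__main__":
    # Per-pod view of a burst, before and after the change.
    for request in (20, 100):
        low = utilization_pct(100, request)
        high = utilization_pct(300, request)
        print(f"request={request}m: burst reads as {low:.0f}-{high:.0f}%")

    # Hypothetical snapshot: one pod bursting at 300m, three idle at 1m.
    burst = [300, 1, 1, 1]
    print(desired_replicas(4, burst, 20, 60, min_r=2, max_r=4))

    # Hypothetical idle snapshot at minReplicas with the new request.
    print(desired_replicas(2, [1, 1], 100, 60, min_r=2, max_r=4))
```

With the 20m request, a single bursting pod is enough to push the unclamped result far past maxReplicas, which matches the TooManyReplicas condition above. With the 100m request, idle periods compute well below the 60% target, so the deployment can drift back to minReplicas once the stabilization window has passed.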