Container Request Right Sizing Recommendation API (v2)
The container request right sizing recommendation API provides recommendations for container resource requests based on configurable parameters, and estimates the savings from implementing those recommendations at the per-container, per-controller level. Note that if cluster-level resources stay static, applying these recommendations will likely not produce real savings until you also reduce your cluster resources. Until then, your idle allocation will increase instead.
The endpoint is available at
curl -G \
-d 'algorithmCPU=quantile' \
-d 'qCPU=0.95' \
-d 'algorithmRAM=max' \
-d 'targetCPUUtilization=0.8' \
-d 'targetRAMUtilization=0.8' \
-d 'window=3d' \
--data-urlencode 'filter=namespace:"kubecost"+container:"cost-model"' \
The "base" recommendation is calculated from the observed usage of each resource per unique container spec (e.g. a 2-replica, 3-container Deployment will have 3 recommendations: one for each container spec).
Say you have a single-container Deployment with two replicas: A and B.
- A's container had peak usages of 120 mCPU and 300 MiB of RAM.
- B's container had peak usages of 800 mCPU and 120 MiB of RAM.
The max algorithm recommendation for the Deployment's container will be 800 mCPU and 300 MiB of RAM. Overhead will be added to the base recommendation according to the target utilization parameters as described above.
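The worked example above can be sketched in a few lines of Python. This is an illustrative sketch only, not the API's implementation: the function name `max_recommendation` and the dictionary keys are hypothetical, and the overhead step assumes that the final request is the base recommendation divided by the target utilization (so a base of 800 mCPU at a target of 0.8 would become 1000 mCPU). Check the target utilization parameter descriptions for the authoritative formula.

```python
def max_recommendation(replica_peaks, target_cpu_util=0.8, target_ram_util=0.8):
    """Sketch of a 'max' recommendation for one container spec.

    replica_peaks: one dict per replica with that replica's peak usage,
    e.g. {"cpu_mcpu": 120, "ram_mib": 300}. Names are hypothetical.
    """
    # Base recommendation: the highest peak seen across all replicas,
    # taken independently for each resource.
    base_cpu = max(p["cpu_mcpu"] for p in replica_peaks)
    base_ram = max(p["ram_mib"] for p in replica_peaks)

    # ASSUMPTION: overhead is applied by dividing the base recommendation
    # by the target utilization ratio.
    return {
        "base_cpu_mcpu": base_cpu,
        "base_ram_mib": base_ram,
        "cpu_request_mcpu": base_cpu / target_cpu_util,
        "ram_request_mib": base_ram / target_ram_util,
    }


# Replicas A and B from the example above.
rec = max_recommendation(
    [
        {"cpu_mcpu": 120, "ram_mib": 300},  # replica A
        {"cpu_mcpu": 800, "ram_mib": 120},  # replica B
    ]
)
print(rec)
```

Note that the CPU peak comes from replica B and the RAM peak from replica A: each resource is maximized on its own, so the base recommendation does not have to match any single replica's observed usage.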