Auto Scaling
noun
Definition
- 1. The automatic adjustment of the number of compute resources (for example, server instances) in response to demand, so that capacity is sufficient to maintain performance without paying for idle resources.
Example
Auto Scaling is used to increase the number of instances when model inference requests peak.
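The scaling decision in the example can be illustrated with a minimal sketch. It assumes a simple rule in which the instance count is the current request rate divided by a per-instance target, rounded up and clamped to a minimum and maximum. The function name, parameters, and numbers are hypothetical and do not correspond to any particular cloud provider's API.

```python
import math


def desired_instances(total_rps, target_rps_per_instance,
                      min_instances=1, max_instances=10):
    """Return how many instances are needed to serve total_rps.

    Hypothetical rule: divide the total request rate by the rate one
    instance should handle, round up, then clamp to the allowed range.
    """
    if target_rps_per_instance <= 0:
        raise ValueError("target_rps_per_instance must be positive")
    if min_instances > max_instances:
        raise ValueError("min_instances must not exceed max_instances")

    needed = math.ceil(total_rps / target_rps_per_instance)
    return max(min_instances, min(max_instances, needed))


# Peak inference traffic: 450 requests/s at 100 requests/s per instance.
print(desired_instances(450, 100))   # 5
# Idle period: the count never drops below the minimum.
print(desired_instances(0, 100))     # 1
# Extreme spike: the count is capped at the maximum.
print(desired_instances(5000, 100))  # 10
```

The minimum keeps the service available when traffic is low, and the maximum bounds cost during spikes. Real auto scaling services typically add cooldown periods and metric smoothing, which this sketch omits.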