Auto Scaling (AS)
Auto Scaling automatically adjusts compute resources based on scaling policies you define, striking a balance between performance and cost.
Enable your infrastructure to scale up or down automatically based on real-time demand. This elasticity ensures that your applications can handle sudden traffic spikes or unexpected usage patterns without any manual intervention.
Your applications are distributed across multiple servers or instances. In the event of a server failure or increased load, the system automatically launches new instances to maintain optimal performance.
Achieve peak performance and cost optimization for your cloud-based applications with our Auto Scaling solution. AS dynamically adjusts your infrastructure based on real-time demand, ensuring optimal resource allocation and eliminating unnecessary expenses.
Auto Scaling works with ELB to balance the load among instances in your Auto Scaling groups. It maintains high application availability by adding instances on demand and replacing unhealthy instances with new ones. It supports scheduled, periodic, and alarm-triggered scaling policies to help you confidently handle any changes in workload.
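To illustrate how an alarm-triggered policy might behave, here is a minimal Python sketch. It does not call the Auto Scaling API. The names `AlarmPolicy` and `desired_capacity`, the CPU thresholds, and the adjustment sizes are all hypothetical and chosen only for the example.

```python
from dataclasses import dataclass


@dataclass
class AlarmPolicy:
    """Hypothetical alarm policy: adjust capacity when a metric crosses a threshold."""
    metric_threshold: float  # for example, average CPU usage in percent
    adjustment: int          # instances to add (positive) or remove (negative)
    scale_out: bool          # True: trigger at or above threshold; False: at or below


def desired_capacity(current: int, metric: float, policies: list,
                     min_size: int, max_size: int) -> int:
    """Apply the first triggered policy and clamp the result to the group limits."""
    for policy in policies:
        if policy.scale_out:
            triggered = metric >= policy.metric_threshold
        else:
            triggered = metric <= policy.metric_threshold
        if triggered:
            return max(min_size, min(max_size, current + policy.adjustment))
    return current  # no alarm fired, keep the current size


# Assumed example values: add 2 instances at 70% CPU, remove 1 at 20% CPU.
policies = [
    AlarmPolicy(metric_threshold=70.0, adjustment=2, scale_out=True),
    AlarmPolicy(metric_threshold=20.0, adjustment=-1, scale_out=False),
]

busy = desired_capacity(3, 85.0, policies, min_size=1, max_size=4)
idle = desired_capacity(3, 10.0, policies, min_size=1, max_size=4)
steady = desired_capacity(3, 50.0, policies, min_size=1, max_size=4)
```

In this sketch the group limits always win: a policy that asks for more instances than `max_size` allows is capped rather than rejected.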
Lifecycle hooks allow your applications to respond to scaling events in your Auto Scaling groups. Graceful scale-in ensures that running tasks are complete before the instances terminate so services are not interrupted. Any time an instance is launched or terminated in your Auto Scaling groups, you will be notified.
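The graceful scale-in behavior described above can be pictured with a small Python simulation. This is a sketch under stated assumptions, not the service's implementation: the classes `Instance` and `Group`, the state name `Terminating:Wait`, and the notification strings are hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class Instance:
    name: str
    running_tasks: int = 0
    state: str = "InService"


@dataclass
class Group:
    instances: list = field(default_factory=list)
    notifications: list = field(default_factory=list)

    def _find(self, name: str) -> Instance:
        return next(i for i in self.instances if i.name == name)

    def scale_in(self, name: str) -> None:
        """Put the instance into a waiting state instead of terminating it at once."""
        inst = self._find(name)
        inst.state = "Terminating:Wait"  # hypothetical state name
        self.notifications.append(f"{name}: scale-in started")

    def complete_hook(self, name: str) -> bool:
        """Terminate the instance only after its running tasks have finished."""
        inst = self._find(name)
        if inst.state != "Terminating:Wait" or inst.running_tasks > 0:
            return False
        self.instances.remove(inst)
        self.notifications.append(f"{name}: terminated")
        return True


group = Group([Instance("web-1", running_tasks=2), Instance("web-2")])
group.scale_in("web-1")
first_try = group.complete_hook("web-1")   # tasks still running, so nothing is removed
group.instances[0].running_tasks = 0       # the tasks finish
second_try = group.complete_hook("web-1")  # now the instance can be terminated
```

The point of the waiting state is that termination becomes a two-step process, which gives the application a window to drain work before the instance disappears.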
During surges in demand, your website may need additional resources to maintain the desired performance. When demand slows and these additional resources are no longer needed, they can be released so that you stop paying for them. With Auto Scaling, you get an architecture that automatically scales up or down to maintain performance at the lowest possible cost, so that your business is prepared for changing demand.
Requests are distributed to instances deployed in OREL Cloud to isolate failures by design. You get the reliability of geographic redundancy. Auto Scaling can also detect unhealthy instances and replace them with new ones to help keep your applications available.
You can attach a load balancer to your Auto Scaling group to manage incoming requests and route traffic so that no single instance is overwhelmed.
Auto Scaling monitors the health of running instances in real time and automatically manages the launch and removal of instances on your behalf.
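As a rough picture of health-based replacement, the following Python sketch swaps each unhealthy instance for a newly launched one. The function `replace_unhealthy`, the `launch_instance` helper, and the instance names are hypothetical and stand in for whatever the service does internally.

```python
import itertools


def replace_unhealthy(instances: dict, launch) -> dict:
    """Return a new fleet in which every unhealthy instance is replaced by a fresh one."""
    result = {}
    for name, healthy in instances.items():
        if healthy:
            result[name] = True
        else:
            result[launch()] = True  # a new instance takes the place of the unhealthy one
    return result


_counter = itertools.count(1)


def launch_instance() -> str:
    """Hypothetical launcher that only generates a name for the new instance."""
    return f"replacement-{next(_counter)}"


fleet = {"web-1": True, "web-2": False, "web-3": True}
fleet = replace_unhealthy(fleet, launch_instance)
```

The fleet size stays the same before and after the check, which is the property that keeps capacity stable while failed instances are cycled out.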
You can schedule a batch of compute instances to be created automatically for tasks at a scheduled time and then released immediately when the tasks are complete. During the process, the number of compute instances can also be adjusted automatically based on the size of the workload over time.
You can manually change the size of your Auto Scaling group or configure scheduled, periodic, or alarm-triggered scaling policies to have the group scaled automatically.
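A scheduled or periodic policy can be thought of as a time window mapped to a target group size. The Python sketch below assumes a daily recurrence. `ScheduledPolicy`, `capacity_at`, and the example hours and capacities are hypothetical, not part of the Auto Scaling API.

```python
from dataclasses import dataclass
from datetime import time


@dataclass
class ScheduledPolicy:
    """Hypothetical daily window during which the group should hold a given size."""
    start: time
    end: time
    capacity: int


def capacity_at(now: time, policies: list, default: int) -> int:
    """Return the capacity of the first window containing `now`, else the default."""
    for policy in policies:
        if policy.start <= now < policy.end:
            return policy.capacity
    return default


# Assumed example: hold 6 instances during business hours, 2 otherwise.
policies = [ScheduledPolicy(start=time(9, 0), end=time(18, 0), capacity=6)]

daytime = capacity_at(time(10, 30), policies, default=2)
night = capacity_at(time(23, 0), policies, default=2)
```

Manual changes and alarm-triggered policies would adjust the same group size by other means. This sketch covers only the time-based case.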
Auto Scaling adds instances only when needed, and can scale across purchase options to optimize performance and cost. You pay only for what you use.
Auto Scaling allows you to deploy compute tasks across multiple instance types to enhance performance.
This document describes basic concepts, functions, key terms, and FAQs of the Auto Scaling (AS) service and provides instructions for quickly creating AS groups and using the AS....
This document describes the application programming interfaces (APIs) of Auto Scaling (AS) and provides parameter descriptions and example values for each API.