Skip to main content
Huge Networks
Auto Scaling

Automatically scale your resources based on demand

Intelligently and automatically scale your infrastructure. Auto-scaling monitors your applications and adjusts capacity to maintain stable performance at the lowest cost.

Benefits

Why use Auto-scaling?

Benefits that transform your infrastructure management

Smart Savings

Pay only for the resources you really need. Reduce costs by scaling down during periods of low demand.

Guaranteed Performance

Keep your application responsive during traffic spikes. New instances are automatically provisioned.

Zero Manual Intervention

Eliminate the need for 24/7 monitoring. Configure once and let the system work for you.

How It Works

How It Works in 4 Steps

Simple and automated process to scale your infrastructure

1. Define Rules

Configure metrics such as CPU, memory, or requests per second.

2. Set Limits

Determine the minimum and maximum number of instances for your group.

3. Automate Actions

The system adds or removes VMs based on defined conditions.

4. Monitor Health

Health checks ensure that only healthy instances receive traffic.

Metrics

Metrics-Based Policies

  • CPU Usage
  • Memory Consumption
  • Network Traffic
  • Custom metrics via API
Metrics-Based Policies
Bidirectional Scalability
Flexible

Bidirectional Scalability

  • Scale-out: add instances when needed.
  • Scale-in: remove idle instances automatically.
  • Cooldown periods to avoid flapping.
Schedulable

Time-Based Scheduling

  • Scale at specific times for planned events.
  • Prepare your infrastructure for known demand peaks.
  • Turn off dev/test environments outside business hours.
Time-Based Scheduling
Use Cases

Common Use Cases

Ideal applications for auto-scaling

E-commerce

Web Applications

Data Processing

Dev/Test Environments

Results

Performance in Numbers

Proven results of savings and efficiency

60%

Cost Reduction

99.99%

Availability

<30s

Provisioning Time

Configuration

Simple Configuration in YAML

Example of auto-scaling configuration

auto-scaling-config.yaml
Auto-scaling Group:
  Min Instances: 2
  Max Instances: 10
  Desired Capacity: 3

Scale-out Policy:
  Metric: CPU > 75%
  Duration: 5 minutes
  Action: Add 2 instances
  Cooldown: 300 seconds

Scale-in Policy:
  Metric: CPU < 30%
  Duration: 10 minutes
  Action: Remove 1 instance
  Cooldown: 300 seconds
Get Started

Create your VM and use auto-scaling today

Configure your first Auto-scaling group in minutes through the console or API.

Elastic infrastructure

Capacity that follows your demand

Auto-scaling sizes your infrastructure intelligently and automatically, monitoring metrics such as CPU, memory and network traffic. When demand rises, new instances are provisioned; when it drops, idle instances are removed.

The result is smart savings — you pay only for the resources you need — with guaranteed performance during peaks and zero manual intervention. Set the rules once and let the system work for you.

FAQ

Frequently asked questions

What is Auto-scaling?

It is a capability that automatically adjusts the number of your application's instances based on demand, adding capacity during peaks and reducing it during quiet periods.

Which metrics trigger scaling?

You can base policies on CPU usage, memory consumption, network traffic and custom metrics via API, defining exactly when to scale.

Does scaling work in both directions?

Yes. Scale-out adds instances when needed and scale-in automatically removes idle instances, always within the minimum and maximum limits you define.

Can I schedule scaling for planned events?

Yes. In addition to metric-based policies, you can schedule scaling by time, provisioning capacity ahead of predictable events and peaks.

How does the system avoid capacity flapping?

Through cooldown periods between scaling actions, the system avoids excessively adding and removing instances in response to short load fluctuations.

Does Auto-scaling help reduce costs?

Yes. Since you pay only for the resources actually used and the system reduces capacity during low-demand periods, Auto-scaling optimizes costs without compromising performance.