Efficiently Monitor and Scale Your Workspace Gateways
We’re excited to announce the availability of workspace gateway metrics and autoscale in Azure API Management, offering both real-time insights and automated scaling for your gateway infrastructure. This combination increases reliability, streamlines operations, and boosts cost efficiency.
Monitor and Scale Gateway with New Metrics
API Management workspace gateways now support two metrics:
- CPU Utilization (%): Represents CPU utilization across workspace gateway units.
- Memory Utilization (%): Represents memory utilization across workspace gateway units.
Use both metrics together to make informed scaling decisions. For instance, if either metric consistently exceeds a 70% threshold, adding a gateway unit to distribute the load can help prevent outages during traffic increases. In most workloads, the CPU metric determines scaling requirements.
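The threshold guidance above can be sketched as a small check. This is a minimal illustration, not part of API Management: the function names, the sample format (a list of percentages per metric), and the reading of "consistently" as "every sample in the observed window" are assumptions. Only the 70% threshold comes from the text.

```python
from typing import Sequence

THRESHOLD_PERCENT = 70.0  # example threshold from the text above


def consistently_above(samples: Sequence[float], threshold: float) -> bool:
    """True when the window is non-empty and every sample exceeds the threshold.

    Assumption: "consistently" means all samples in the window.
    """
    return len(samples) > 0 and all(s > threshold for s in samples)


def should_add_unit(cpu: Sequence[float], memory: Sequence[float],
                    threshold: float = THRESHOLD_PERCENT) -> bool:
    """Suggest adding a gateway unit when either metric stays above the threshold."""
    return consistently_above(cpu, threshold) or consistently_above(memory, threshold)
```

For example, `should_add_unit([82, 76, 91], [40, 42, 39])` returns `True` because every CPU sample is above 70, while `should_add_unit([82, 55, 91], [40, 42, 39])` returns `False` because one CPU sample dips below the threshold and memory stays low.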
Automatically Scale Workspace Gateways
In addition to manual scaling, Azure API Management workspace gateways now support autoscale, which scales gateways in or out automatically based on metrics or a defined schedule.
Autoscale provides several important benefits:
- Reliability: Autoscale ensures consistent performance by scaling out during periods of high traffic.
- Operational Efficiency: Automating scaling streamlines operations and eliminates manual, error-prone intervention.
- Cost Optimization: Autoscale scales in when traffic is lower, reducing unnecessary expenses.
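To make the scale-out and scale-in behavior concrete, the following sketch models one evaluation step of a metric-based rule. It is a simplified illustration, not the Azure autoscale engine: the `AutoscaleRule` type, the 30% scale-in threshold, and the unit limits are hypothetical values chosen for the example, and real autoscale settings include further details, such as cooldown periods and schedules, that are omitted here.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class AutoscaleRule:
    """Hypothetical rule shape; values are illustrative, not service defaults."""
    scale_out_above: float = 70.0  # percent, example threshold from this post
    scale_in_below: float = 30.0   # percent, assumed for this sketch
    min_units: int = 1
    max_units: int = 5


def next_unit_count(current: int, cpu_avg: float, mem_avg: float,
                    rule: AutoscaleRule = AutoscaleRule()) -> int:
    """Return the gateway unit count after one evaluation of the rule."""
    busiest = max(cpu_avg, mem_avg)
    if busiest > rule.scale_out_above:
        # Scale out by one unit, capped at the configured maximum.
        return min(current + 1, rule.max_units)
    if busiest < rule.scale_in_below:
        # Scale in only when both metrics are low, floored at the minimum.
        return max(current - 1, rule.min_units)
    return current
```

Because the rule looks at the busier of the two metrics, a single hot metric is enough to scale out, but both must be low before a unit is removed. The minimum and maximum keep the count within a fixed range regardless of load.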
Access Metrics and Autoscale Settings
You can access the new metrics on the “Metrics” page of your workspace gateway resource in the Azure portal or through Azure Monitor. You can configure autoscale on the “Autoscale” page of your workspace gateway resource in the Azure portal or through the Azure Monitor autoscale experience.
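For programmatic access through Azure Monitor, one option is the metrics endpoint of the Azure Monitor REST API. The sketch below only builds the request URL; it does not send the request or handle authentication. The function name `metrics_url` is hypothetical, and the metric names used in the usage note are placeholders rather than confirmed identifiers, so look up the actual names for your resource (for example, through Azure Monitor's metric definitions) before relying on them.

```python
from typing import Sequence
from urllib.parse import urlencode

ARM_ENDPOINT = "https://management.azure.com"


def metrics_url(resource_id: str, metric_names: Sequence[str],
                timespan: str, interval: str = "PT5M") -> str:
    """Build an Azure Monitor metrics URL for a single resource.

    resource_id: full ARM resource ID of the gateway, starting with "/".
    metric_names: metric identifiers (placeholders in this sketch).
    timespan: ISO 8601 "start/end" range.
    interval: ISO 8601 duration used as the time grain.
    """
    query = urlencode({
        "api-version": "2018-01-01",
        "metricnames": ",".join(metric_names),
        "timespan": timespan,
        "interval": interval,
        "aggregation": "Average",
    })
    return f"{ARM_ENDPOINT}{resource_id}/providers/Microsoft.Insights/metrics?{query}"
```

A call such as `metrics_url(gateway_id, ["CpuUtilization", "MemoryUtilization"], "2025-05-19T00:00:00Z/2025-05-19T01:00:00Z")`, where `gateway_id` is your workspace gateway's resource ID and both metric names are placeholders, yields a URL that can be requested with a bearer token for Azure Resource Manager.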
Get Started
Updated May 19, 2025
Version 1.0
budzynski
Microsoft
Azure Integration Services Blog