Amazon’s cloud division on Tuesday unveiled DevOps Agent, an AI-powered tool aimed at helping clients diagnose and recover from system outages more efficiently.
The software leverages data from third-party monitoring platforms, including Datadog and Dynatrace, to predict the causes of technical issues.
AWS said the tool is available for preview starting Tuesday, with paid access to follow later.
AWS’s new AI outage tool is designed to help companies identify the causes of outages and implement fixes faster, Swami Sivasubramanian, AWS vice president of agentic AI, said.
The tool essentially automates tasks performed by site reliability engineers, who work to prevent downtime and respond to incidents in real time.
Startups like Resolve and Traversal are already offering AI assistants for SREs, while Microsoft’s Azure introduced its own SRE Agent in May.
“By the time the on-call ops team member dials in, they have an incident report with preliminary investigation of what could be the likely outcome, and then suggest what could be the remediation as well,” Sivasubramanian added.

