Show HN: I made an AI agent that helps DevOps teams resolve incidents 60% faster

platops.ai

1 point by Bahushruth 5 hours ago

Hey HN,

We built PlatOps to help platform teams resolve production incidents faster using AI. Our AI agent either auto-resolves incidents using workflows learned from your past incidents or guides your on-call engineers through resolution steps.

A common problem we hear from SRE teams is that incident resolution is still largely manual and relies heavily on tribal knowledge. Engineers waste precious time gathering information from multiple sources during incidents, and learnings from past incidents often stay trapped in postmortem docs.

PlatOps solves this by:

1. Automatically gathering relevant information (logs, metrics, docs) when an incident occurs

2. Learning from your team's past incident resolutions to suggest or automate fixes

3. Providing AI-guided resolution steps when manual intervention is needed
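To make the three steps above concrete, here is a rough sketch of how an incoming alert could be matched against past incidents and routed to automation or guidance. This is purely illustrative and is not PlatOps's actual implementation: the names `PastIncident` and `triage`, the 0.8 threshold, and the word-overlap (Jaccard) similarity are all assumptions made for the example. A real system would likely use a far richer similarity measure.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass
class PastIncident:
    summary: str      # short description, e.g. taken from a postmortem
    resolution: str   # the fix that worked last time


def tokens(text: str) -> set[str]:
    """Lowercased word set; a stand-in for a real text-similarity model."""
    return set(text.lower().split())


def jaccard(a: set[str], b: set[str]) -> float:
    """Overlap between two token sets: 0.0 (disjoint) to 1.0 (identical)."""
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def triage(alert: str, history: list[PastIncident], auto_threshold: float = 0.8):
    """Return (mode, resolution), where mode is 'auto', 'guided', or 'manual'.

    The threshold is a hypothetical value chosen for illustration.
    """
    if not history:
        return ("manual", None)
    alert_tokens = tokens(alert)
    # Pick the past incident whose summary overlaps most with the alert.
    best = max(history, key=lambda p: jaccard(alert_tokens, tokens(p.summary)))
    score = jaccard(alert_tokens, tokens(best.summary))
    if score >= auto_threshold:
        return ("auto", best.resolution)      # close match: apply the known fix
    if score > 0.0:
        return ("guided", best.resolution)    # partial match: suggest to on-call
    return ("manual", None)                   # nothing similar on record
```

For example, with a history containing "disk full on db host" resolved by "rotate logs", an identical alert would be routed to `auto`, while "disk almost full on cache host" would only be offered as a guided suggestion.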

Teams using PlatOps have seen:

- 60% reduction in Mean Time To Recovery (MTTR)

- Faster onboarding of new on-call engineers

- Better knowledge retention from past incidents

We're looking for feedback from the HN community, especially from folks who manage production systems. We'd love to hear your thoughts on:

- Current pain points in incident management

- Features you'd like to see

- Integration suggestions

Try it out at https://docs.platops.ai

erdaniels 4 hours ago

Neat! Would this have a complete audit trail of every CRUD action taken, whether by the AI or a human?

  • Bahushruth 4 hours ago

    Thanks for asking! Yes, we maintain a comprehensive audit trail of all actions taken during incident resolution, whether they're performed by the AI agent or human engineers.
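As an illustration of what such a trail could record, here is a minimal sketch of an append-only audit log that tags each action as performed by the AI agent or a human. Everything in it is hypothetical (the `AuditEntry` fields, the `AuditTrail` class, the JSON Lines export); it does not describe PlatOps's real schema or API.

```python
import json
from dataclasses import asdict, dataclass
from datetime import datetime, timezone


@dataclass(frozen=True)
class AuditEntry:
    # All field names are assumptions made for this example.
    incident_id: str
    actor_type: str   # "ai" or "human"
    actor: str        # agent name or engineer's username
    action: str       # "create", "read", "update", or "delete"
    target: str       # resource the action touched
    timestamp: str    # UTC, ISO 8601


class AuditTrail:
    """Append-only, in-memory log; entries are immutable once recorded."""

    def __init__(self):
        self._entries = []

    def record(self, incident_id, actor_type, actor, action, target):
        if actor_type not in ("ai", "human"):
            raise ValueError(f"unknown actor_type: {actor_type!r}")
        entry = AuditEntry(
            incident_id=incident_id,
            actor_type=actor_type,
            actor=actor,
            action=action,
            target=target,
            timestamp=datetime.now(timezone.utc).isoformat(),
        )
        self._entries.append(entry)
        return entry

    def for_incident(self, incident_id):
        """All entries for one incident, in the order they were recorded."""
        return [e for e in self._entries if e.incident_id == incident_id]

    def to_jsonl(self):
        """Serialize the trail as JSON Lines, one entry per line."""
        return "\n".join(json.dumps(asdict(e)) for e in self._entries)
```

Recording both actor types in the same structure keeps AI and human actions on one timeline, which is what makes the trail useful during a postmortem.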