Garry Tan

DrDroid - AI teammate for On Call engineers

DrDroid is your AI agent for production incidents—automating triaging, troubleshooting, and remediation. Integrates with 50+ tools including Datadog, Grafana, Kubernetes, Cloud Providers to help engineers resolve issues faster and save hours every week.

Add a comment

Replies

Best
Siddarth Jain

Hey Product Hunt community! 👋

When we first pitched to YC in 2022, we knew production on-call was a pain for engineering teams. We started working on an Open Source framework for SREs to implement self-healing using runbook automation. We got some amazing early adopters like Platform team at Palo Alto Networks.

But this was a small portion of the overall on-call and the bigger pain always had been the small minor new issues that takes hours for the team. Escalations would happen often because of lack of context of how Kafka or Redis or Airflow works.


Today, we're thrilled to introduce DrDroid to you all. DrDroid is an AI agent designed to fix production issues, with an extra competence to solve cascading infrastructure and container issues for teams.

🧠 Automate triaging and debugging: Let the AI agent analyze alerts, diagnose your observability data and give you recommendations on how to fix issues.

🔄 Integrate seamlessly: Connect with 50+ monitoring tools, CI/CD tools, logging tools, containers (k8s clusters, VMs), databases, and more.

🔒 Maintain control: DrDroid operates with read-only access by default, ensuring safety and compliance. If you have integrations behind your VPC, you can deploy our proxy service on any of your machine/cluster and connect the platform to those integrations.

We've worked with some amazing platform teams over the last few months & helped them productionize the use of DrDroid. We're currently in public beta and have simplified the integration process:

  1. Integrate with just Slack and start testing out the hypothesis that our AI agent makes on your alerts.

  2. If you like it, then decide which additional data integrations to add depending on the kind of issues where you want your teammates to get help.

We have a free forever plan for engineers wanting to play around. For Teams plan, signing up before 31st May, we'll give you an extended 2 months of free trial instead of the current 15 days. Check it out here: https://drdroid.io/

Masum Parvej

@sidphoenix It might be helpful to show how DrDroid handles noisy alerts.

Siddarth Jain

Noted, thank you for the feedback! @masump

Supa Liu

Automating the entire incident lifecycle—from triage to remediation—across all those tools is exactly the kind of ops magic we need in production. 🚀

Vrushank Vyas

The moment you start selling to enterprises is the moment you should have had on-call implemented (already).

DrDroid has built one of the most impressive solutions and a truly "useful" agent for on-call engineers. Congratulations on the launch!

Siddarth Jain

@vrv18 thank you!

Alex Cloudstar

Congrats on the launch of DrDroid! It sounds like a game-changer for on-call engineers. Making triaging and debugging easier while integrating seamlessly with existing tools is huge. Offering a free forever plan and an extended trial for teams is a generous touch. Best of luck! 🚀

Suryansh Tiwari

DrDroid sounds like a real game-changer for engineering teams. Automating triage, troubleshooting, and remediation can save countless hours and reduce stress for on-call engineers. I’m impressed by the seamless integration with popular tools like Datadog and Kubernetes. Wishing you all the best with this launch!

Dmitry Obukhov

Congrats on the launch — this looks like a huge time-saver for platform and SRE teams. Love how you’re not just detecting issues but going deeper into triaging and suggesting fixes with context.

Curious how DrDroid handles edge cases or noisy alerts — does the agent learn over time based on how teams respond, or is it more rules-based out of the box?

Excited to try it out — feels like a real shift in how on-call can be managed.

Siddarth Jain
@dobk great question! User can give feedback on alerts and mention how they typically react to an alert. Similar alert in future would follow that cadence. We do have rule based option too but that is typically used for self-healing and auto remediation uses cases that are completely deterministic
Jason Chernofsky

every dev i know should be thanking you for this!

Evgenii Zaitsev

Automating triaging, debugging, and remediation of production issues will save teams hours each week, especially when dealing with minor yet time-consuming problems. Does DrDroid handle multi-cloud environments seamlessly, or is it better suited for single cloud/VM setups?

Siddarth Jain

@evgenii_zaitsev1 we work great with multiple integrations / multi-cloud. we have customers using 20+ integrations within their DrDroid account -- from multiple clouds to different databases to custom actions.

Siddharth Goyal

Having used DrDroid to create runbooks for our team has been a scalable way to transfer crucial triage context in a scalable way. Super excited for the launch !