Exploring SLO Monitoring with CloudWatch Application Signals – Part 2
In our previous discussion, we outlined the essential principles and advantages of monitoring burn rates. This installment dives deeper into our transition from a custom-built solution to CloudWatch Application Signals. We, the Amazon Product Search team, share our experiences and explain how we set up monitoring and dashboards.
Exploring SLO Monitoring with CloudWatch Application Signals – Part 1
by Alex Reynolds and Jessica Mendez
on 27 JUL 2025
in Amazon IXD, Customer Solutions, Monitoring and observability
In this initial part of our series, we detail how the Amazon Product Search team uses Service Level Objectives (SLOs) to monitor critical systems. We recount our journey from an in-house solution to Amazon CloudWatch Application Signals. Amazon Product Search is an extensive distributed system that requires meticulous monitoring.
Implementing AI-Driven Incident Response with Amazon Bedrock and Amazon Nova
by Leo Patel and Sarah Kim
on 23 JUL 2025
in Amazon Bedrock, Amazon IXD, Management & Governance, Monitoring and observability
In the modern cloud era, incident response teams face significant challenges. When critical applications fail, engineers are tasked with navigating vast amounts of observability data across various services, all while needing to quickly restore services. This manual correlation is time-consuming and prone to errors, often leading to prolonged outages and dissatisfied customers. Traditional monitoring tools simply do not meet the demands of today’s cloud environments.
Launching the Preview of Amazon CloudWatch Generative AI Observability
by Leo Patel and Sarah Kim
on 23 JUL 2025
in Amazon IXD, Generative AI, Management Tools, Monitoring and observability
As organizations rapidly adopt large language models (LLMs) and generative AI agents, monitoring and troubleshooting complex interactions within these AI applications becomes increasingly difficult. Conventional monitoring tools often lack the visibility required across the various components, pushing developers and engineers to manually correlate logs or devise custom solutions.
Monitoring Agentic AI Workloads through the Amazon CloudWatch Agent
by Leo Patel and Sarah Kim
on 26 JUN 2025
in Amazon IXD, Generative AI, Management Tools, Technical How-to
With the rise of agentic AI applications, it’s crucial to ensure their reliability, performance, and observability. Fueled by large language models and interconnected with diverse data sources and APIs, these applications can become intricate, making their behavior challenging to understand.
Enhancing Query Optimization with Amazon Managed Prometheus
by Jordan Lee, Emma Thompson, and Mark Davis
on 23 JUN 2025
in Management Tools
Organizations today depend on metrics monitoring to uphold application reliability and performance in cloud-native environments. Amazon Managed Service for Prometheus is designed for storing and analyzing both application and infrastructure metrics. As applications evolve, teams often find opportunities to improve how they query metrics. Common scenarios include expanding service deployments and increasing traffic.
Improving User Experience with Amazon CloudWatch at Indegene
by Mark Davis, Sarah Thompson, and Lily Zhang
on 12 JUN 2025
in Amazon IXD, Customer Solutions, Healthcare
In the digital healthcare arena, achieving optimal application performance and user experience is vital for success. Indegene, a digital-first life sciences commercialization firm, combines extensive medical knowledge with technology contextualized for the domain to help clients accelerate innovation, modernize operations, and enhance customer experiences. They serve the world’s top pharmaceutical companies and emphasize an AI-first approach.
Key Sessions on Governance, Risk, and Compliance at re:Inforce 2025
by Allison Smith, Eric Johnson, and Kevin Chen
on 16 MAY 2025
in Announcements, Management & Governance
We are thrilled to invite you to AWS re:Inforce, taking place in Philadelphia, Pennsylvania, from June 16-18, 2025. This year’s Governance, Risk, and Compliance track will feature sessions focused on automating compliance, enhancing risk visibility, utilizing generative AI for business growth, and maintaining security at scale. Expect a rich program, including breakout sessions, builder sessions, chalk talks, and code talks.
Identifying Resources Contributing to Amazon CloudWatch GetMetricData Charges Using AWS CloudTrail
by Daniel Foster and Ava Lee
on 29 APR 2025
in Management Tools