There’s a problem with modern observability that almost nobody talks about openly: your monitoring stack might be hurting the systems it’s supposed to protect. I don’t mean in a theoretical sense. I mean that the agents and SDKs most teams rely on for visibility impose real overhead on the applications they instrument. CPU, memory, throughput. … continue reading
SAN FRANCISCO — groundcover, the observability platform for modern architectures, today announced the general availability of groundcover AI Mode, a native AI capability designed to help engineering teams investigate production incidents and analyze infrastructure behavior directly inside their own cloud environments. AI Mode runs natively within the customer’s own AWS infrastructure via Amazon Bedrock, ensuring … continue reading
In a significant move toward solidifying the infrastructure for production-grade AI, the llm-d project is being contributed to the Cloud Native Computing Foundation (CNCF) as a Sandbox initiative. This commitment, spearheaded by a multi-vendor coalition including CoreWeave, IBM Red Hat, Google, and NVIDIA, aims to establish an open standard for distributed inference. By integrating llm-d into the cloud … continue reading
Vigil: An Open-Source AI SOC Built with a LLM-native Architecture A new open source project, Vigil, launched at RSA today, enhances the transformative intelligence of rapidly advancing reasoning models, including Anthropic’s Claude. Available under an Apache 2.0 license, Vigil — created by DeepTempo — ships with13 specialized AI agents, 30+ integrations, and 7,200+ detection rules spanning … continue reading
Kubernetes has become the standard for container orchestration. It’s also notoriously challenging to manage when incidents arise. These incidents can come in many shapes and sizes, with their complexity forcing responders into firefighting mode. As a result, teams frequently end up chasing symptoms rather than finding and fixing the underlying cause. While Kubernetes incidents may … continue reading
MINNEAPOLIS — Modern penetration testing provider NetSPI today announced a new, modern user experience for the NetSPI platform, reimagining what penetration testing should feel like for today’s enterprise: focused, fast, and easy. Security teams are being asked to do more, faster, across attack surfaces that don’t sit still. Yet too many pentesting programs remain slow, … continue reading
The Linux Foundation has announced it will use $12.5 million in grants to develop long-term, sustainable security solutions that support open source communities worldwide. This is necessary, the foundation said in its announcement, because rapid advances in AI have created a more complex security landscape with vulnerabilities being found in much greater numbers, leaving security teams … continue reading
The AI boom is reshaping application architectures. Large Language Model (LLM) inference has fundamentally altered the requirements of the Kubernetes networking stack. Kubernetes is now the default environment for scheduling GPU-accelerated workloads, but the last mile of delivery — connecting a user request to the optimal model instance — is increasingly a bottleneck. Traditional ingress … continue reading
ARMONK, N.Y. – IBM today announced at GTC 2026 an expanded collaboration with NVIDIA to help enterprises operationalize AI at scale. Advancing efforts across GPU-native data analytics, intelligent document processing, on-premises and regulated infrastructure deployments, cloud, and consulting, the collaboration aims to give enterprises the data foundation, infrastructure, and expertise to move AI from pilot … continue reading
Let’s talk about debt. For years, enterprises have made decisions that help them move faster in the moment – taking shortcuts, postponing cleanup, or accepting imperfect visibility – knowing it will create technical debt they’ll eventually have to unwind. Many leaders accept this trade-off. While they know it will be a pain to deal with … continue reading
I know the pressure you are under right now. In every meeting I attend with technology leaders, the conversation inevitably drifts toward the same mandate: “What is our AI story?” You are expected to explain how AI will predict the next outage, optimize traffic flows, and finally deliver the self-healing infrastructure that has been promised … continue reading
Security teams have spent decades building defenses around network perimeters. AI pipelines make those perimeters meaningless. Data moves constantly between training environments, model registries, inference endpoints, and third-party services. A fraud detection system I worked on in a large healthcare setting illustrates why: the workflow relied on governed clinical and claims data, real-time event signals, … continue reading
The DevOps and Platform Engineering landscape is undergoing a massive shift. As AI-driven automation accelerates, the volume of machine-generated telemetry data is growing exponentially. Consequently, traditional observability platforms are struggling to provide the context and speed necessary for AI-scale operations. Existing tools, built for humans reading logs, are failing to keep up with intelligent agents … continue reading
A critical misalignment between modern IT architecture and the monitoring and observability tools needed for full-stack visibility has led to those tools not being able to keep pace, according to the 2026 SolarWinds State of Monitoring & Observability report. The report found that: 77% of respondents cite limited visibility across on-prem and cloud environments 75% … continue reading