News
Logs in code
3+ hour, 18+ min ago (132+ words) Image from thumbs. dreamstime. com When writing code, many developers especially junior level, don't. .. Tagged with backend, go, development....
Building an Autonomous SRE Agent: From Raw Telemetry to Safe, AI-Driven Remediation
6+ hour, 8+ min ago (380+ words) Modern Site Reliability Engineering (SRE) teams manage hundreds of microservices with complex interdependencies. When an incident occurs, engineers must manually query multiple observability backends, correlate signals across layers, consult historical post-mortems, and execute runbooks. This manual process leads to high…...
How to Track AI Usage Without Losing Revenue (Complete Guide)
6+ hour, 40+ min ago (518+ words) Most AI products eventually run into the same problem: Tracking usage sounds simple. At first, all you need is a counter. A request comes in. You decrement a credit. You process the request. Or at least that's what most teams…...
Product Spotlight: Managing Multiple Client Sites in One Dashboard
2+ week, 4+ day ago (527+ words) Run multi site performance monitoring from one Apogee Watcher dashboard: organisations, per-site settings, and scheduled Page Speed tests without per-client A. .....
Product Spotlight: Team Roles and Access Control in Apogee Watcher
1+ week, 1+ day ago (992+ words) Agency performance monitoring roles in Apogee Watcher: Admin, Manager, and Viewer permissions, team invites, organisation scoping, and read-only access for c. .....
Word Press Performance Monitoring: A Complete Guide
8+ hour, 12+ min ago (1134+ words) Word Press sites share a common platform, but not a common configuration. Two sites on the same host can differ by: That variability is why "we run Lighthouse once a month" fails. Regressions usually arrive from a small change: a…...
Why timeout handling matters more than most backend logic
8+ hour, 59+ min ago (379+ words) Most backend systems spend a lot of time optimizing business logic. Very few spend enough time handling timeouts correctly. But in production systems, bad timeout handling causes more instability than most application bugs. Because backend systems rarely fail instantly. And…...
A Replay Runbook For Missed Publishing Windows
8+ hour, 59+ min ago (347+ words) Originally published on Tech Saa S Cloud When a scheduled post misses its window, the worst fix is often "publish it now." That response treats every post as equal. In reality, a public-sector service notice, a fintech product announcement, and…...
I got tired of writing post-mortems " so I built RCAi for SREs
9+ hour, 24+ min ago (88+ words) I'm an SRE at Sony Interactive Entertainment. After a week where my teammate had four incidents (and four RCAs), I built something for the blank-page problem after every outage. RCAi turns an incident timeline into a structured post-mortem / RCA: Free:…...
Production Lab: ECS Fargate + Prometheus + Grafana + Loki + Alloy + Node Exporter
9+ hour, 44+ min ago (254+ words) You will build this architecture: Officially, ECS Fargate tasks use task execution roles for ECS actions like pulling images/logging, and task roles for application AWS permissions. (AWS Documentation) Alloy supports ECS/Fargate container metrics using the ECS Task Metadata…...