Firetiger

Work - Firetiger

Firetiger

Designing investigation interfaces and monitoring dashboards for security analysts to trace, correlate, and resolve incidents.

Session

Auth Latency Spike

Done

Webhook trigger

2 min ago

Auth service p99 latency exceeded 500ms SLO. Investigate root cause across traces and notify the on-call channel.

Reading runbook

Complete

Research on service topology

Complete

Retrieve relevant traces

Complete

Query connection pool metrics

Complete

Correlate deploy timeline

Complete

Agent: Latency Root Cause

Complete

Found 3 related incidents

Complete

Investigation UI

Designed the agent session UI at Firetiger — the primary surface for viewing how autonomous agents investigate production incidents. Each session renders the trigger message followed by a step-by-step execution trace with research actions, queries, tool uses, and a final outcome status. The interface lets engineers follow the agent's reasoning, inspect individual actions, and understand how production data, codebase context, and business logic were correlated to reach a resolution.

AgentsTracesReal-time

Monitoring Agents

3 active

Auth Latency Monitor

Every 15m · 142 sessions

Done

Connection Pool Watch

Every 5m · 89 sessions

Issue

Deploy Health Check

Post-deploy · 24 sessions

Done

Availability

99.97%

Error Budget

12.3%

Issues

3 open

SLO Monitoring

Built the monitoring and observer views at Firetiger — combining autonomous agent management with SLO dashboards over the data lake. Engineers configure monitoring agents with scheduled, webhook, or post-deploy triggers, then track session outcomes alongside key observability metrics like availability, error budget, and open issues across logs, traces, and metrics.

ObservabilityAgentsSRE
Back homeView project