Start with metrics to confirm scope (p95/p99, endpoints, regions), then use traces to locate slow spans and logs to identify exact errors or queries. Compare recent deploys and config changes.
A structured workflow saves time:
Regression checklist:
1) p95/p99 up? 2) Which routes? 3) Which version? 4) Trace slow spans
5) Check DB: slow query log / locks / cache hit ratio