How do you tune a slow PostgreSQL query when indexes exist but seq scans dominate?
Oracle: Key differences between UNDO and REDO; how do you right-size them for OLTP?
PostgreSQL physical vs logical replication — when do you pick which?
Fastest way to diagnose and fix ORA-04031 (shared pool) in production?
How do you prevent and monitor table/index bloat in large PostgreSQL instances?
Design <15 min RTO / <5 min RPO for a 10 TB PostgreSQL cluster (your architecture).
Oracle Data Guard: trade-offs between Max Performance, Availability, and Protection modes?
Primary PostgreSQL dies and standby won’t promote — your exact checklist at 3 AM.
How would you do a zero-downtime Oracle → PostgreSQL migration (or reverse)?
Top 5 SLO/SLI metrics you define for a critical database tier.
How do you manage config for 100+ Oracle/PostgreSQL instances with zero drift?
Show/describe your Terraform + operator setup for PostgreSQL on Kubernetes.
How do you automate quarterly Oracle PSU + OJVM patching with near-zero downtime?
Must-have alerts for PostgreSQL and Oracle (name the queries/metrics).
Sudden spike in Oracle “cache buffers chains” latch — how do you debug live?
Worst database outage you’ve handled — root cause and postmortem actions?
You just ran DROP TABLE in prod (no WHERE). Recovery steps and timeline?
How do you store and rotate DB credentials + TDE wallets in an SRE world?
As an SRE, are you okay with devs running ALTER TABLE via CI/CD? Why/why not?
Your DB SLO is 99.95% and one incident just burned 50% of the monthly error budget — what now?





0 comments:
Post a Comment