Slider 1 mini Slider 2 mini

Thursday, 19 March 2026

Filled under:

 As part of the weekend activity for Grid listener certificate renewal, I started performing pre-checks. Although the activity is expected to be a manual listener restart, I encountered issues on a few hosts during validation, indicating it may not be as straightforward.

Additionally, the Confluence document does not currently reflect these scenarios, so the pre-checks have been helpful in identifying these gaps early. I’ll connect with you separately to understand the possible causes for these issues.

From a resourcing perspective, there is a wide scope on the Blue side (~400 servers as informed by Vinit), so his availability will be limited. Out of 74 APAC hosts, I’ve taken ownership of ~35. Vinit has agreed to handle 5–10 hosts where only a simple listener restart is required.

For the remaining scope, we may need support from Navneeth or the Shift SRE team to ensure smooth execution and avoid spillover to the next shift.

Please let me know a convenient time to discuss the pre-check findings.

Posted By Nikhil03:55
Filled under:

 As part of the weekend activity for Grid listener certificate renewal, I started performing pre-checks. Although the activity is expected to be a manual listener restart, I encountered issues on a few hosts during validation, indicating it may not be as straightforward.

Additionally, the Confluence document does not currently reflect these scenarios, so the pre-checks have been helpful in identifying these gaps early. I’ll connect with you separately to understand the possible causes for these issues.

From a resourcing perspective, there is a wide scope on the Blue side (~400 servers as informed by Vinit), so his availability will be limited. Out of 74 APAC hosts, I’ve taken ownership of ~35. Vinit has agreed to handle 5–10 hosts where only a simple listener restart is required.

For the remaining scope, we may need support from Navneeth or the Shift SRE team to ensure smooth execution and avoid spillover to the next shift.

Please let me know a convenient time to discuss the pre-check findings.

Posted By Nikhil03:54
Filled under:

 

Hi Team,

As part of the change to disable AUTOEXTEND, we understand concerns around tablespace utilization.

While the DB team will continue to monitor and send alerts for cleanup or capacity planning, to avoid back-and-forth and delays, we propose setting up a dedicated coordination channel for handling critical queries.

Also note, space addition requests raised over weekends may face approval delays, which could lead to unexpected issues by Monday.

Request your support in aligning on this approach.

Thanks,
DB Team

Posted By Nikhil02:00

Wednesday, 18 March 2026

Filled under:

 Hi OEM Admins,


Sharing a finding from a recent check and requesting your validation on OEM alerting.


The database FRA was full and out of sync for ~2 months, but no GSNow alerts were generated during this period. Alerts were present in OEM, however they do not seem to have been forwarded to GSNow. Notably, an alert was triggered in GSNow today when we started fixing the sync issue.


Request you to validate if there were any issues with alert forwarding, notification rules, or configuration gaps.


Given the criticality, ensuring consistent alert propagation is important.


Thanks & Regards,

[Your Name]

Posted By Nikhil22:21
Filled under:

 Backup failure alerts must be treated as high priority, especially for Production, and should not remain open beyond one week.

Please ensure timely acknowledgment, clear ownership, and regular communication. Refer to the available documentation and engage SMEs/SREs when needed to drive resolution promptly.

Posted By Nikhil06:14
Filled under:

 Kudos to Mykid for consistently ensuring things are properly acknowledged, taking ownership when it matters, and maintaining clear and effective communication throughout.


Really appreciate the responsibility and reliability you bring. Looking forward to seeing you contribute even more towards keeping the estate cleaner with your skills, and sharing your knowledge with the team.

Posted By Nikhil06:02
Filled under:

 Regarding the suggestion to increase backup streams, as per discussions from a few years ago, PostgreSQL backups in Commvault did not support stream configuration.

I also cross-checked with the L2 team on the incident channel, and they confirmed that the streams option is currently greyed out for PG instances.

While only about a week remains for the HSX migration, it would be helpful to explore any long-term approach to improve backup throughput—apart from increasing streams—especially for critical databases.

Please let us know if there have been any recent changes or alternative options we can consider.

Thanks & Regards,

Posted By Nikhil05:30