•
Lead DevOps initiatives for customer-facing WebSphere Commerce platform serving thousands of concurrent users, ensuring 99.9% uptime and optimal performance
•
Architect and implement CI/CD pipelines using Jenkins and GitHub Actions, reducing deployment time by 60% and enabling multiple daily releases
•
Design and deploy infrastructure automation using Terraform and Ansible, managing 80+ environments across development, testing, and production
•
Oversee patching, upgrades, and performance optimization of mission-critical eCommerce applications, achieving 40% improvement in response times
•
Administer Apache Web Server and Tomcat Application Server for Tuxedo Rental system, custom clothing applications, and Master Data Management platforms
•
Implement comprehensive monitoring solutions using Grafana, Prometheus, Open telemetry, Open-source LTM tools, and Sumo Logic for proactive incident detection and resolution.
•
Troubleshoot and optimize Elasticsearch-based cluster issues, managing node performance, index optimization, and cluster health to ensure sub-second query response times
•
Collaborate with development teams to perform root cause analysis and resolve production issues, reducing MTTR by 50%
•
Scale infrastructure to handle 3x traffic during peak retail seasons (Black Friday, holiday periods) with zero downtime
•
Configure multiple Grafana data source connections (Prometheus, Google Prometheus, Elasticsearch, MySQL, PostgreSQL, Google MQL, DB2, Oracle) to build custom dashboards and third-party integrations for real-time metrics visualization.
•
Develop and publish custom NPM packages to private GitHub registries, creating reusable components utilized across multiple application repositories to standardize deployments.