Ashok Kumar Selvam

+1 346 302 9180
GitHub: https://ashoktmw.github.io/ak/

PROFESSIONAL SUMMARY

Dynamic Staff DevOps Engineer with over 12 years of expertise in designing and automating scalable cloud infrastructures using Python, Ansible, Terraform, and Shell scripting. Proven track record of accelerating deployment frequency by 40% and enhancing system uptime to 99.99% through robust CI/CD pipelines leveraging Jenkins and GitHub Actions on GCP. Adept at driving operational excellence and cost optimization, delivering resilient solutions that empower teams to innovate rapidly and securely.

WORK EXPERIENCE

Staff DevOps Engineer - Cloud
08/2019 - Present
Tailored Brands , Houston
Lead DevOps initiatives for customer-facing WebSphere Commerce platform serving thousands of concurrent users, ensuring 99.9% uptime and optimal performance
Architect and implement CI/CD pipelines using Jenkins and GitHub Actions, reducing deployment time by 60% and enabling multiple daily releases
Design and deploy infrastructure automation using Terraform and Ansible, managing 80+ environments across development, testing, and production
Oversee patching, upgrades, and performance optimization of mission-critical eCommerce applications, achieving 40% improvement in response times
Administer Apache Web Server and Tomcat Application Server for Tuxedo Rental system, custom clothing applications, and Master Data Management platforms
Implement comprehensive monitoring solutions using Grafana, Prometheus, Open telemetry, Open-source LTM tools, and Sumo Logic for proactive incident detection and resolution.
Troubleshoot and optimize Elasticsearch-based cluster issues, managing node performance, index optimization, and cluster health to ensure sub-second query response times
Collaborate with development teams to perform root cause analysis and resolve production issues, reducing MTTR by 50%
Scale infrastructure to handle 3x traffic during peak retail seasons (Black Friday, holiday periods) with zero downtime
Configure multiple Grafana data source connections (Prometheus, Google Prometheus, Elasticsearch, MySQL, PostgreSQL, Google MQL, DB2, Oracle) to build custom dashboards and third-party integrations for real-time metrics visualization.
Develop and publish custom NPM packages to private GitHub registries, creating reusable components utilized across multiple application repositories to standardize deployments.
Production Site Reliability Engineer
02/2015 - 08/2019
Tata Consultancy Services , Coraopolis, PA, USA
Dicks has its own eCommerce platform by partnering with IBM tools called Websphere Commerce to integrate the eCommerce platform in production with two tier Data centers.
Owning nearly 6 largest non-production environment and one multitier eCommerce WCS data center with linux and DB2 database.
SRE and Observability initiatives ensuring high availability, proactive monitoring, and faster incident response through tools such as Grafana, Prometheus, AppDynamics, and Sumo Logic.
Specialized in performance tuning, CDN management (Akamai) with Cloudlets and edge computing, optimizing infrastructure for reliability, scalability, and fault tolerance.
Administrating Websphere Application Server Administration,Jenkins,SOLR SEARCH Engine, Extreme Scale servers.
Administrating Crossview Customer care application and handling ETL and Indexing.
Package, build, Integrate and deploy enterprise J2EE applications on WebSphere 7 that involves EAR (Enterprise Archives) and WAR (Web Archives).
Configuring WebSphere resources, including JDBC providers, JDBC data sources and connection pooling, MQ.
Having hands on experience Automation using ANT, LINUX scripting in Jenkins tool and Control-M.
Performing Root Cause Analysis with the help of HeapDump and Thread Dump Analyzer tools.
Have hands-on experience on Staging propagation, Solr Index utility jobs, Dataload jobs.
Performance tuning in Crossview Customer care application for Java 8 by load test and adding JVM options for GC tuning.
Perform daily application health checks for CPU Utilization, Memory Heap Utilization, Thread usage, and response times.
eCommerce Environment Build and Deploy Team Member
12/2012 - 01/2015
The HomeDepot , Chennai, INDIA
Reporting and Auditing through Splunk and SCOM and AppDynamics.
L2 Support for SCOM and Splunk agents
Installing SCOM applications in Windows Server 2012 and supporting Windows Server administration.
Servicenow,Remedy, HPSM and Ivanti
Atlassian stack (JIRA, Confluence, Bitbucket)
Languages Used: Unix Shell Scripting, Powershell Scripting, Python, Ansible and Terraform

EDUCATION

M.Tech in Information Security & Computer Forensics
01/2012
GPA: 8.6 /10
B.Tech in Information Technology
01/2010
GPA: 69%
Higher Secondary
01/2006
St.Ann’s Hr.Sec. School, Tindivanam GPA: 76%
Matriculation
01/2004
St.Joseph’s Matric. Hr. Sec. School, Tindivanam GPA: 81%

SKILLS

Technical Skills: Python, Shell Scripting, PowerShell, Ansible, Terraform, Jenkins, GitHub Actions, GCP, Apache Web Server, Tomcat Application Server, Prometheus, OpenTelemetry, Elasticsearch, NPM, Unix Shell Scripting
Soft Skills: Collaboration, Problem Solving, Adaptability, Communication, Time Management
Tools: Grafana, Sumo Logic, WebSphere Commerce, Akamai, Apache Service Mix, SCOM, Netcool, AppDynamics, SOLR Search Engine, Extreme Scale Servers, Control-M, HeapDump Analyzer, Thread Dump Analyzer, Splunk, System Center Operations Manager, Windows Server, ServiceNow, Remedy, HPSM, Ivanti, Atlassian Stack, Bitbucket, JIRA, Confluence
Other: Agile Methodology, ITIL Framework, Cloud Infrastructure, DevOps Practices, Incident Management

PROJECTS

Grafana
Technologies: Shell, Java, Go
Integrating grafana with Google Managed prometheus
integrated many exporters to grafana as datasource (IBM DB2, Service now,Github,Elasticsearch,Java exporters ) etc

CERTIFICATIONS

IBM WebSphere Commerce V7 System Administration – Foundations Badge
IBM
Sumologic Mastery and Sumologic Administrator Certified
Google Cloud Platform Fundamentals: Core Infrastructure
Coursera
70-246 MICROSOFT Certification - Monitoring and Operating a Private Cloud (SCOM)
Microsoft
ITIL Foundation V3EX0-101 certified professional
RHCE – Red Hat Linux Certified Engineer
Red Hat

ACHIEVEMENTS

TECHNOLOGY MOST VALUABLE TEAM OF THE YEAR 2025