← Back to list
Senior
Registration: 15.07.2025

Ashish Srivastava

Specialization: Site Reliability Engineer
— Site Reliability Engineering leader with over 12 years of experience in infrastructure management, system reliability, and DevOps practices. — Proficient in leading cross-functional teams, managing large-scale production systems, and implementing end-to-end automation using Infrastructure as Code (IaC). — Demonstrated success in VMware management, container orchestration, and incident handling. — Expert in building CI/CD pipelines using Jenkins and maintaining high-availability environments in hybrid (VMware, AWS) infrastructure. — Lead, mentor, and grow high-performing teams of SRE engineers with a focus on collaboration, skill development, and innovation. — Expert in Terraform and AWS CloudFormation for scalable, automated infrastructure provisioning. — Extensive experience managing and optimizing VMware virtual environments for performance, reliability, and scalability. — Design and maintain Jenkins pipelines with automated gates for quality assurance, testing, and production readiness. — Deploy new code with zero downtime using blue-green strategies across both cloud and on-prem environments. — Hands-on experience with Docker, manage and orchestrate containers for scalable service deployment. — Develop Python and Shell scripts to automate operational tasks and improve system efficiency. — Deploy and manage monitoring stacks using Prometheus, Grafana, Kibana, and Zabbix. — Lead incident response, perform root cause analysis, and implement long-term improvements. — Manage Linux and Windows servers with strong troubleshooting and performance tuning skills. — Implement best practices to secure infrastructure and support audit-ready operations. — Effectively articulate technical details to non-technical stakeholders and align goals across engineering and product teams.
— Site Reliability Engineering leader with over 12 years of experience in infrastructure management, system reliability, and DevOps practices. — Proficient in leading cross-functional teams, managing large-scale production systems, and implementing end-to-end automation using Infrastructure as Code (IaC). — Demonstrated success in VMware management, container orchestration, and incident handling. — Expert in building CI/CD pipelines using Jenkins and maintaining high-availability environments in hybrid (VMware, AWS) infrastructure. — Lead, mentor, and grow high-performing teams of SRE engineers with a focus on collaboration, skill development, and innovation. — Expert in Terraform and AWS CloudFormation for scalable, automated infrastructure provisioning. — Extensive experience managing and optimizing VMware virtual environments for performance, reliability, and scalability. — Design and maintain Jenkins pipelines with automated gates for quality assurance, testing, and production readiness. — Deploy new code with zero downtime using blue-green strategies across both cloud and on-prem environments. — Hands-on experience with Docker, manage and orchestrate containers for scalable service deployment. — Develop Python and Shell scripts to automate operational tasks and improve system efficiency. — Deploy and manage monitoring stacks using Prometheus, Grafana, Kibana, and Zabbix. — Lead incident response, perform root cause analysis, and implement long-term improvements. — Manage Linux and Windows servers with strong troubleshooting and performance tuning skills. — Implement best practices to secure infrastructure and support audit-ready operations. — Effectively articulate technical details to non-technical stakeholders and align goals across engineering and product teams.

Portfolio

Doctoranywhere

● Deployed secure AWS VPC environments, managed Linux and Windows servers, implemented MySQL replication.

Oaks.com.sg

● Infra deign and setup.

Friendshipmeter.com

● Infra Design and setup.

Skills

Python
VMware ESXi
KVM
Ubuntu
CentOS
SQL
MySQL
PostgreSQL
Oracle
Firewall
Windows
Grafana
Zabbix
Kibana
Terraform
Jenkins
Shell
AWS
Bitbucket
GitHub
Git

Work experience

Senior Site Reliability Engineer
since 06.2022 - Till the present day |One97 Communications
Jenkins, CI/CD, Docker, AWS, Grafana
● Led central NOC support to ensure availability and stability of critical production services. ● Designed and implemented gated Jenkins, CI/CD pipelines with automated test validations and manual approvals. ● Managed Docker container environments, automating deployments and scaling. ● Administered AWS infrastructure using Terraform, automating deployments with blue-green strategies. ● Configured Prometheus and Grafana dashboards for system health monitoring. ● Provided performance feedback and mentoring to junior SRE team members. ● Drove root cause analysis and resolution of high-severity incidents. ● Supported both Linux and Windows platforms ensuring cross-environment stability. Projects: ● DoctorAnywhere.com: Deployed secure AWS VPC environments, managed Linux and Windows servers, implemented MySQL replication. ● Secure VPC Setup: Automated VPC and backup configurations using Python & Shell. ● Cloud Resource Automation: Scripted full AWS resource inventory for visibility and cost control.
Executive IT
09.2018 - 05.2022 |Reliance Corporate IT Park
VMware ESXi, ASP.NET
● Supervised 5-member engineering team and delivered critical infrastructure projects. ● Maintained and optimized VMware ESXi environments. ● Developed ASP.NET-based internal tools for infrastructure management. ● Implemented server patching and compliance procedures across platforms. ● Led migrations and performance optimization for key web applications.
System Engineer
12.2016 - 09.2018 |Singsys Pte
AWS, GitLab, CI/CD, Shell, Python
● Built and maintained Jenkins and GitLab CI/CD pipelines. ● Automated infrastructure tasks with Shell/Python scripts. ● Managed AWS services (EC2, VPC, ELB) for production environments. ● Collaborated with stakeholders for project requirements and solution delivery.
System Software Engineer
06.2015 - 10.2016 |Logbull IT Solutions
AWS, Linux, Java, Python, Git
● Designed automation scripts for AWS backup and resource listing. ● Deployed and maintained Linux servers aligned with IT best practices.
Server Support Engineer
06.2014 - 06.2015 |Progressive Infotech
PHP, AWS, Git
● Provided PHP application support and led automation initiatives. ● Ensured high availability of network devices and applications.
Server Support Engineer
11.2012 - 06.2014 |Trimax IT Infrastructure and Services
PHP, AWS, Git
● Managed voice and data network operations. ● Reduced storage costs and improved backup processes through automation.

Languages

EnglishUpper Intermediate