Senior SysOps Engineer (AWS, Terraform, Kubernetes)
Key Responsibilities:
- Design and implement scalable, high-availability infrastructure on AWS (EC2, ELB, ECS, EFS, RDS, ElastiCache), provisioned with Terraform as infrastructure as code.
- Architect and maintain CI/CD pipelines through Jenkins and GitLab to ensure seamless, automated deployments with minimal downtime.
- Orchestrate containerized applications leveraging Docker, AWS ECS, and Kubernetes (AWS EKS) to achieve optimal resource utilization and application isolation.
- Configure and fine-tune database systems including PostgreSQL 14+, MySQL 8+, Oracle, Cassandra, and Redis for maximum performance and reliability.
- Implement and administer messaging systems using Apache Kafka 3.x for real-time data streaming in trading environments.
- Establish comprehensive logging solutions with the ELK stack (Elasticsearch 8.x, Logstash, Kibana) and AWS CloudWatch for centralized log management.
- Deploy advanced monitoring frameworks utilizing Zabbix, Prometheus, and Grafana to ensure system health and preemptively identify potential issues.
- Configure and optimize web servers such as Nginx for enhanced performance, security, and reliability.
- Implement and oversee security and service infrastructure, including HashiCorp Vault for secrets management and Consul for service discovery.
- Support distributed tracing with Jaeger to monitor and troubleshoot complex microservice architectures.
- Diagnose and resolve infrastructure issues in production environments with minimal impact on trading operations.
- Collaborate with development teams to continuously improve system reliability, performance metrics, and security posture.
Required Skills:
- Minimum 2 years of experience as a Linux System Administrator with proficiency in command-line operations and shell scripting.
- Demonstrated expertise in configuring and managing AWS cloud services, including EC2, ELB, ECS, EFS, RDS, and ElastiCache.
- Practical experience with Infrastructure as Code using Terraform to provision and manage cloud resources.
- Hands-on knowledge of containerization technologies (Docker, Kubernetes) and orchestration strategies.
- Comprehensive understanding of microservice architecture and service mesh concepts (such as Istio or Linkerd).
- Experience administering modern web technologies, including web servers, application servers, and load balancers.
- Solid networking knowledge (TCP/IP, DNS, HTTP/S, Load Balancing, Firewalls) to ensure secure and performant systems.
- Excellent troubleshooting abilities with a methodical approach to problem-solving under pressure.
- English language proficiency at B1 level or higher for effective team communication.
