Service Reliability Engineer(SRE)

Bengaluru | Exp : 10 – 15  years | Notice period : Immediate /30days |

Rotation Shift ( 2 weeks) : India ( 9am to 6pm IST) and  US ( 9pm to 6am IST

EDUCATION & EXPERIENCE:

Bachelor’s degree and eight years of relevant experience or a combination of education and relevant experience.

KNOWLEDGE, SKILLS, AND ABILITIES:

  • Hands on experience with Kubernetes resources – deployments, services, ingress, storage volumes, configmaps with EKS/Karpenter
  • Proficiency using AWS Services – Autoscaling/EC2, AMI, Security Groups, ALB, S3, VPC
  • Experience with diverse middleware technologies(tomcat, weblogic, apache etc) on bare metal and docker containers.
  • Experience with Infrastructure as a code like terraform and container orchestration utilities.
  • Demonstrate Cloud Infrastructure experience with experience in building full-stack infrastructure for enterprise ready applications.
  • Experience with install configure and support Oracle Database.
  • Experience with version control systems (Git, SVN) and CI/CD tools.
  • Proficiency in programming and scripting languages, especially Python and Shell.
  • Strong working knowledge of Linux-based systems.

CORE DUTIES:

  • Deploying and managing highly available hybrid systems on On premise and cloud platforms like AWS, focusing on Infrastructure-as-a-Service (IaaS) and Platform-as-a-Service (PaaS) offerings.
  • Deploy and manage containerized applications using Docker and orchestrate them with docker compose or Kubernetes for scalability and resilience.
  • Manage installations, configurations, and upgrades; troubleshoot outages and incidents.
  • Implement Infrastructure as Code practices using tools like Terraform to automate cloud infrastructure provisioning and management. Improve operational efficiency by automating routine application tasks using python and shell scripting
  • Design, implement, and maintain CI/CD pipelines to streamline application deployment processes, ensuring high-quality software delivery.
  • Modernize existing infrastructure and applications by integrating new technologies and cloud-native solutions.
  • Actively participate in scaling, performance tuning and capacity planning of Enterprise Stack, including Single Sign On and SSL keystore management.
  • Conduct application server hardening to enhance security against potential threats.
  • Create and maintain comprehensive documentation for system configurations, procedures, and best practices to ensure knowledge transfer and compliance.
  • Ensure robust monitoring processes are in place and compliance with production security standards.