Job Title : Site Reliability Engineer (SRE) - : Hyderabad (5 Days WFO) (Hyderabad Candidates are only preferred.) IMMEDIATE : Conviction HR Type : Contract-to-Hire (C2 H) Job Description : Conviction HR is seeking a talented Site Reliability Engineer (SRE) to join our growing team.
This Contract-to-Hire position is perfect for an individual who is passionate about improving system reliability and performance while collaborating closely with both development and operations teams.
Business Requirements : - 10 Yrs exp in Linux, Windows, VMWare with strong skills in Infrastructure as Code using Java, Python, Unix Shell, Powershell or equivalent.
- Hands on experience in Jenkins, Dev Sec Ops Frameworks, Terraform, Ansible, Chef or Puppet will be a plus.
- Strong experience in building and deploying CI/CD pipelines for complex distributed software is required.
- Good working knowledge on Containers using Docker or Podman, Kubernetes is a plus.
- Experience in one or more programing languages - Java, Python, Unix Shell, Powershell or equivalent - Experience and expertise in distributed systems is a must - Experience in building software and systems to manage platform infrastructure and application - Providing primary operational support and engineering for multiple large scale distributed software applications Key Responsibilities : - Design, implement, and maintain scalable and reliable infrastructure and services.
- Monitor system performance and reliability, ensuring high availability and quick recovery from incidents.
- Collaborate with development teams to improve application performance through automation and best practices.
- Develop and maintain incident response plans, conducting post-mortem analyses to prevent future issues.
- Implement monitoring and alerting solutions to proactively identify and resolve issues.
- Participate in the on-call rotation to provide support for production systems.
- Advocate for a culture of reliability, operational excellence, and continuous improvement.
Qualifications : - Bachelor's degree in Computer Science, Information Technology, or a related field.
- 3 years of experience in a Site Reliability Engineering or related role.
- Strong understanding of cloud platforms (e.g., AWS, Azure, Google Cloud) and container orchestration (e.g., Kubernetes, Docker).
- Proficiency in scripting and automation tools (e.g., Python, Bash, Terraform).
- Experience with monitoring and observability tools (e.g., Prometheus, Grafana, ELK Stack).
- Familiarity with CI/CD pipelines and Dev Ops methodologies.
- Excellent problem-solving skills and a proactive mindset.
- Strong communication skills and ability to work collaboratively with diverse teams.
What We Offer : - Competitive salary and comprehensive benefits.
- Opportunities for professional growth and development.
- A collaborative, innovative, and inclusive work environment.
(ref:hirist.tech)
Advertisement:
Site Reliability Engineer - Distributed Systems, Hyderabad
Free
Site Reliability Engineer - Distributed Systems, Hyderabad
India, Andhra Pradesh, Hyderabad,
Modified November 14, 2024
Description
Job details:
⇐ Previous job |
Next job ⇒ |