Site Reliability Engineer / DevOps Engineer
My Desk
Contract
Remote
Job description
Hiring immediately: Site Reliability Engineer / DevOps Engineer on a 6-month contract (extendable)
Type: 6+ Month Contract
Rate: Open (hourly)
Position Overview:
The Lead Site Reliability Engineer is a senior technical leadership role within the Engineering organization, responsible for the reliability, availability, and operational excellence of ARC’s cloud infrastructure and kiosks platform. This role owns uptime, SLAs, and incident response while driving long-term improvements to system resilience, observability, and operational maturity. The Lead SRE serves as both a hands-on technical leader and a force multiplier across platform, QA, and development teams.
Responsibilities
● Own uptime, SLAs, and overall reliability of cloud infrastructure and kiosks platform.
● Lead incident response, root-cause analysis, and drive actionable postmortems.
● Automate infrastructure, deployments, and operational tasks using modern IaC and scripting in collaboration with the Platform Engineering team.
● Maintain and improve monitoring, alerting, and observability (Grafana, Prometheus, New Relic, etc.).
● Manage, operate and recommend improvement of mo
● Execute and continuously improve disaster recovery and business continuity plans.
● Partner with platform engineering, QA, and development teams to ensure operational readiness.
● Establish and maintain runbooks, operational standards, and reliability best practices.
● Provide leadership, mentorship, and clear communication during both normal operations and incidents.
● Optimize cloud and Kubernetes environments for reliability, performance, and scalability.
Qualifications
● 8+ years in SRE, DevOps, or Platform Engineering roles; 2+ years in a senior or lead capacity.
● Strong experience supporting production environments with strict SLAs and high uptime requirements.
● Deep knowledge of Kubernetes, containers, and cloud-native infrastructure.
● Proficiency in automation and scripting using Bash, Python, or Go.
● Hands-on experience with CI/CD pipelines and release engineering in modern environments.
● Expert-level familiarity with IaC tools (Terraform preferred).
● Strong understanding of monitoring, alerting, logging, and observability tooling.
● Experience implementing and managing GitOps workflows (ArgoCD or similar).
● Demonstrated ability to lead incidents and communicate effectively with technical
● Solid understanding of disaster recovery planning, resilience practices, and system hardening.
Job Type: Contract
Pay: $15,000.00 - $20,000.00 per month
Application Question(s):
- Do you have experience implementing and managing GitOps workflows (ArgoCD or similar)?
- Do you have a minimum of 8 years of experience in SRE, DevOps, or Platform Engineering? Mention years
- Do you have a minimum of 2 years in a lead position? Mention years
- Do you have deep knowledge of Kubernetes, containers, and cloud-native infrastructure? Mention
- Are you proficient in automation and scripting using Bash, Python, or Go?
- Do you have hands-on experience with CI/CD pipelines and release engineering in modern environments?
- Do you have expert-level familiarity with IaC tools (Terraform preferred)? Mention
Work Location: Remote