CareerZen Logo
Company logo

SENIOR HPC DEVOPS ENGINEER

Peraton

Full-time

College Park, MD

Job description

Quantum Leap is the leading technology services company focused on providing operationally focused solutions for select clientele within the United States Government (USG). Quantum Leap was founded by a team of technologists to focus exclusively on advancing USG interests against the Nation’s pacing threats in an age of Strategic (Great Power) Competition. We have built an elite, collaborative workforce dedicated to countering those threats in an environment designed for success. We offer exciting, important work, a team-oriented culture and growth opportunities.

We are seeking a highly motivated and experienced Senior Linux and DevOps Engineer to spearhead the design, build-out, and tooling of our NOC. This is a greenfield role focused on architecture, tooling, automation, and implementation. The ideal candidate is a hands-on Linux and open-source expert with deep experience building scalable monitoring and observability platforms.

Key Responsibilities:

NOC Architecture & Platform Design

  • Architect and deploy end-to-end NOC monitoring and logging ecosystems that support scalable, reliable operations.
  • Design highly available, Linux-based infrastructure that can grow with business needs while maintaining performance and resilience.
  • Evaluate, select, and implement open-source monitoring and observability tools such as Zabbix, Prometheus, Grafana, and the ELK Stack to provide actionable insights across systems.

Automation & Infrastructure as Code

  • Develop automation for deployment, configuration, and lifecycle management of monitoring and observability platforms.
  • Implement Infrastructure-as-Code practices using Ansible, Python, Bash, or similar tools to reduce manual work and improve consistency.
  • Establish configuration management standards and integrate automation into CI/CD pipelines where applicable.

Monitoring & Observability Engineering

  • Design alerting strategies that focus on actionable, low-noise alerts to help teams respond efficiently.
  • Build and maintain logging aggregation and analysis platforms to provide clear visibility into system health.
  • Integrate monitoring and observability systems seamlessly with existing infrastructure and platform services.

Documentation & Knowledge Transfer

  • Produce clear, practical technical documentation for architectures, platforms, and operational procedures.
  • Support the transition of platform ownership to the NOC Operations team, ensuring they can manage and maintain systems confidently.

Required Technical Skills and Experience:

  • 5+ years of senior-level Linux systems engineering experience with expert proficiency in RHEL, Ubuntu, or CentOS.
  • Deep experience with monitoring and observability tools, including Zabbix, Prometheus, Grafana, Nagios/Icinga, ELK Stack, and Graylog.
  • Strong scripting and automation skills using Python, Bash, or similar languages.
  • Solid networking knowledge, including TCP/IP, DNS, DHCP, SNMP, BGP, and OSPF.
  • Experienced with troubleshooting tools such as tcpdump, Wireshark, and mtr.
  • Familiarity with configuration management tools like Ansible, Puppet, or Chef.
  • Infrastructure-focused architect with an automation-first mindset, comfortable building platforms from the ground up.
  • Primarily a builder and technical leader rather than a people manager, focused on designing and delivering scalable systems.

Compensation and Benefits:

We seek the best-and-brightest professionals at all levels so offer highly competitive compensation, benefits and bonus plans. Company-sponsored benefits include: 6% 401(k) company match; 100% health care premium subsidy for employees, 75% subsidy for dependents; Anthem medical, PPO, HDHP/HSA, HCFSA, DCFSA, Guardian dental, Life and disability coverage; 3 weeks’ vacation; 2 weeks sick leave, and 11 holidays.

Quantum Leap is an Equal Opportunity Employer

www.ql-research.com

Job Type: Full-time

Pay: $140,000.00 - $180,000.00 per year

Benefits:

  • 401(k)
  • 401(k) matching
  • Dental insurance
  • Employee assistance program
  • Flexible spending account
  • Health insurance
  • Health savings account
  • Life insurance
  • Paid time off
  • Professional development assistance
  • Referral program
  • Tuition reimbursement
  • Vision insurance

Education:

  • Bachelor's (Preferred)

Experience:

  • Linux Engineering: 5 years (Preferred)
  • monitoring and observability tools (Zabbix, Prometheus etc): 5 years (Preferred)
  • scripting and automation skills : 5 years (Preferred)
  • designing and delivering systems: 5 years (Preferred)

Security clearance:

  • Top Secret (Required)

Ability to Commute:

  • Leesburg, VA 20175 (Required)

Work Location: In person