Lead DevOps and Systems Engineer
Panther IT
Full-time
Dallas, TX
Job description
Role Summary
Our growing client in North Dallas has an immediate opening for an experienced DevOps and Linux Systems Engineer.
As a Lead DevOps & Systems Engineer, you will be responsible for designing, securing, automating, and operating the hybrid infrastructure that powers our trust platform. This is a hands-on, high-impact role where you will work across Linux systems, Kubernetes clusters, AWS cloud services, and on-prem data center environments to ensure our systems are hardened, resilient, scalable, and compliant.
Your expertise in Linux internals, DevOps automation, Kubernetes lifecycle management, cloud security, CI/CD, and distributed systems will be crucial to our platform’s reliability and performance. You will collaborate closely with engineering, security, and architecture teams to build infrastructure that meets strict standards for availability, security, and operational excellence.
Key Responsibilities
Linux Systems Engineering & Enterprise Operations
- Build and maintain hardened Linux servers (RHEL/AlmaLinux/Ubuntu LTS).
- Perform kernel tuning, SELinux configuration, system hardening, and OS baseline enforcement (CIS, NIST, ISO 27001).
- Develop golden images and automated provisioning workflows.
- Troubleshoot advanced OS, hardware, firmware, virtualization, and kernel-related issues.
Kubernetes Engineering & Automation (AWS + On-Prem)
- Deploy and maintain Kubernetes clusters (EKS + on-prem/bare metal).
- Automate cluster lifecycle tasks (bootstrap, scaling, upgrades, node provisioning).
- Implement and secure cluster networking, service mesh, ingress, persistent storage, and workload policies.
- Integrate Kubernetes with centralized logging, monitoring, and auditing systems.
DevOps, CI/CD, IaC, and Automation
- Develop CI/CD pipelines using GitHub Actions, GitLab CI, or Jenkins.
- Implement Infrastructure-as-Code (Terraform, CloudFormation) for repeatable deployments.
- Build automation workflows using Ansible, Bash, Python, and GitOps patterns.
- Ensure CI/CD and IaC meet security, compliance, and audit traceability standards.
AWS Cloud Engineering & Security
- Support AWS infrastructure including VPCs, subnets, routing, security, RDS, EKS, EC2, IAM.
- Implement AWS IAM least-privilege access models.
- Maintain logging, encryption, and monitoring for compliance.
- Optimize cloud resources for performance, HA, cost, and security.
Security Compliance, IAM Governance, and Audit Remediation
- Enforce security baselines aligned to SOC 2, ISO 27001, NIST, CIS Benchmarks.
- Implement IAM governance for RBAC, PAM, MFA, SSH key lifecycle, identity federation.
- Lead vulnerability management, log retention, audit trails, and compliance reporting.
- Conduct internal security reviews and implement corrective actions.
High Availability (HA), Disaster Recovery (DR), and Resilience Engineering
- Deploy HA topologies for compute, storage, Kubernetes, and network layers.
- Design DR strategies including replication, cross-region failover, and backup automation.
- Conduct DR testing, failover drills, and resilience validation.
- Build monitoring and alerting systems that detect degradation early.
Hybrid Data Center Operations
- Deploy and maintain servers in on-prem racks (HPE/Dell/Nutanix/KVM/ESXi).
- Integrate AWS cloud with on-prem networks for hybrid deployments.
- Optimize high-IOPS workloads via OS, NIC, network, and storage tuning.
- Maintain enterprise monitoring dashboards (Zabbix, Prometheus, Grafana, ELK).
Required Technical Skills & Expertise
- Deep mastery of Linux internals, SELinux, kernel tuning, systemd, cgroups.
- RAID, LVM, multipathing, NVMe performance tuning.
- Knowledge of high availability and load balancer configurations.
- Good understanding of data center networking, including firewalls, VLANs, and DNS.
- CI/CD automation, Terraform, CloudFormation, GitOps, Ansible.
- Kubernetes security, RBAC, PSP/PSA, OPA/Gatekeeper, Helm, Kustomize.
- AWS IAM, KMS, Secrets Manager, CloudTrail, GuardDuty, Config, Organizations.
- Compliance frameworks: SOC 2, ISO 27001, CIS, NIST; evidence gathering and audit remediation.
- HA & DR deployment for cloud and on-prem workloads.
Soft Skills & Leadership
- Excellent communication and documentation skills.
- Strong ability to multitask and manage competing priorities.
- Ability to lead technical reviews, incident response, and post-mortems.
- Collaboration across engineering, development, security, and IT operations.
Preferred Qualifications
- 5+ years of Linux systems engineering experience.
- 5+ years of DevOps and Kubernetes automation and support experience.
- Kubernetes and CI/CD-related certifications preferred.
- RHEL administration certification preferred.
- AWS certifications preferred.
Why Dallas? Why In-Office?
Building a futuristic trusted global network requires intense, high-bandwidth collaboration. We believe that being physically present allows for spontaneous whiteboard sessions, rapid feedback loops, and deep relationship-building crucial to tackling this mission. You’ll have unparalleled access to the entire team, accelerating your learning, impact, and ability to influence our trajectory. We’re building a tight-knit, mission-driven culture here in Dallas, and your presence is key.
Benefits & Perks
- Relocation Assistance – We provide financial support to ensure a smooth move.
- Competitive salary and performance-based bonuses.
- Comprehensive health, dental, and vision insurance.
- 401(k) plan with company match.
- Unlimited paid time off.
- On-site gym.
- Daily lunch.
Job Type: Full-time
Pay: $100,000.00 - $140,000.00 per year
Benefits:
- 401(k)
- Dental insurance
- Flexible schedule
- Health insurance
- Paid time off
Work Location: In person