AI Inference Engineer
IDrive, Inc.
Full-time
Calabasas, CA
Job description
AI Inference Engineer - ONSITE "Calabasas, CA"
Company Overview
IDrive Inc is a leading provider of cloud storage and data backup and on-premise appliance solutions for businesses. We are expanding into AI infrastructure and building an AI Inference Appliance designed to support GPU-accelerated workloads at customer sites.
Job Summary
We are seeking enthusiastic and talented AI Inference Engineers to help develop and optimize inference workloads for our initial product rollout.
Responsibilities
- Build and deploy AI model inference services
- Optimize inference performance on NVIDIA GPU systems
- Convert models using ONNX and/or TensorRT
- Package inference services in Docker containers
- Develop API endpoints for model serving
- Conduct performance benchmarking and profiling
- Collaborate with platform engineers for system integration
Required Qualifications
- Bachelor’s degree in Computer Science or related field (or equivalent experience)
- 3–6 years of software engineering experience
- 2+ years of experience in AI/ML deployment or inference systems
- Strong proficiency in Python
- Experience with PyTorch or TensorFlow
- Experience with ONNX Runtime or TensorRT
- Experience working with NVIDIA GPUs
- Experience with Docker and Linux
Preferred Qualifications
- Knowledge of Kubernetes
- Experience with model optimization and quantization
- Experience deploying AI systems in edge or on-prem environments
- Performance tuning and benchmarking experience
Pay: $120,000.00 - $150,000.00 per year
Benefits:
- 401(k) matching
- Dental insurance
- Health insurance
- Life insurance
- Paid time off
- Relocation assistance
- Vision insurance
Experience:
- Software design: 6 years (Required)
- AI: 2 years (Required)
Work Location: In person