AI Inference Engineer

IDrive, Inc.

Full-time

Calabasas, CA

Job description

AI Inference Engineer - ONSITE "Calabasas, CA"

Company Overview

IDrive Inc is a leading provider of cloud storage and data backup and on-premise appliance solutions for businesses. We are expanding into AI infrastructure and building an AI Inference Appliance designed to support GPU-accelerated workloads at customer sites.

Job Summary

We are seeking enthusiastic and talented AI Inference Engineers to help develop and optimize inference workloads for our initial product rollout.

Responsibilities

Build and deploy AI model inference services
Optimize inference performance on NVIDIA GPU systems
Convert models using ONNX and/or TensorRT
Package inference services in Docker containers
Develop API endpoints for model serving
Conduct performance benchmarking and profiling
Collaborate with platform engineers for system integration

Required Qualifications

Bachelor’s degree in Computer Science or related field (or equivalent experience)
3–6 years of software engineering experience
2+ years of experience in AI/ML deployment or inference systems
Strong proficiency in Python
Experience with PyTorch or TensorFlow
Experience with ONNX Runtime or TensorRT
Experience working with NVIDIA GPUs
Experience with Docker and Linux

Preferred Qualifications

Knowledge of Kubernetes
Experience with model optimization and quantization
Experience deploying AI systems in edge or on-prem environments
Performance tuning and benchmarking experience

Pay: $120,000.00 - $150,000.00 per year

Benefits:

401(k) matching
Dental insurance
Health insurance
Life insurance
Paid time off
Relocation assistance
Vision insurance

Experience:

Software design: 6 years (Required)
AI: 2 years (Required)

Work Location: In person

Apply