AI Inference Engineer

IDrive, Inc.

Full-time

Calabasas, CA

Job description

AI Inference Engineer - Onsite (Calabasas, CA)

Company Overview

IDrive, Inc. is a leading provider of cloud storage, data backup, and on-premises appliance solutions for businesses. We are expanding into AI infrastructure and building an AI Inference Appliance designed to support GPU-accelerated workloads at customer sites.

Job Summary

We are seeking enthusiastic, talented AI Inference Engineers to help develop and optimize inference workloads for our initial product rollout.

Responsibilities

  • Build and deploy AI model inference services
  • Optimize inference performance on NVIDIA GPU systems
  • Convert models using ONNX and/or TensorRT
  • Package inference services in Docker containers
  • Develop API endpoints for model serving
  • Conduct performance benchmarking and profiling
  • Collaborate with platform engineers for system integration

Required Qualifications

  • Bachelor’s degree in Computer Science or related field (or equivalent experience)
  • 3–6 years of software engineering experience
  • 2+ years of experience in AI/ML deployment or inference systems
  • Strong proficiency in Python
  • Experience with PyTorch or TensorFlow
  • Experience with ONNX Runtime or TensorRT
  • Experience working with NVIDIA GPUs
  • Experience with Docker and Linux

Preferred Qualifications

  • Knowledge of Kubernetes
  • Experience with model optimization and quantization
  • Experience deploying AI systems in edge or on-prem environments
  • Performance tuning and benchmarking experience

Pay: $120,000.00 - $150,000.00 per year

Benefits:

  • 401(k) matching
  • Dental insurance
  • Health insurance
  • Life insurance
  • Paid time off
  • Relocation assistance
  • Vision insurance

Experience:

  • Software design: 6 years (Required)
  • AI: 2 years (Required)

Work Location: In person