AI Optimization Engineer - ONSITE
Simple Software Solutions Group · Jersey City, New Jersey, US
Job Description Summary – AI Optimization Engineer (Onsite, Jersey City, NJ) We are seeking an experienced AI Optimization Engineer to support large-scale AI...
Job description
Job Description Summary – AI Optimization Engineer (Onsite, Jersey City, NJ) We are seeking an experienced: AI Optimization Engineer: to support large-scale AI/ML and Generative AI workloads for an enterprise environment. This role focuses on optimizing, deploying, and managing machine learning and large language models (LLMs) on GPU-accelerated HPC infrastructure. The ideal candidate will have strong experience in Python-based machine learning, deep learning frameworks, model optimization techniques, and scalable AI infrastructure. The engineer will work closely with AI, infrastructure, and DevOps teams to design efficient model training and inference pipelines, implement SLURM-based workload orchestration, and deploy containerized ML solutions in production environments. Responsibilities include optimizing model performance using techniques such as pruning, quantization, and knowledge distillation, managing inference workflows using Triton Inference Server, and monitoring system performance using Prometheus and Grafana. This role requires hands-on experience with HPC environments, GPU clusters, containerization technologies, and Linux system administration, along with strong know...