
Virtual Platform Software Engineer, Annapurna Labs Machine Learning Accelerators, AWS

Amazon.com · Cupertino, California, US

Job description

AWS's Trainium and Inferentia chips power the world's largest machine learning clusters. Our team builds virtual platforms (full-system C++ and SystemC models of these custom SoCs) that let software teams start development months before silicon arrives. For Trainium3, our virtual platform enabled running a full training workload within 12 hours of first silicon. We're looking for a software engineer to build and own the models and infrastructure that make this possible.

What you'll do:

- Build and own functional models of SoC subsystems that integrate into our full-system virtual platform, used by firmware, driver, runtime, and application software teams
- Design models for usability and performance: your customers are software engineers who need to run real workloads on your platform efficiently
- Develop and improve the virtual platform infrastructure: QEMU integration, simulation performance, build and release tooling, and customer-facing documentation
- Work with software teams (your primary customers) to understand their workflows, debug issues on the platform, and shape the model to maximize their productivity
- Drive simulation performance improvements so the platform can...