JobMesh

Senior Technical Program Manager, Cloud Infrastructure, Observability and Systems Monitoring

NVIDIA · Santa Clara, California, US

NVIDIA's deep learning platforms lead innovation, significantly influencing multiple fields and embraced by top academic institutions, startups, and major In...

Job description

NVIDIA's deep learning platforms lead innovation, significantly influencing multiple fields and embraced by top academic institutions, startups, and major Internet companies worldwide. We're looking for a seasoned and highly skilled Principal Technical Program Manager (TPM) to join our NVIDIA DGX Cloud team. This is an exciting opportunity for a passionate, driven, and creative individual to provide outstanding value to our DGX Cloud customers. We are seeking a TPM who has deep knowledge of observability, systems telemetry, and cloud infrastructure operations. You will play a key role collaborating with hardware/software supply teams, DGXC operations, and external Cloud Service Providers (CSPs and NCPs). Together, you will develop unified telemetry mentorship and confirm operational readiness worldwide. What you'll be doing: As a DGX Cloud Principal Technical Program Manager, you will work closely with our Engineering, Infrastructure, and Software teams. You will lead important programs focused on telemetry and data center fleet health & management. Your role will be essential in developing core capabilities for DGX Cloud. You will make sure that operations and advanced tenants rec...