Performance Engineer
Etched · San Jose, California, US
About Etched Etched is building the world’s first AI inference system purpose-built for transformers - delivering over 10x higher performance and dramaticall...
Job description
About Etched Etched is building the world’s first AI inference system purpose-built for transformers - delivering over 10x higher performance and dramatically lower cost and latency than a B200. With Etched ASICs, you can build products that would be impossible with GPUs, like real-time video generation models and extremely deep & parallel chain-of-thought reasoning agents. Backed by hundreds of millions from top-tier investors and staffed by leading engineers, Etched is redefining the infrastructure layer for the fastest growing industry in history. Key responsibilities: Develop comprehensive performance models and projections for Sohu's transformer-specific architecture across varying workloads and configurations Profile and analyze deep learning workloads on Sohu to identify micro-architectural bottlenecks and optimization opportunities Build analytical and simulation-based models to predict performance under different architectural configurations and design trade-offs Collaborate with hardware architects to inform micro-architectural decisions based on workload characteristics and performance analysis Drive hardware/software co-optimization by identifying opportunities where ar...