Machine Learning Research Engineer
Etched · Cupertino, California, US
About Etched Etched is building AI chips that are hard-coded for individual model architectures. Our first product (Sohu) only supports transformers, but has...
Job description
About Etched Etched is building AI chips that are hard-coded for individual model architectures. Our first product (Sohu) only supports transformers, but has an order of magnitude more throughput and lower latency than a B200. With Etched ASICs, you can build products that would be impossible with GPUs, like real-time video generation models and extremely deep & parallel chain-of-thought reasoning agents. Etched Labs is the organization within Etched whose mission is to democratize generative AI, pushing the boundaries of what will be possible in a post-Sohu world. Key responsibilities: - Propose and conduct novel research to achieve results on Sohu that are unviable on GPUs - Translate core mathematical operations from the most popular Transformer-based models into maximally performant instruction sequences for Sohu - Develop deep architectural knowledge informing best-in-the-world software performance on Sohu HW, collaborating with HW architects and designers. - Co-design and finetune emerging model architectures for highest efficiency on Sohu - Guide and contribute to the Sohu software stack, performance characterization tools, and runtime abstractions by implementing frontier m...