JobMesh

Senior AI Engineer

Openchip & Software Technologies · BE

We are looking for an outstanding individual with expertise in training large-scale AI models, including Large Language Models (LLMs) or equivalent architect...

Job description

We are looking for an outstanding individual with expertise in training large-scale AI models, including Large Language Models (LLMs) or equivalent architectures. This person will play a key role in the development and optimization of AI workloads, collaborating closely with hardware and software engineers in a co-design environment. Key Responsibilities: · Design, train, and optimize large-scale AI models, including LLMs and similar architectures. · Collaborate with hardware and software engineers to ensure efficient model deployment on custom silicon solutions. · Develop new techniques for improving training efficiency, model accuracy, and hardware utilization. · Implement distributed training strategies for large-scale models across high-performance computing infrastructures. · Strong programming skills in Python, (Go, C++,…is considered a plus) · Explore and implement advanced AI/ML techniques, such as sparsity, quantization, and knowledge distillation. · Work with state-of-the-art frameworks such as TensorFlow, PyTorch, JAX, and Hugging Face Transformers. · Conduct performance benchmarking and profiling of ML workloads on custom accelerators. · Stay up to date with advancement...