Hunyuan Multimodal Algorithm Researcher Intern (Omni-Modal)
Tencent · Palo Alto, California, US
Job description
What the Role Entails:
1. Conduct research and development of Omni multimodal large models, including the design and construction of training data, foundational model algorithm design, optimization related to pre-training/SFT/RL, model capability evaluation, and exploration of downstream application scenarios.
2. Scientifically analyze challenges in R&D, identify bottlenecks in model performance, and devise solutions based on first principles to accelerate model development and iteration, ensuring competitiveness and leading-edge performance.
3. Explore diverse paradigms for achieving Omni-modal understanding and generation capabilities, research next-generation model architectures, and push the boundaries of multimodal models.

Who We Look For:
1. Bachelor’s degree (full-time preferred) or higher in Computer Science, Artificial Intelligence, Mathematics, or related fields; graduate degrees are prioritized.
2. Hands-on experience in large-scale multimodal data processing and high-quality data generation is highly preferred.
3. Solid foundation in deep learning algorithms and practical experience in large model development; familiarity with Diffusion Models and Autoregressive...