Research Scientist Intern, Multimodal Generative AI and Robotics (PhD)
Meta · Redmond, Washington, US
Job description
The Meta Reality Labs Research Team brings together a world-class team of researchers, developers, and engineers to create the future of contextual AI and robotics. The Surreal Vision group at RL Research is seeking exceptional Research Scientists to research and help build the egocentric machine perception functionalities that will underpin future contextual AI-enabled devices. The research intern will work on cutting-edge research problems to develop novel computer vision and machine learning techniques.

Work with researchers to advance frontier generative AI in the following areas:
- Develop unified predictive models that integrate language, vision, human motion, and actions.
- Investigate techniques to enable long-horizon, consistent, and physically grounded generation.
- Benchmark against state-of-the-art approaches in world modeling, video generation, and vision–language–action models.
- Leverage multimodal generation to accelerate robot learning and control.

Build contextual and embodied AI models using large-scale egocentric multimodal datasets.

Our internships are twelve (12) to twenty-four (24) weeks long, and we have various start dates throughout the year. Some projects may r...