Graduate Internship: Human-in-the-Loop Named Entity Recognition for Biological Data Curation
University of Florida · Gainesville, Florida, US
Job description
Classification Title: Graduate Internship: Human-in-the-Loop Named Entity Recognition for Biological Data Curation

Classification Minimum Requirements: Currently enrolled in good standing in a UF graduate program.

Job Description: The George A. Smathers Libraries is offering a graduate internship supervised by Dr. Borui Zhang in collaboration with Dr. Jonathan Nations (Florida Museum of Natural History). The project focuses on building an open-access, machine-readable database of mammalian dietary information extracted from decades of unstructured scientific literature. It integrates text data mining, NLP pipeline engineering, and biodiversity science. The graduate intern will construct and validate ground-truth datasets for an AI-driven data extraction pipeline supporting this effort. This internship is funded through the Smathers Graduate Internship Program.

RESPONSIBILITIES:
The intern will:
- Use authorized APIs to identify and retrieve relevant mammalian dietary literature, with emphasis on Mammalian Species Accounts.
- Apply the project’s NER (Named Entity Recognition) pipeline to extract consumer (mammal species) and food class entities from full-text publications.
- Systema...