Senior LLM / RAG Engineer
Peraton · Reston, Virginia, US
Responsibilities Overview We are looking for a Senior‑Level Engineer to lead the development and sustainment of Retrieval‑Augmented Generation (RAG) AI proto...
Job description
Responsibilities Overview: We are looking for a Senior‑Level Engineer to lead the development and sustainment of Retrieval‑Augmented Generation (RAG) AI prototypes for a national‑security mission. You will work directly with customer stakeholders to expand an existing prototype and deliver new LLM‑powered capabilities that help analysts understand and act on large volumes of proprietary data. In this role, you will combine data engineering, model serving, GPU‑based inference, and rapid application development to deliver high‑impact AI tools in a secure environment. Key Responsibilities: - Maintain and extend current RAG prototypes to integrate new datasets and features • Build and optimize data ingest pipelines using Python and AWS services • Develop LLM/embedding pipelines and operate GPU inference workloads • Deploy and manage containerized services in Kubernetes‑like or Docker‑based environments • Implement vector search solutions using modern vector databases • Develop mission‑focused UIs using Streamlit or similar tools for rapid prototyping • Use tools such as sglang, Ray Serve, and LlamaIndex to operationalize LLM capabilities • Collaborate closely with analysts and mission...