LMI is seeking a data engineer to develop, test, maintain, and augment data pipelines and database setups in support of a supply chain risk program for the Department of Defense.
Responsibilities
Set up segmented databases to control access to various datasets and data products
Build data pipelines in PySpark using data engineering best practices for testing, validation, automation, and version control
Coordinate closely with SMEs and front-end developers to generate data tables required for UI execution
Build code to execute pulls from APIs, allowing access to external data to be pulled into the client environment
Interact with partners and data providers to understand their data holdings, methods for moving data into the CDAO-Advana platform, and data usage agreements
Self-starter, and eager to learn and work in a team environment
Qualifications
Bachelor’s degree in Computer Science or related field and/or equivalent work experience
3-10 years of experience as a data engineer
Experience with:
Agile-based development
Python, Spark, notebooks
Data Bricks or similar environment
Effective analytical, conceptual, and problem-solving skills
Active DoD Secret clearance
Fully remote capability acceptable
Desired Qualifications
Active DoD Top Secret clearance
Part-time onsite client support in the Washington, DC metro area