We are seeking a Python-focused Data Engineer to bridge the gap between data infrastructure and data science. In this role, you will lead the standardization and optimization of our data pipelines, specifically built for data science and machine learning projects. You will introduce modern technology stacks and MLOps practices to ensure our models are robust, scalable, and efficiently deployed into production environments.
Key Responsibilities
- Work on the company's data pipeline, contributing heavily to its standardization, development, and optimization.
- Build clean Python abstractions and automated pipelines to optimize data ingestion and preprocessing for analytics and machine learning.
- Enforce rigorous data quality initiatives, testing, and validations to ensure data integrity across upstream and downstream systems.
- Collaborate with data scientists to architect, package, configure, and deploy machine learning models using MLOps frameworks.
- Evaluate and integrate new technologies to leverage and scale our current data science engineering stack.
- Generate actionable insights for business and infrastructure improvements.