Senior Data Engineer
Paris, France
About Us
We're partnered with an AI-driven startup with 5M+ funding, developing generative AI to discover new materials and reduce CO emissions in carbon-intensive industries.
The Role
Reporting to the CTO, you'll lead two key initiatives :
- Design and build scalable data infrastructure integrating diverse sources (text, simulations, experiments) in support of ML and LLM applications.
- Develop internal tools enabling AI-enhanced data access and foster a data-centric culture
Key Responsibilities
Build optimized data pipelines for simulation, textual, and experimental dataImplement secure, scalable data storage systems supporting ML workflowsCreate automation tools for data processingEstablish data governance policies and lineage trackingCollaborate with DevOps on cloud infrastructure integrationPartner with scientists to enable data-driven decision makingContribute to open-source projectsRequirements
Master's or PhD in Computer Science or related field7+ years of data engineering experienceProficiency in multiple programming languages (Python, Rust, Scala, or Go)Strong SQL and NoSQL database experienceData modeling, ETL, and warehousing expertiseCloud platform experience (AWS / GCP) and infrastructure-as-codeExcellent English communication skillsNice-to-Have
ML pipeline and AI infrastructure experienceOpen-source contributionsFamiliarity with scientific data, especially materials scienceBenefits
Competitive salaryEquity package (BSPCE)Comprehensive health insurance (Alan Blue)Standard French PTOFlexible work environment