3h ago
Biology Data Quality Engineer
Paris - Berlin - London - EU Remote
full-timemid Remotebiotechnology
Tech Stack
Description
You will ensure the quality and consistency of complex biological datasets used to train AI foundation models. You'll develop validation protocols, standardize data, collaborate with R&D, and implement automated pipelines to maintain data integrity for biomedical breakthroughs.
Requirements
- MSc in Biology, Computational Biology, or Bioinformatics
- Deep understanding of transcriptomics data types and quality considerations
- Proven experience in data quality control and validation pipelines
- Proficiency in Python and data visualization libraries (e.g., matplotlib)
- Strong analytical and problem-solving skills
Responsibilities
- Develop and implement data validation protocols for histology, omics, and clinical datasets
- Design and automate data quality pipelines to identify issues early
- Establish and enforce data standardization practices for machine learning
- Collaborate with R&D team and external data providers on data quality
- Evaluate and validate external public data sources for training models
0 views 0 saves 0 applications