16h ago
Member of Engineering (Evaluations/Engineering)
Remote (EMEA/East Coast)
โจ $180k-$280k / yearest.
full-timesenior Remoteai-ml
๐ Tech Stack
๐ผ About This Role
You'll design and implement a scalable self-serve evaluation platform for a frontier AI company. Your work will directly power experimentation and quality for building AGI. This role offers fully remote work across Europe and North America.
๐ฏ What You'll Do
- Design a Python framework for centralized benchmark implementation
- Build and maintain distributed evaluation pipelines at scale
- Collaborate with modeling and product teams to improve tooling
๐ Requirements
- Strong engineering background with experience in software projects
- Experience building highly reliable services and distributed systems
- Experience with data pipelines, message queues, and cloud services
- Proficiency in monitoring and alerting tools like Grafana, Prometheus
โจ Nice to Have
- Experience designing frameworks or tooling for developers
- Product mindset towards building developer-facing software
- Experience with ML ops and data visualization platforms
๐ Benefits & Perks
- ๐๏ธ Fully remote work & flexible hours
- โ๏ธ 37 days/year of vacation & holidays
- ๐ฅ Health insurance allowance for you and dependents
- ๐ฐ Wellbeing and home office allowances
- ๐ฅ Great diverse & inclusive people-first culture
๐จ Hiring Process
Estimated timeline: 2-4 weeks
- 1Intro call with Founding Engineerยท 30 min
- 2Technical Interview(s)ยท 60 min
- 3Team fit call with People teamยท 30 min
- 4Final interview with Founding Engineerยท 45 min
0 0 0