Software Engineer, Safeguards

San Francisco, CA | New York City, NY

$320,000-$425,000 / year

Full-time | Senior | Artificial Intelligence | Visa Sponsor

Description

You will build safety and oversight mechanisms for AI systems, focusing on detecting unwanted model behaviors and preventing misuse. You'll develop monitoring systems, abuse detection infrastructure, and multi-layered defenses to ensure user well-being and enforce acceptable use policies.

Requirements

  • Bachelor's degree in Computer Science or Software Engineering, or comparable experience
  • 5-10+ years of software engineering experience, preferably in integrity, spam, fraud, or abuse detection
  • Proficiency in Python and TypeScript
  • Ability to work across the stack
  • Strong communication skills to explain complex technical concepts to non-technical stakeholders

Responsibilities

  • Develop monitoring systems to detect unwanted behaviors from API partners and trigger automated enforcement or manual review
  • Build abuse detection mechanisms and infrastructure
  • Surface abuse patterns to research teams so they can harden models at the training stage
  • Build robust, multi-layered defenses for real-time improvement of safety mechanisms at scale