Senior ML Solutions Architect - Token Factory

United States
Full-time | Senior | Cloud Computing / AI Infrastructure

Description

In this role, you'll collaborate with clients to design and implement customized LLM-based solutions using Nebius Token Factory's serverless inference platform. You'll build production-ready applications, guide customers from proof-of-concept to production, and work with internal teams to improve the platform based on customer needs.

Requirements

  • 5+ years of experience in ML/AI systems, including 2+ years focused on LLMs and generative AI
  • Deep knowledge of the LLM ecosystem, including model architectures and fine-tuning
  • Hands-on experience with prompt engineering, LLM pipeline development, agentic frameworks (LangChain, LangSmith, smolagents), vector databases, and RAG
  • Experience deploying LLM-powered applications using APIs from OpenAI, Anthropic, or open-source models
  • Strong Python programming skills

Responsibilities

  • Design and implement LLM-based solutions using Nebius Token Factory’s inference services to drive business value
  • Build production-ready applications leveraging serverless LLM APIs, including multimodal models
  • Provide technical expertise in prompt engineering, RAG architectures, model selection, and inference optimization
  • Collaborate with product and engineering teams to surface customer feedback and shape the platform roadmap
  • Guide customers in scaling from POC to production, with a focus on performance, reliability, and cost efficiency