23h ago

Applied Scientist, Computer Vision and Generative Models, Vision AI

Tokyo

$60k-$80k / year (est.)

Internship · Visa Sponsor

💼 About This Role

You'll conduct cutting-edge research on multi-modal foundation models for mobility services at Woven City. You'll build next-generation computer vision products that enhance human-centric mobility. This internship offers mentorship from world-class engineers and a chance to impact real-world products.

🎯 What You'll Do

  • Research and develop new technologies for multi-modal foundation models
  • Conduct large-scale training and evaluation on multiple benchmarks
  • Develop novel algorithms for prototyping and present research findings
  • Disseminate research results internally and externally

📋 Requirements

  • Currently pursuing a Bachelor's, Master's, or PhD in computer science or related field
  • Experience in machine learning, generative models, computer vision, or multi-modal learning
  • Coding experience with Python or C++
  • Experience with ML frameworks such as PyTorch or TensorFlow

✨ Nice to Have

  • Strong publication record in top-tier conferences like CVPR or ICCV
  • Experience with cloud-based environments (AWS, GCP) and modern development tools (Git, Docker)
  • Experience in large-scale dataset creation and distributed training

🎁 Benefits & Perks

  • ✈️ Air tickets for international students
  • 🏠 Temporary housing provided
  • 📄 Visa support for international students