22h ago

Web Scraping Engineer

Indonesia

$179.4k-$224.4k / year

contract Remotegovernment

๐Ÿ›  Tech Stack

๐Ÿ’ผ About This Role

You'll build and maintain async scrapers for European public procurement portals, handling real-world challenges like CAPTCHAs, session management, and messy HTML. Your core impact is ensuring reliable data extraction from 100+ diverse sources for the data backbone of European procurement. This role stands out with a hands-on technical assessment that simulates real-world scraping problems.

๐ŸŽฏ What You'll Do

  • Build and maintain async scrapers using Python and Playwright
  • Handle session management, anti-bot measures, and IP rotation
  • Parse Italian data formats and document types (PDF, .p7m, ZIP)
  • Integrate scrapers into Prefect orchestration pipeline

๐Ÿ“‹ Requirements

  • Strong async Python with asyncio proficiency
  • Experience with Playwright or Selenium for headless browsers
  • Resilience mindset with retry and backoff patterns
  • Comfort with messy HTML and multi-strategy extraction

โœจ Nice to Have

  • Experience with Italian PA portals (ANAC, PVL, OCDS)
  • Italian locale date and currency parsing
  • Document processing with PDF, .p7m, OCR fallback

๐ŸŽ Benefits & Perks

  • ๐Ÿ’ฐ Competitive salary in IDR with bonus
  • ๐Ÿ–๏ธ Remote work from Indonesia
  • ๐Ÿ“ˆ Hands-on technical assessment instead of whiteboard

๐Ÿ“จ Hiring Process

Estimated timeline: 2-3 weeks ยท AI estimate

  1. 1Technical Assessmentยท estimated 2-3 hours
  2. 2Code Reviewยท 1 hour
  3. 3Team Fitยท 30 min

๐Ÿšฉ Heads Up

  • Contract role listed as Indonesia remote
  • Compensation in IDR for European procurement role
  • Multiple anti-bot challenges suggesting difficult scraping
0 0 0