22h ago
Web Scraping Engineer
Indonesia
$179.4k-$224.4k / year
Contract · Remote · Government
About This Role
You'll build and maintain async scrapers for European public procurement portals, handling real-world challenges like CAPTCHAs, session management, and messy HTML. Your core impact is ensuring reliable data extraction from 100+ diverse sources for the data backbone of European procurement. This role stands out with a hands-on technical assessment that simulates real-world scraping problems.
What You'll Do
- Build and maintain async scrapers using Python and Playwright
- Handle session management, anti-bot measures, and IP rotation
- Parse Italian data formats and document types (PDF, .p7m, ZIP)
- Integrate scrapers into Prefect orchestration pipeline
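As a rough illustration of what running async scrapers across many sources can involve, here is a minimal sketch of bounded concurrency using only asyncio. The names `scrape_all`, `scrape_one`, and `max_concurrency` are hypothetical and not taken from the employer's codebase; in practice `scrape_one` would wrap a Playwright session for a single portal.

```python
import asyncio


async def scrape_all(sources, scrape_one, max_concurrency=5):
    """Run `scrape_one(source)` for every source, at most `max_concurrency` at a time.

    Hypothetical helper: `scrape_one` is any async callable taking one source.
    """
    semaphore = asyncio.Semaphore(max_concurrency)

    async def guarded(source):
        async with semaphore:
            try:
                return source, await scrape_one(source)
            except Exception as exc:
                # One failing portal should not stop the rest; record the error.
                return source, exc

    # Results map each source to its scraped value or to the exception raised.
    return dict(await asyncio.gather(*(guarded(s) for s in sources)))
```

Returning exceptions as values is one possible design choice: it keeps a single broken portal from cancelling the whole batch and leaves the decision to retry or alert to the caller.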
Requirements
- Strong async Python with asyncio proficiency
- Experience with Playwright or Selenium for headless browsers
- Resilience mindset with retry and backoff patterns
- Comfort with messy HTML and multi-strategy extraction
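For candidates unsure what "retry and backoff patterns" means in practice, the following is a minimal sketch of exponential backoff with jitter around an async call. The function name `retry_with_backoff` and all of its parameters are assumptions made for illustration, not part of any named library or of the employer's code.

```python
import asyncio
import random


async def retry_with_backoff(fetch, *, attempts=4, base_delay=0.5, max_delay=8.0,
                             retry_on=(ConnectionError, TimeoutError),
                             sleep=asyncio.sleep):
    """Await `fetch()` until it succeeds or `attempts` tries are used up.

    Hypothetical helper: `fetch` is any zero-argument async callable.
    """
    for attempt in range(attempts):
        try:
            return await fetch()
        except retry_on:
            if attempt == attempts - 1:
                raise  # out of attempts: surface the last error
            # Exponential backoff capped at max_delay, with full jitter so
            # many scrapers do not retry against a portal in lockstep.
            delay = min(max_delay, base_delay * 2 ** attempt)
            await sleep(random.uniform(0, delay))
```

The `sleep` parameter exists only so the delay can be replaced in tests; real code would normally leave it at its default.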
Nice to Have
- Experience with Italian PA portals and data standards (ANAC, PVL, OCDS)
- Italian locale date and currency parsing
- Document processing with PDF, .p7m, OCR fallback
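To make "Italian locale date and currency parsing" concrete, here is a small sketch that avoids depending on system locales. It assumes amounts use "." for thousands and "," for decimals, and dates appear either as dd/mm/yyyy or as day, Italian month name, year. The function names are hypothetical, and real portals will likely need more formats than these.

```python
import re
from datetime import date
from decimal import Decimal

ITALIAN_MONTHS = {
    "gennaio": 1, "febbraio": 2, "marzo": 3, "aprile": 4,
    "maggio": 5, "giugno": 6, "luglio": 7, "agosto": 8,
    "settembre": 9, "ottobre": 10, "novembre": 11, "dicembre": 12,
}


def parse_italian_amount(text: str) -> Decimal:
    """Parse amounts like '€ 1.234.567,89' ('.' groups thousands, ',' is decimal)."""
    cleaned = re.sub(r"[^\d.,-]", "", text)  # drop currency symbols and spaces
    return Decimal(cleaned.replace(".", "").replace(",", "."))


def parse_italian_date(text: str) -> date:
    """Parse 'dd/mm/yyyy' or 'd mese yyyy' (for example '5 marzo 2024')."""
    text = text.strip().lower()
    numeric = re.fullmatch(r"(\d{1,2})[/.-](\d{1,2})[/.-](\d{4})", text)
    if numeric:
        day, month, year = map(int, numeric.groups())
        return date(year, month, day)
    worded = re.fullmatch(r"(\d{1,2})\s+([a-z]+)\s+(\d{4})", text)
    if worded and worded.group(2) in ITALIAN_MONTHS:
        day, month_name, year = worded.groups()
        return date(int(year), ITALIAN_MONTHS[month_name], int(day))
    raise ValueError(f"unrecognised Italian date: {text!r}")
```

Using `Decimal` rather than `float` keeps monetary values exact, which matters when tender amounts are later compared or summed.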
Benefits & Perks
- Competitive salary in IDR with bonus
- Remote work from Indonesia
- Hands-on technical assessment instead of whiteboard
Hiring Process
Estimated timeline: 2-3 weeks · AI estimate
- 1. Technical Assessment · estimated 2-3 hours
- 2. Code Review · 1 hour
- 3. Team Fit · 30 min
Heads Up
- Contract role listed as Indonesia remote
- Compensation in IDR for European procurement role
- Multiple anti-bot challenges suggesting difficult scraping