18h ago
Web Scraping Specialist
Remote
$120k-$160k / year (est.)
full-time · Remote · ai-ml
About This Role
You'll join a lean team building infrastructure that powers AI data extraction at global scale. You will lead efforts to gather and analyze data and to optimize scraping processes. The role offers access to high-quality public web data and the opportunity to shape how accessible open web data is.
What You'll Do
- Write, test, and refine code for extracting data from online sources.
- Handle complexities like pagination and dynamic AJAX-loaded content.
- Clean and format extracted data to meet quality standards.
- Store and manage scraped data in databases, optimizing for speed and integrity.
Requirements
- Python or JavaScript proficiency with libraries like BeautifulSoup, Scrapy, or Selenium.
- Asynchronous programming and multithreading for distributed scraping.
- In-depth knowledge of HTML, CSS, JavaScript, and the DOM.
- Experience with NoSQL databases like MongoDB or Cassandra.
Nice to Have
- Machine learning algorithms for data cleaning or categorization.
- Cloud services (AWS, GCP, Azure) for deploying scraping jobs at scale.
- Open-source contributions related to web scraping or data processing.
Benefits & Perks
- Competitive salary and equity package.
- Flexible remote work culture.
- Opportunity to work on cutting-edge AI infrastructure.
Hiring Process
Estimated timeline: 1-2 weeks · AI estimate
- 1. Recruiter Call · 30 min
- 2. Technical Interview · 60 min
- 3. Take-home Assessment · 2-3 hours