5 days ago
Senior Data Engineer - Technology
Santo Domingo
Full-time, Senior, Remote, Technology Solutions
Description
You will design, build, and optimize high-performance, scalable data pipelines using PySpark and Apache Spark to support advanced analytics and data-driven decision-making. This role involves developing data models, implementing efficient transformations, and collaborating with cross-functional teams to ensure data quality and reliability in large-scale environments.
Requirements
- Strong hands-on experience with Apache Spark, PySpark, and AWS Glue, building scalable data pipelines in distributed environments
- Proven experience designing and maintaining AWS-based data architectures, including S3, Athena, and data warehouse solutions
- Advanced proficiency in Python and SQL, with experience processing and transforming large datasets in relational databases
- Experience with ETL/ELT pipelines, data lakes, and event-driven architectures, ensuring scalable and reliable data workflows
- Strong collaboration, analytical, and problem-solving skills, with experience working in Agile environments and building BI/reporting solutions (e.g., QuickSight)
Responsibilities
- Design, build, and optimize scalable data pipelines using PySpark and Apache Spark for distributed data processing
- Develop and maintain data models, ensuring proper integration of multiple data sources into a centralized data lake and data warehouse
- Implement efficient data transformations and optimize pipelines for performance, scalability, and reliability in large-scale environments
- Create and manage data solutions for analytics, including SQL-based transformations, Amazon Athena views, Apache Iceberg tables, and BI dashboards (Amazon QuickSight)
- Collaborate with cross-functional teams and ensure data quality through monitoring, troubleshooting, and continuous improvement of data workflows