Data Engineer II, PPE Product Intelligence
Job Description
Join Amazon in Seattle, onsite, as a Data Engineer II on the PPE Product Intelligence team. You will design and maintain scalable data infrastructure and pipelines to support price evaluation models, leveraging the AWS big data stack and collaborating with scientists. The role offers a competitive salary range of USD 132,100 to 178,800 per year.
Benefits
- Health insurance (medical, dental, vision, prescription, Basic Life & AD&D insurance and option for Supplemental life plans, EAP, Mental Health Support, Medical Advice Line, Flexible Spending Accounts, Adoption and Surrogacy Reimbursement coverage)
- 401(k) matching
- Sign-on payments
- Restricted stock units (RSUs)
- Paid time off
- Parental leave
Responsibilities
- Design, implement, and support data warehouse and data lake infrastructure using AWS big data stack, Python, Redshift, QuickSight, Glue/Lake Formation, EMR/Spark, Athena, and related services
- Develop and manage ETLs to source data from various systems and create a unified data model for machine learning models
- Continually explore the latest big data and visualization technologies to enable new capabilities and improve efficiency
- Collaborate closely with a team of scientists to advance price evaluation model research
- Manage multiple requests concurrently, prioritizing work strategically as needed
Requirements
- 3+ years of data engineering experience
- 1+ years of developing and operating large-scale data structures for business intelligence analytics using ETL/ELT processes
- 1+ years of developing and operating large-scale data structures for business intelligence analytics using OLAP technologies
- 1+ years of developing and operating large-scale data structures for business intelligence analytics using data modeling
- 1+ years of developing and operating large-scale data structures for business intelligence analytics using SQL
- Experience with data modeling, warehousing and building ETL pipelines
- Bachelor's degree
Technologies
- AWS bigdata stack
- Python
- Redshift
- QuickSight
- Glue
- Lake Formation
- EMR
- Spark
- Athena
- S3
- Kinesis
- Firehose
- Lambda
- IAM
- SQL