Data Engineer, PXT Central Science
Job Description
Join the PXT Central Science team at Amazon.com Services LLC in an onsite role based in Seattle. This role centers on building scalable data pipelines, productionizing models, and delivering analytics that influence employee sentiment and business outcomes. The position offers a comprehensive benefits package and a culture that values collaboration, scientific rigor, and impact. Salary range: USD 132,100 - 205,600 per year.
Benefits
- Health insurance
- 401(k) matching
- Paid time off
- Parental leave
- Sign-on payments
- Restricted stock units (RSUs)
- Flexible Spending Accounts
- Adoption and Surrogacy Reimbursement
- Employee Assistance Program (EAP)
- Mental Health Support
Responsibilities
- Design and maintain scalable data pipelines using native AWS services such as Glue, EMR, and Lambda; implement monitoring and error handling for data workflows; optimize performance, reliability, and cost efficiency.
- Productionize science models by building APIs and data serving layers for downstream consumption; create batch and real-time inference pipelines.
- Build scalable feature extraction and processing frameworks for diverse data types; implement robust data quality checks and validation; design flexible schemas that support evolving requirements.
- Partner with economists, data scientists, and software engineers to translate analytical needs into production-ready solutions; participate in technical design reviews and architecture discussions.
- Maintain layered data systems used by economists and scientists; develop automated reporting solutions; work across multiple interconnected AWS accounts with security best practices.
Requirements
- Knowledge of professional software engineering and best practices for the full software development life cycle, including coding standards, architectures, code reviews, source control, continuous deployment, testing, and operational excellence.
- 3+ years of data engineering experience.
- Experience with at least one modern programming or scripting language such as Python, Java, Scala, or NodeJS.
- Experience with data modeling, warehousing, and building ETL pipelines.
- Experience with AWS technologies including Redshift, S3, Glue, EMR, Kinesis, Firehose, Lambda, and IAM roles and permissions.
- Experience with non-relational databases and data stores (object storage, document or key-value stores, graph databases, or column-family databases).
- Bachelor's degree or foreign equivalent in computer science, engineering, mathematics, or a related field.
Technologies
- Python
- Java
- Scala
- NodeJS
- AWS Glue
- AWS EMR
- AWS Lambda
- Redshift
- S3
- Kinesis
- Firehose
- IAM
- Hadoop
- Hive
- Spark
About the Team
The Central Science team (PXTCS) within Amazon’s People Experience and Technology organization blends economics, behavioral science, statistics, machine learning, and generative AI to proactively identify mechanisms and process improvements that benefit Amazon and enhance the lives, well-being, and value of work of Amazonians. This interdisciplinary group combines science, engineering, and UX to develop and deliver solutions with measurable impact.