DataJobs.io
← Back to all jobs

Job Description

Amazon.com Services LLC is seeking a Senior Data Engineer to join the APIX squad in Seattle, onsite. This role centers on owning the team’s data architecture and delivering scalable, AI-ready data infrastructure across Central Lakehouse, Cortex, and CMM, while mentoring engineers and shaping governance standards. The position offers a salary range of USD 154,600 to 209,100 per year and requires a minimum of five years of data engineering experience.

Responsibilities

  • Own the team’s data architecture with a system-wide perspective, anticipating data access patterns and proactively removing bottlenecks across Central Lakehouse, Cortex, and CMM.
  • Design and deliver large-scale data solutions that are secure, maintainable, scalable, and extensible, enabling others to contribute and build on your work.
  • Lead architectural improvements that simplify complex data systems, addressing deficiencies where your architecture bottlenecks other teams across PXT’s 20 data lakes.
  • Make architectural trade-offs such as build versus buy, tiered storage strategies, and data abstraction patterns, balancing short-term needs with long-term business requirements for Amazon’s people data ecosystem.
  • Tackle ambiguous problems and steer technical strategy in areas where Golden Dataset onboarding, metadata enrichment, and AI contextualization require direction.
  • Identify and resolve complex data engineering challenges including data duplication across 264 redundant warehouses, inconsistent metric definitions, and governance gaps across federated data lakes.
  • Influence team technical and business strategy for PXT Data Strategy workstreams, providing context for current and future technology choices in AWS-first data platform adoption.
  • Build consensus when views diverge on data-architecture approaches, exercising judgment on when to leverage existing solutions versus building new capabilities.
  • Deliver high-impact data solutions at Amazon scale by designing scalable pipelines, ETL processes, and data abstraction layers supporting the Central Lakehouse, Cortex Data Plane APIs, and self-service CMM capabilities.
  • Architect solutions handling large volumes of people data across 17,000+ applications, optimizing data quality, availability, latency, security, performance, and integrity.
  • Reduce manual data preparation effort by 60-80% through intelligent data vending, contextualized metadata, and automated dataset onboarding workflows.
  • Deliver data infrastructure that enables AI-powered insights (Clarity Assist, Quick Suite integration) with over 90% query accuracy.
  • Drive engineering best practices and governance by setting standards for data discovery, naming conventions, operational excellence, data security, and code quality across PXT data teams.
  • Lead implementation of systematic governance through integration with FPDS primitives (DISAPERE, Maple, UBX), enabling policy-driven data classification, automated depersonalization, and cell-level access control.
  • Collaborate with AWS BDT, Security, and FPDS teams to influence roadmaps for SageMaker Unified Studio, Andes External Tables, and Quick Suite integration, addressing multiple feature gaps.
  • Ensure all data solutions comply with Amazon’s privacy standards, GDPR/DSAR requirements, and Red certification processes for sensitive people data.

Requirements

  • 5+ years of data engineering experience
  • Experience with data modeling, warehousing and building ETL pipelines
  • Experience with SQL
  • Experience in at least one modern scripting or programming language, such as Python, Java, Scala, or NodeJS
  • Experience mentoring team members on best practices
  • Experience with big data technologies such as Hadoop, Hive, Spark, EMR
  • Experience operating large data warehouses

Technologies

  • Python
  • Java
  • Scala
  • NodeJS
  • Hadoop
  • Hive
  • Spark
  • EMR
  • Andes
  • Athena
  • Glue
  • Redshift
  • SageMaker Unified Studio
  • FPDS primitives (DISAPERE, Maple, UBX)
  • Andes External Tables
  • Cortex Data Plane APIs
  • Quick Suite
  • QuickSight

Benefits

  • Health insurance
  • 401(k) matching
  • Paid time off
  • Parental leave
  • Sign-on payments
  • Restricted stock units (RSUs)
  • Adoption and Surrogacy Reimbursement coverage
  • Basic Life & AD&D insurance
  • Option for Supplemental life plans
  • Employee Assistance Program (EAP)
  • Mental Health Support
  • Medical Advice Line
  • Flexible Spending Accounts

About the Team

Meet the behind the scenes team that enables our Operations and Human Resource Leaders to make informed decisions. The Amazon Clarity team builds reporting and analytics tools for our teams that fulfill customer promise every day. Whether it is Fulfillment Center team that delivers your Prime order in two days, our Amazon Locker team that lets you pick up your package anytime that is convenient for you, our Prime Now team getting you lunch in under an hour, or one of many more, the PeopleInsight

Similar Jobs

Get Job Alerts

New jobs delivered to your inbox.