The Data Engineer I role is an onsite position based in Seattle, WA, focused on designing and building data models and pipelines to enable insights through AWS AI tools. The role collaborates with program managers and data engineers to ensure data quality and efficient resource usage.
Responsibilities
- Develop and refine logical data models and pipelines to support complex datasets and analytics scenarios
- Ensure ongoing data quality and enable self-service access to datasets for business intelligence, AI based tools, and Model Context Protocols (MCP)
- Enhance metadata to improve AI results
- Collaborate with Program Managers, Technical Program Managers, and other Data Engineers to design stable, high‑performing data solutions
- Produce secure, reliable, testable, and maintainable code with minimal defects
- Apply appropriate data design approaches and balance trade-offs without over‑engineering
- Optimize resource usage across system hardware, data storage, query performance, and AWS infrastructure
- Actively participate in code reviews, design discussions, and team planning
- Identify and resolve root causes of complex problems while balancing customer requirements with team needs
- Stay current on distributed systems technologies including MapReduce, MPP architectures, and NoSQL databases
Requirements
- 2+ years of data engineering experience
- Experience with data modeling, warehousing, and building ETL pipelines
- Bachelor's degree or higher in a quantitative/technical field such as computer science, engineering, or statistics
- Knowledge of basics of designing and implementing a data schema such as normalization, relational model vs dimensional model
- Knowledge of software engineering best practices across the development life cycle, including agile methodologies, coding standards, code reviews, source management, build processes, testing, and operations
Technologies
- MapReduce
- MPP architectures
- NoSQL databases
- Hadoop
- Hive
- Spark
- EMR
- Informatica
- ODI
- SSIS
- BODI
- Datastage
- Redshift
- Oracle
- AWS Infrastructure
- Tableau
- AWS QuickSight
- Cognos
Benefits
- Sign-on payments
- Restricted stock units (RSUs)
- Health insurance including medical, dental, vision, prescriptions; Basic Life and AD&D; option for Supplemental life plans; Employee Assistance Program; Mental Health Support; Medical Advice Line; Flexible Spending Accounts; Adoption and Surrogacy Reimbursement coverage
- 401(k) matching
- Paid time off
- Parental leave
Description
Key duties include designing, building, and optimizing logical data models and data pipelines for complex datasets; maintaining ongoing data quality and providing self-service access to datasets for business intelligence, AI based tools, and Model Context Protocols (MCP); improving metadata to support AI outcomes. The role collaborates with Program Managers, Technical Program Managers, and other Data Engineers to create stable, performant data solutions; writes secure, stable, testable, and maintainable code with minimal defects; applies appropriate data design approaches and makes prudent trade-offs without over-engineering; optimizes resource usage across hardware, storage, query efficiency, and AWS infrastructure. Active participation in code reviews, design discussions, and team planning is expected, along with diagnosing root causes of complex problems and staying current with distributed systems technologies such as MapReduce, MPP architectures, and NoSQL databases.