Senior Data Engineer - Analytics
Job Description
Senior Data Engineer - Analytics role focused on designing data models, transformation pipelines, and APIs to power dashboards, ML features, and downstream systems.
Responsibilities
- Architect and maintain end-to-end ELT/ETL pipelines using dbt Core and SQL targeted at BigQuery.
- Create modular SQL models and Python-based transformations to support analytics, reporting, and ML feature generation.
- Establish data quality checks, lineage, and observability to meet analytics SLAs.
- Partner with product, analytics, and ML teams to define metric definitions and translate business requirements into efficient data models.
- Develop and maintain RESTful APIs and integrations to surface curated datasets and features for internal and external consumers; integrate LLM APIs where applicable.
- Deploy and monitor data services and lightweight API endpoints on Google Cloud Platform using Cloud Run and other serverless options as appropriate.
- Tune BigQuery performance and cost through partitioning, clustering, and query optimization.
- Document data models, transformation logic, and runbooks; mentor teammates on dbt, SQL, and analytics engineering best practices.
Requirements
- 3+ years of experience in analytics engineering, data engineering, or a related role building analytics pipelines and data models.
- Experience working with Healthcare Claims Data.
- Expert proficiency in SQL and strong experience with Python for data transformation, orchestration, or testing.
- Proven experience using dbt Core to build modular, tested analytics transformations and manage deployments.
- Solid experience with Google Cloud Platform, especially BigQuery, including query optimization and cost management.
- Experience building and integrating APIs; familiarity with LLM APIs and integrating large language model outputs into analytics or product workflows.
- Strong understanding of data modeling concepts, ETL/ELT patterns, data quality practices, and observability.
- Excellent communication skills and ability to collaborate across cross-functional teams to operationalize analytics.
- Nice to have: hands-on experience with Cloud Run, Vertex AI, and FastAPI for serving data or ML features; domain knowledge of healthcare claims and related data models.
- Candidates must be authorized to work in the United States without visa sponsorship.
Technologies
- dbt Core
- SQL
- BigQuery
- Python
- LLM APIs
- Google Cloud Platform
- Cloud Run
- Vertex AI
- FastAPI
- RESTful APIs