Senior Data Engineer
Job Description
The Senior Data Engineer will design, implement, and maintain scalable production data pipelines for enterprise data platforms hosted in Azure. The role requires hands-on experience with PostgreSQL, Python, and a range of Azure data services to deliver reliable data products.
Overview
In this position you will architect and operate end-to-end data pipelines that support analytics and reporting across enterprise systems. You will work within Azure data environments, collaborating closely with analytics, BI, and engineering teams to ensure data quality, lineage, and governance while contributing to the long term data platform strategy.
Responsibilities
- Design, implement, and maintain scalable production data pipelines that support enterprise data platforms on Azure.
- Conduct exploratory and statistical data analysis, along with data validation to inform analytics and reporting efforts.
- Develop and optimize ETL and ELT workflows, covering job design, orchestration, and performance tuning.
- Build and maintain relational database schemas, SQL queries, and stored procedures.
- Operate within Azure based data environments including Data Factory, Databricks, and Data Lake Storage.
- Collaborate with analytics, BI, and engineering teams to deliver dependable data products.
- Establish monitoring, logging, and observability for data pipelines.
- Maintain documentation, data lineage, and governance standards.
- Participate in architecture discussions and contribute to long term data platform strategy.
Requirements
- Bachelor's degree in Computer Science, Statistics, Mathematics, Information Science, Data Engineering, Finance, or a related field; a Master’s degree is preferred.
- 5+ years of data engineering experience building production data pipelines.
- 5+ years of applied data analysis and statistical work.
- 3+ years of ETL or ELT development experience.
- 3+ years working with relational databases and performance tuning.
- Experience with Azure data services such as Data Factory, Databricks, Data Lake, Synapse, Delta Lake, or similar.
- Strong programming skills in Python, PySpark, SQL, and Bash.
- Experience with PostgreSQL, SAP ASE, or comparable databases.
- Experience with CI/CD pipelines, Git, Kubernetes, and infrastructure-as-code.
- Knowledge of data governance, data cataloging, and access control.
- Experience with monitoring tools such as Datadog, Grafana, or Prometheus.
- Ability to work independently while managing multiple priorities.
- Strong communication and documentation skills.
- Must be eligible to work in the United States.
Technologies
- Azure Data Factory
- Databricks
- Python
- SQL
- ETL/ELT
- Data Lake
- Spark
- Delta Lake
- Synapse
- Data Lake Storage
- PostgreSQL
- SAP ASE
- SAP Data Services
- Power BI
- BusinessObjects
- CI/CD
- Kubernetes
- Git
- PySpark
- Bash
- Kafka
- Grafana
- Datadog
- Prometheus
- Azure Synapse
Location
Reston, Virginia, onsite.
Similar Jobs
N