DataJobs.io
← Back to all jobs

Job Description

The Senior Scientific Data Engineer at Lawrence Berkeley National Laboratory, working with the Joint Genome Institute (JGI), will design and operate core scientific data systems, data workflows, and AI-ready pipelines, emphasizing data management, job orchestration, and platform integration. This role is based in the San Francisco Bay Area with a hybrid work arrangement.

Responsibilities

  • Develop and enhance JGI's core scientific data and compute capabilities as part of a skilled engineering team.
  • Design, implement, and deploy production automated systems, APIs, and workflows that support genomic data movement, metadata management, job orchestration, data access, and large-scale scientific computing.
  • Identify and resolve technical issues and integration gaps while driving ongoing system improvements.
  • Strengthen the reliability, scalability, observability, interoperability, and maintainability of shared production data systems while supporting sustainable operations and delivery.
  • Promote engineering best practices through technical reviews, knowledge sharing, and team process optimization.

Requirements

  • A Bachelor's Degree (or equivalent knowledge/training) in Computer Science or a related field and a minimum of 8 years of related professional experience developing, integrating, deploying, and operating production software and data systems that support metadata management, workflow orchestration, data lifecycle operations, and broad user data access or an equivalent combination of education and professional experience.
  • Strong knowledge of software and data engineering fundamentals relevant to data-intensive distributed systems, including system design, concurrency, performance, and testing.
  • Experience with database and data storage technologies including relational databases, object storage, and systems for managing structured, semi-structured, and large-scale data.
  • Experience with data engineering and event-driven technologies such as Airflow or Kafka.
  • Experience effectively using AI coding agents such as Claude Code, Codex, Cursor, including demonstrated judgment in reviewing and validating generated software for correctness, quality, security, maintainability, and suitability for production use.
  • Proficiency in Python and experience with one or more additional programming languages.
  • Excellent communication skills, including experience organizing and presenting complex technical information to internal teams and stakeholders.
  • Demonstrated ability to work effectively with users, stakeholders, and engineering teams to deliver technical results in a complex, interdisciplinary environment.

Technologies

  • Python
  • Airflow
  • Kafka
  • Claude Code
  • Codex
  • Cursor
  • WDL
  • Nextflow

Benefits

  • Exceptional health and retirement benefits, including pension or 401K-style plans
  • A culture where you’ll belong; we are invested in our teams
  • Winter Holiday Shutdown every year
  • Parental bonding leave (for both mothers and fathers)
  • Pet insurance
  • Relocation assistance

Desired Qualifications

  • A Master’s Degree (or equivalent knowledge/training) in Computer Science or a related field.
  • Experience working with genomics, bioinformatics, and/or next-generation sequencing data.
  • Experience with scientific workflow languages or workflow systems such as WDL and Nextflow.
  • Experience with full-stack or front-end application development.
  • Experience working in High Performance Computing (HPC) environments.

Additional Information

  • Application Date: Priority consideration will be given to candidates who apply with a resume and cover letter by June 1, 2026. Applications will be accepted until the job posting is removed.
  • Appointment Type: Full time, exempt from overtime pay, monthly paid, two-year term appointment with benefits eligibility, with potential for extension or conversion to a Career appointment based on performance, funding, and operational needs.
  • Salary Range: Budgeted salary range is $139,440 to $174,312 annually, within the broader range of $139,440 to $235,308 for job code C71.3; final offer depends on qualifications and experience.
  • Background Check: The position is subject to a background check; convictions will be evaluated for relevance to the role. A prior conviction does not automatically disqualify an applicant.
  • Work Modality: Hybrid work schedule with on-site presence at 1 Cyclotron Road, Berkeley, CA 94720; residency within 150 miles is required; some cases of full-remote work may be considered. Real ID or equivalent identification is required.
  • Relocation Assistance: Eligible for relocation support.
  • Work Authorization: Applicants must be legally authorized to work in the United States. Berkeley Lab does not provide visa sponsorship for this position.
  • Misconduct Disclosure Requirement: Finalists must disclose any misconduct-related administrative or judicial decisions within the last seven years; disclosure is a condition of employment.

Who is JGI?

The Joint Genome Institute (JGI) is a global leader in genome science, enabling biological discovery through advanced genomic capabilities, expert support, and large-scale, AI-ready data resources. As a DOE Office of Science user facility supported by BER, JGI advances BER's mission to achieve a predictive understanding of complex biological, Earth, and environmental systems to support the nation’s energy and infrastructure needs.

Why join Berkeley Lab?

  • Exceptional health and retirement benefits, including pension or 401K-style plans
  • A culture where you belong; we are invested in our teams
  • Winter Holiday Shutdown every year
  • Parental bonding leave (for mothers and fathers)
  • Pet insurance

Similar Jobs

Get Job Alerts

New jobs delivered to your inbox.