Data Infrastructure Engineer: Big Data, Functional Programming, Drug Discovery


Empirico, an early-stage biotechnology company, is looking for a talented software engineer who is motivated by the opportunity to build scalable data systems that power the discovery of new medicines. You will work closely with a team of engineers and computational scientists to build and extend Empirico’s data infrastructure, which includes modern cloud-based systems and services that operate on some of the largest biological datasets in the world.

Responsibilities:

Your responsibilities will center on designing and implementing robust, extensible data systems. You will be expected to:

Design and implement scalable data infrastructure and pipelines

Implement scalable algorithms in a distributed systems setting

Collaborate closely with an interdisciplinary team of scientists and engineers to address system pain points

Improve developer efficiency and system quality through an emphasis on elegant code

Advocate for systems and engineering practice improvements

Requirements:

2+ years of professional experience designing and developing software on modern distributed data systems

Experience processing and analyzing large and heterogeneous datasets

Strong technical skill set that spans a broad range of technologies, programming languages, and paradigms

Passion for systems thinking and a drive toward elegant, automated solutions to data problems

Experience with Spark and Scala, or another functional programming language, is a plus

Applicants must have authorization to work in the United States