Remotees is for sale. Submit your bid to hello AT remotees DOT com if you’re interested.

Data Infrastructure Engineer: Big Data, Functional Programming, Drug Discovery

Empirico · Feb 10th 2021

Apply on StackOverflow Careers

Empirico, an early-stage biotechnology company, is looking for a talented software engineer that is motivated by the opportunity to build scalable data systems that power the discovery of new medicines. You will work closely with a team of engineers and computational scientists to build and extend Empirico’s data infrastructure, which include modern cloud-based systems and services that operate on some of the largest biological datasets in the world.


Your responsibilities will focus around designing and implementing robust and extensible data systems. You will be expected to:

  • Design and implement scalable data infrastructure and pipelines

  • Implement scalable algorithms in a distributed systems setting

  • Collaborate closely with an interdisciplinary team of scientists and engineers to address

system pain points

  • Improve developer efficiency and system quality through emphasis on elegant code

  • Advocate for systems and engineering practice improvements


  • 2+ years professional experience designing and developing software on modern distributed data systems

  • Experience processing and analyzing large and heterogeneous datasets

  • Strong technical skill set that spans a broad range of technologies, programming languages,

and paradigms

  • Passionate about systems thinking and drive towards elegant and automated solutions to

data problems

  • Experience with Spark and Scala or other functional programming language is a plus

  • Applicants must have authorization to work in the United States

Apply on StackOverflow Careers