Research Prime

Computational Scientist I, Genome Aggregation Database

Organisation Name: Broad Institute
Organisation Type:
City:
State:
Country:

Job Description:

Job Description

Since 2016 the Genome Aggregation Database (gnomAD) has been a pioneer in human genomic data aggregation through the regular public release of data for a rapidly growing collection of exomes and genomes sampled from diverse populations across the globe. gnomAD is the default resource used in virtually every clinical variant interpretation pipeline today and our browser has generated over 39 million page views to date with tens of thousands of regular monthly users.

We are seeking a creative self-motivated candidate at the PhD level to play a critical role in designing and developing fast automated open-source computational pipelines to produce high-quality public data releases for forthcoming and exponentially growing datasets in gnomAD. The role will involve close collaboration with scientists across the Broad to develop novel approaches for quality control and analysis of our highly heterogeneous datasets at exceptional scale as well as the eventual supervision of associate computational scientists in the group who will be assigned to work alongside the candidate. The candidate will also have the opportunity to interact closely with Hail developers at the Broad to play a role in the feature design of the fields most cutting-edge toolkit for massively parallel high-throughput computation of genetic data. As this role involves collaboration with a wide variety of staff across disciplines including computational scientists academic trainees software engineers biologists and clinical geneticists we are specifically looking for a candidate who works well in teams.

As part of the methods development team in the Translational Genomics Group you will have the opportunity to make substantial contributions to high-impact projects with direct implications for clinical practice as well as to participate in the vibrant research environment at the Broad with its close links to MIT Harvard and the Harvard-affiliated hospitals across Boston. You will have access to data sets of extraordinary scale and to colleagues with deep expertise in genetics computational biology software development and machine learning. The responsibilities of this role align closely with the mission of the Broad to transform medicine and human health through cross-disciplinary collaboration and the development of pioneering technologies to analyze scientific data on an unprecedented scale.

We are an Agile team running production and development in a Scrum framework and we care deeply about managing our work well maintaining healthy work-life boundaries and investing in the professional growth of our team members. You will have access to Broads thoughtful and well-resourced leadership development and management training programs in addition to a generous vacation policy and benefits package. We operated on a hybrid remote/in-office work schedule even before the pandemic and we expect to continue this model moving forward and are able to accommodate any candidates living within the New England region.

Growing a strong team with a diversity of life experiences and backgrounds who foster a culture of continual learning and who support the growth and success of one another is key to our success. We are therefore committed to seeking applications from women and from underrepresented groups.

Career development opportunities for this role:

  • Supervising and mentoring associate computational staff scientist(s) assigned to work on gnomAD releases including weekly check-ins quarterly performance reviews and discussions on career development. Management training and hands-on mentorship in this area will be provided depending on candidates previous experience managing others

  • Setting concrete objectives and tasks professional standards and expectations for associate staff scientist(s); helping them to prioritize tasks troubleshoot technical issues locate resources (people and tools) and manage relationships with collaborators

  • Handling general supervisory/HR administrative tasks for associate staff including approving vacations and expense reports writing annual performance reviews and making recommendations for promotions and salary increases

Characteristics and Qualifications:

The role will require an independent and highly motivated candidate with the ambition to maintain and develop a significant and sophisticated body of code that is used on a regular basis to produce large public data releases with a highly active and invested user community.

You will have domain expertise in computational methods for analyzing next-generation sequencing data as well as an interest in the technical aspects of deploying these methods at scale.

We are looking for someone who:

  • Is able to write clean efficient robust and usable code with demonstrated proficiency in one of the following: Unix/Linux Python Java C++ Matlab or R with a strong preference for Python Unix/Linux and R

  • Has a Ph.D. in mathematics computer science engineering physics mathematics statistics biology or another related field; or equivalent professional experience

  • Has demonstrated experience in quantitative (statistical mathematical computational) research with large data sets; skill and experience with statistical analysis and/or computational biology is strongly preferred with special consideration for individuals with prior experience using the Hail Python library

  • Has fluency with human genetics and next-generation sequencing data; ideally will have prior experience with the quality control of such datasets

  • Exhibits strong initiative and the ability to take ownership of complex projects and interest in the management and development of a team

  • Cares passionately about the quality of his/her work and demonstrates zealous attention to detail; is curious and tenacious about investigating anomalies in data

  • Is familiar with Git and modern team-based software development practices including peer code review through pull requests

  • Listens communicates and collaborates well with team members clinicians software developers and research scientists; is receptive to feedback and willing to provide constructive feedback to others; demonstrates kindness to others

  • Demonstrates excellent written and oral presentation skills

  • Manages time well and is able to respond to shifting priorities in a fast-paced and rapidly changing environment

"All computational scientists at Broad are encouraged to continue developing their expertise by engaging with the wider computational community through Broad's vibrant Models Inference & Algorithms Initiative (broadinstitute.org/mia)."

#LI-POST

All Broad employees regardless of work location must be fully vaccinated for COVID-19 by Tuesday October 12 2021. Requests for exemption for medical or sincerely held religious beliefs will be considered.

All qualified applicants will receive consideration for employment without regard to race color religion sex sexual orientation gender identity national origin disability or protected veteran status.

EEO is The Law - click here for more information

Equal Opportunity Employer Minorities/Women/Protected Veterans/Disabled

Check out this video for a look into our community!


Posting Date: Nov 17, 2021
Closing Date:
Organisation Website/Careers Page: https://broadinstitute.wd1.myworkdayjobs.com/en-US/broad_institute/job/Cambridge-MA/Computational-Scientist-I--Genome-Aggregation-Database_9315


Subscribe for receiving latest updates in Computational Sciences