Data Science/Informatics

Biostatistician, Pharmacoepidemiology

Rockville, Maryland
Work Type: Full Time

Nextonic Solutions is seeking a highly ambitious, adaptable, structured, and detail-oriented Biostatistician, Pharmacoepidemiology to join our vibrant team at the National Institutes of Health (NIH), National Center for Advancing Translational Science (NCATS), located in Bethesda, MD.


  • Provide operational, executional, tactical, and strategic support for the Good Algorithmic Practice Initiative.
  • Help to implement recommendations from the Good Algorithmic Practice Initiative focused on standardization, reproducibility, and data transparency.
  • Help develop standards for the operationalization and fit-for-purpose use of AI/ML methods applied to Real-World Data (RWD). Produce exemplary educational material, including tools geared toward under-served communities and new learners, covering topics relevant to use of RWD including but not limited to treatment of missing data, case control selection, minimization of type I/II error and bias, prevention of model overfitting and poor generalization and approaches to Electronic Health Record (EHR) data limitations.
  • Manage, schedule, and oversee outreach initiatives to underserved communities and extramural sites with limited bioinformatics capacities.
  • Support investigations from the Office of the Director and N3C leadership requiring biostatistical and/or machine learning knowledge and domain expertise. Outputs of these efforts would serve as exemplars of Good Algorithmic Practice and may result in publications.
  • Produce criteria to evaluate the incremental benefit of PPRL-linked datasets (i.e., improvements in the quality and/or scope of analyses of N3C data, reduction in data missingness, etc.).
  • Participate in N3C Enclave Knowledge Store object and code template peer review, integration and unit testing, and validation.


  • PhD/MS in computer science, (bio)statistics, or similar quantitative degree.
  • Proven track record of working with real world clinical data.
  • Experience in modern machine learning techniques and biostatistics. Preferably experienced in causal inference methods, supervised, and unsupervised learning.
  • Fluent in Python and R with knowledge of Scikit-learn and at least one deep learning framework (TensorFlow/Keras or PyTorch). Systems biology modeling experience a plus.
  • Strong oral and written communication skills and ability to actively participate in technical discussions and follow up on action items.
Agency / Institute:

Submit Your Application

You have successfully applied
  • You have errors in applying