Skinnider lab

Machine learning to identify unknown small molecules at Princeton University

The Skinnider lab develops machine-learning approaches to identify known and unknown small molecules that are relevant to human health and disease, with mass spectrometry-based metabolomics being the primary analytical technique.

The human body contains thousands of small molecules, and is exposed to thousands more during daily life. At present, however, the vast majority of these small molecules remain unknown. Whereas high-throughput techniques can now reliably measure the DNA, RNA, and protein content of any given biospecimen, enumerating the complete complement of small molecules—the metabolome—has proven much more challenging. Mass spectrometry (MS), the workhorse of metabolomics, is capable of detecting thousands of molecules in routine experiments, but the vast majority of these cannot be definitively identified. This profusion of unidentified chemical entities has been dubbed the “dark matter” of the metabolome.

We are interested in illuminating this metabolic dark matter by developing new computational approaches to identify both known and unknown small molecules using mass spectrometry. To achieve this aim, we design and apply cutting-edge AI technologies to translate mass spectrometric information into chemical structures. Some of the key challenges we are interested in include:

  • Structure elucidation: decoding unknown chemical structures from MS/MS data
  • Metabolomics: developing new tools for data analysis to deal with the complexity of metabolomic experiments
  • Forensic chemistry: identifying new illicit drugs of abuse from clinical and forensic mass spectrometry data
  • Natural products: discovering new bacterial natural products with genomic and metabolomic datasets
  • Low-data learning: developing machine learning approaches to learn from small chemical datasets

We are recruiting at all levels (undergraduates, post-baccs, PhD students, and postdocs). If you are interested in joining the lab, please see the join page.

news

Jul 18, 2023 Lab website is live

selected publications

2021

  1. Cell type prioritization in single-cell data
    Michael A Skinnider*, Jordan W Squair*, Claudia Kathe, Mark A Anderson, Matthieu Gautier, Kaya JE Matson, Marco Milano, Thomas H Hutson, Quentin Barraud, Aaron A Phillips, and  others
    Nature Biotechnology, 2021

2020

  1. Comprehensive prediction of secondary metabolite structure and biological activity from microbial genome sequences
    Michael A Skinnider, Chad W Johnston, Mathusan Gunabalasingam, Nishanth J Merwin, Agata M Kieliszek, Robyn J MacLellan, Haoxin Li, Michael RM Ranieri, Andrew LH Webster, My PT Cao, and  others
    Nat. Commun., 2020

1935

  1. Can Quantum-Mechanical Description of Physical Reality Be Considered Complete?
    A. Einstein, B. Podolsky, and N. Rosen
    Phys. Rev., May 1935