BODE 2
Mount Sinai to launch BODE 2 supercomputer for greater research capabilities
November 22, 2019
by
John R. Fischer, Senior Reporter
The U.S. Department of Health and Human Services has awarded a $2 million grant to Mount Sinai Health System for the building of a second "big omics data engine" (BODE 2) supercomputer.
The engine will enable the New York-based healthcare practice to address scientific inquiries more quickly in a variety of healthcare arenas, including medical imaging, genomics, clinical data, and molecular, behavioral and environmental areas of study.
"With access to BODE 2, with more and faster compute and storage, researchers will be able to take advantage of the ability to receive results faster for overall greater scientific throughput; the ability to increase the fidelity of their simulations and analyses; and the ability to migrate research applications seamlessly to a familiar software environment with more capability," Patricia Kovatch, senior associate dean for scientific computing and data science, told HCB News.
BODE 2 is expected to enable greater analysis in genetics and population health, gene expression, machine learning, and structural and chemical biology investigations for insights on conditions such as Alzheimer's, autism, influenza, prostate cancer, schizophrenia, and substance use disorders.
It is the successor to BODE, a previous supercomputer launched in 2013 that is used by 61 basic and translational researchers at Mount Sinai, and collaborators at 75 external institutions. Researchers applied Bode to 67 projects that accumulated more than $100 million in NIH funding. Though effective in its work, expansion of the Mount Sinai faculty and research initiatives require a new system that can offer greater storage space and computational throughput, according to hospital leadership.
A Lenovo ThinkSystem SR360, BODE 2 is made up of 3,840 Intel Cascade Lake cores, with 15 terabytes of memory, 14 petabytes of raw storage, and 11 petabytes of usable storage. It is expected to produce 28 million core compute hours per year at a frequency of 2.6 GHz, and has a peak speed of 220 teraflops per second which approximately doubles that of BODE.
It will also enable application of translational bioinformatics research and data-driven medicine, with its use already planned for upcoming projects. One is focused on SPl1-Dependent Alzheimer Disease Risk, with BODE 2 providing necessary storage for whole-genome-sequencing data sets from more than 10,000 study subjects and processing power of approximately 12 million compute hours for analyzing data using machine learning techniques to enhance treatments and explore new therapies.
The other is on Mount Sinai’s Trans-Omics for Precision Medicine (TOPMed) Program. For this, the engine will provide 1.75 petabytes of storage needed for whole-genome sequencing data, other omics, and molecular, behavioral imaging, environmental, and clinical data for investigating biological causes of heart, blood, lung, and sleep disorders. Hundreds of terabytes will also be offered for immediate results storage, along with approximately seven million compute hours for the highest-powered analysis of the TOPMed data. BODE 2 is expected to accelerate these processes compared to a stand-alone machine for each.
"Biomedical science is becoming more computationally- and data-intensive," said Kovatch. "Early career researchers and physicians are highly trained and savvy with data science at the intersection of biomedicine, and technology continues to evolve and become more affordable. It is essential for healthcare organizations to leverage these trends, and their 'natural' data resources to continue to improve patient outcomes. Capable computational and data resources such as BODE 2, coupled with biomedical data science experts as we have at Mount Sinai, provide better care through precision, personalized and predictive medicine, presenting a pathway to make better informed healthcare decisions."
BODE 2 is set to launch at the end of the year.