Tony Kerlavage on Data Lakes, Data Commons, and Empowering the Research of the Future

February 22, 2022


At the National Cancer Institute, Tony Kerlavage knows quite a bit about managing very large pools of data. When NCI launched the Genomic Data Commons, it aimed to democratize access to the genomic data in The Cancer Genome Atlas and other sources. Since then, though, Kerlavage points out that our data types and volumes have only grown. Now NCI is taking a “Commons of Commons” approach to link pools of well-structured data. “The more data we can bring together in a well-structured way, the more value it has in the long run,” he believes. He advocates for sharable Python notebooks and reusable R programming, believing that significant investments in data hygiene and interoperability deliver more value than simply mining data lakes with artificial intelligence tools, at least for now. The challenge for researchers, Kerlavage says, is to view their work with an eye to the future: How might someone else use this data going forward?

Tony Kerlavage, Director, Center for Biomedical Informatics & Information Technology, National Cancer Institute
Dr. Tony Kerlavage has served as the director of CBIIT since May of 2019. He joined NCI as a program director in 2011 after more than 25 years in the public and private sectors as a leader in bioinformatics and genomics. He became chief of the Cancer Informatics Branch in 2012 and acting director of CBIIT in 2017. During his tenure, NCI’s efforts in advancing open data, open software, and open science have increased exponentially. Dr. Kerlavage has led ground-breaking efforts in these areas, including helping to establish the NCI Cancer Cloud Resources and the Cancer Research Data Commons.

Host Bio

Stan Gloss


As co-founder and Evangelist of BioTeam, Stan Gloss has been working to tell the stories of the intersection of science, data, and technology since 2002. Gloss joined with fellow founding partners Bill Van Etten and Chris Dagdigian to form BioTeam in 2002 following his tenure in business development with AVAKI Corporation, a pioneer in global grid software solutions, and Blackstone Computing, a computing and IT consulting company for scientists. Gloss led the sales initiative that launched the company in the life sciences market. Gloss earned his MS at the University of Buffalo and was a department chairman and faculty member at Quinnipiac University.

A life science IT consulting firm at the intersection of science, data, and technology, BioTeam builds innovative scientific data ecosystems that close the gap between what scientists want to do with data—and what they can do. Learn more at