报告信息
主题
ClusterDE: a post-clustering multiple-testing framework robust to double dipping, with applications to cluster-marker gene identification from single-cell gene expression data
嘉宾:李婧翌
地点:腾讯会议:377-245-407 (或点击阅读原文)
时间:2023年06月27日(周二) 20:00
报告摘要
In standard single-cell gene-expression data analysis, cells are annotated as candidate cell types in two steps: given a cell-by-gene matrix, cells are first clustered, and then each cluster is assigned with marker genes that have higher expression levels in the cluster than other clusters. Hence, the procedure is clustering followed by multiple testing on the same data, a phenomenon referred to as ``double dipping." Due to double dipping, every cell cluster would inevitably have marker genes assigned even if the cluster should not be separated from the other cells, resulting in false positive cluster-marker genes. To overcome this challenge, we propose ClusterDE, a post-clustering multiple-testing framework, for controlling the false discovery rate (FDR) regardless of clustering quality. The core idea of ClusterDE is to set up in silico negative control data. Applied to single-cell gene expression data, we show that ClusterDE not only has solid FDR control but also finds cell-type marker genes that are biologically meaningful. ClusterDE is fast, transparent, and adaptive to a wide range of clustering methods and statistical tests.
嘉宾简介
Dr. Jingyi Jessica Li is a Professor in the Department of Statistics at the University of California, Los Angeles, where she holds secondary appointments in the Departments of Biostatistics, Computational Medicine, and Human Genetics. She leads a research group called the Junction of Statistics and Biology, which focuses on developing interpretable statistical methods for biomedical data analysis. Dr. Li received her Ph.D. from the University of California, Berkeley, and her B.S. from Tsinghua University. Her research interests include quantifying the central dogma, extracting hidden information from bulk, single-cell, and spatial multi-omics data, and ensuring statistical rigor in biomedical data analysis. She emphasizes using in silico negative controls to avoid false discoveries. Dr. Li has received multiple awards in recognition of her work, including the NSF CAREER Award, Sloan Research Fellowship, Johnson & Johnson WiSTEM2D Math Scholar Award, MIT Technology Review 35 Innovators Under 35 China, Harvard Radcliffe Fellowship, COPSS Emerging Leader Award, and ISCB Overton Prize.
统计之都(Capital of Statistics,简称 COS)成立于 2006 年,是一个旨在推广与应用统计学、数据科学知识的公益性网站和社区。
统计之都以专业、人本、正直、团结的理念尝试推动统计和数据科学在中国的发展,促进各行业的创新和繁荣。
继续阅读
阅读原文