首届机器学习与统计会议将于2023年8月24日-26日在华东师范大学普陀校区召开,本次会议由中国现场统计研究会机器学习分会主办,华东师范大学统计学院、统计交叉科学研究院、统计与数据科学前沿理论及应用教育部重点实验室及统计应用与理论研究创新引智基地联合承办。会议旨在促进机器学习与统计领域的国内外学者进行学术交流,引领机器学习与统计共同交叉发展的学术文化,推动作为数据科学与人工智能的奠基性学科的进步,以此助力相关数字经济产业的发展。
主题报告专场(六)
Statistical Foundations for Modern Machine Learning
报告时间:
2023年8月25日 15:30-17:00
报告地址:
华东师范大学普陀校区 文史楼203
组 织 者:
李赛  中国人民大学
01
 刘林  上海交通大学
题目:Applying Modern Neural Networks in Statistical Problems: Semiparametrics, Causal Inference, and Differential Equations.
摘要In this talk, we will survey some of our recent explorations of modern neural networks in problems spanning semiparametric statistics, causal inference, and forward & inverse problems in differential equations. We will discuss the practical performance, some limited theory, and when the theoretical results fall short of guiding practice. Time permitted, we will discuss some potential directions we are working on to narrow the theory-practice gap.
简介:刘林在上海交通大学自然科学研究院、数学科学学院、交大-耶鲁生物统计与数据科学中心任职助理教授,研究方向为非参数半参数统计理论、因果推断、机器学习、及统计学在生物医学中的应用。
02
张琼  中国人民大学
题目:Gaussian Mixture Reduction with Composite Transportation Divergence
摘要Gaussian mixtures are widely used as a parametric distribution for approximating smooth density function of various, simplifying downstream inference tasks. They find extensive applications in density estimation, belief propagation, and Bayesian filtering. These applications often utilize finite Gaussian mixtures as initial approximations that are recursively updated. A challenge in these recursions is that the order of the Gaussian mixture increases exponentially, and the inference quickly becomes intractable. To overcome the difficulty, the Gaussian mixture reduction, which approximates a high order Gaussian mixture by one with a lower order, can be used. Although existing clustering-based methods are known for their satisfactory performance and computational efficiency, their convergence properties and optimal targets remain unknown. In this work, we propose a novel optimization-based Gaussian mixture reduction method based on composite transportation divergence (CTD). We develop a majorization-minimization algorithm for numerically computing the reduced Gaussian mixture and establish its theoretical convergence under general conditions. Furthermore, we demonstrate that many existing clustering-based methods are special cases of our approach, effectively bridging the gap between optimization-based and clustering-based techniques. Our unified framework empowers users to select the most appropriate cost function in CTD to achieve superior performance in their specific applications. Through extensive empirical experiments, we demonstrate the efficiency and effectiveness of our proposed method, showcasing its potential in various domains.
简介:张琼2015年本科毕业于中国科学技术大学少年班学院。2022年博士毕业于加拿大英属哥伦比亚大学统计系。2022年9月起加入中国人民大学统计与大数据研究院并担任助理教授。目前的研究方向包括:混合模型、分布式学习、联邦学习等。她的研究论文发表在Journal of Machine Learning Research, ICCV等机器学习期刊和会议上。
03
杨朋昆  清华大学
题目:Towards a Mathematical Foundation of Federated Learning: A Statistical Perspective
摘要: Federated Learning (FL) is a promising framework that has great potentials in privacy preservation and in lowering the computation load at the cloud. The successful deployment faces many challenges in both theory and practice such as data heterogeneity and client unavailability. In this talk, I will discuss the convergence and statistical efficiency of two widely-adopted FL algorithms: FedAvg and FedProx, from a statistical perspective. Our analysis is based on the standard non-parametric regression in a reproducing kernel Hilbert space. Additionally, we propose the concept of federation gain to quantify the impact of heterogeneity. Time permitted, FL from the perspectives of clustering and robust statistics will be discussed.
简介:Pengkun Yang is an assistant professor at the Center for Statistical Science at Tsinghua University. Prior to joining Tsinghua, he was a Postdoctoral Research Associate at the Department of Electrical Engineering at Princeton University. He received a Ph.D. degree (2018) and a master degree (2016) from the Department of Electrical and Computer Engineering at University of Illinois at Urbana-Champaign, and a B.E. degree (2013) from the Department of Electronic Engineering at Tsinghua University. His research interests include statistical inference, learning, optimization, and systems. He is a recipient of Thomas M. Cover Dissertation Award in 2020, and a recipient of Jack Keil Wolf ISIT Student Paper Award at the 2015 IEEE International Symposium on Information Theory (semi-plenary talk).
04
吴瑞佳  上海交通大学
题目:Supervised Topic Modeling: Optimal Estimation and Statistical Inference
摘要:With the development of computer technology and the internet, increasingly large amounts of textual data are generated and collected every day. It is a significant challenge to analyze and extract meaningful and actionable information from vast amounts of unstructured textual data. Driven by applications in a wide range of fields, there is an increasing need for developing computationally efficient statistical methods for analyzing a massive amount of textual data with theoretical guarantees. In the presentation, I will discuss supervised topic modeling, which jointly considers a collection of documents and their paired side information. A bias-adjusted algorithm is developed to study the regression coefficients in the supervised topic modeling under the generalized linear model formulation. I will also introduce an approach to constructing valid confidence intervals. Applications of the proposed methods reveal meaningful latent topic structures of textual data.
简介:Ruijia Wu is an assistant professor in the Department of Data and Business Intelligence at Antai College of Economics and Management, Shanghai Jiao Tong University. She has a Bachelor’s degree and a Master's degree from the University of Oxford, both in Mathematics. She obtained her Ph.D. in Statistics from the Wharton School, University of Pennsylvania in 2022. Her research interests include statistical machine learning, high-dimensional statistics, text analysis, and their applications.
本次会议无需注册费,请扫描下方二维码完成会议注册流程。
 获取更多会议信息,请登录会议官网:
 https://ml-stat.github.io/MLSTAT2023/
往期回顾
REVIEW

会议通知 | 首届机器学习与统计会议暨中国现场统计研究会机器学习分会成立大会

继续阅读
阅读原文