Probability and Statistics (Fall 2023)
Topic Introduction
Probability and Statistics are two closely-related but different topics. While both of them study the black box of randomness, they focus on different problems:
- Probability: I have known the inside mechanism. What are the outside behaviors?
- Statistical Inference: I have known the outside behaviors. What is the inside mechanism?
Probability (including Random Process) is a very important theoretical mathematical model for many advanced topics, e.g., Advanced Algorithms (including Randomized Algorithms and Approximation Algorithms), Analysis of Algorithms, Combinatorial Mathematics (particularly, The Probabilistic Methods popularized by Paul Erdős[ˈɛrdøːʃ]), etc.
Based on Probability, Statistics is a very important applied mathematical tool to solve many problems. For example, it is the basis of most Machine Learning algorithms. Actually, since most Machine Learning algorithms come from the field of Statistical Learning, it is fair to say that a large part of Machine Learning belongs to Statistics. Besides, if you want to be a great decision maker (e.g., policy maker in government, investor, etc.), Statistics will be essential for you. In my view, Statistics is the core of Data Science.
Seminar Information
Content
In this seminar, we’ll be using Probability and Computing by Michael Mitzenmacher and Eli Upfal as our main material. Hopefully, we will cover at least Chapter 1 to Chapter 7 of this book in this seminar, which are:
- Events and Probability
- Discrete Random Variables and Expectation
- Moments and Deviations
- Chernoff and Hoeffding Bounds
- Balls, Bins, and Random Graphs
- The Probabilistic Methods
- Markov Chains and Random Walks
Besides, we might also cover some topics beyond these 7 chapters:
- Probability Theory
- Continuous Random Variable
- Law of Large Number & Central Limit Theorem
- Concentration of Measure
- Statistics
- Parameter Estimation
- Hypothesis Testing
- Some Randomized Algorithms
- Some Machine Learning Algorithms and Ideas
- …
Schedule
The detailed schedule will be updated later.
Topics | Time | Speaker | Material |
---|---|---|---|
Introduction to Probability | 2023/9/21 | Runshuo Xie | slides |
Random Variable | 2023/10/12 | Runshuo Xie | Prerequisite for L2 Prerequisite for L2 (with notes) handout handout (with notes) slides |
Presentations on different distributions | 2023/10/19 | Yifan Lü Yao Luo Jiutao Mao & Xiang Jiang Zhijie Pan Shide Liang Yingming He | 二项分布【吕一帆】 几何分布【罗耀】 泊松分布【毛九弢】 泊松分布的应用【蒋庠】 指数分布的应用【潘智杰】 Normal Distribution【梁世徳】 二项分布【贺英明】(2023/11/16) |
Tail Inequalities | 2023/11/16 | Runshuo Xie | slides handout handout (with notes) |
Introduction to Statistical Inference | 2023/11/30 | Runshuo Xie | slides |
Introduction to Markov Chain | 2023/12/21 | Runshuo Xie | slides |
Reference
Here are all materials that I’ve referred to when I was preparing for the seminar.
Books
Textbook of Probability and Statistics
- Probability and Random Processes (4th edition) by Geoffrey R. Grimmett, David R. Stirzaker
- Understanding Probability (3rd edition) by Henk Tijms
- Elementary Probability Theory With Stochastic Processes and an Introduction to Mathematical Finance by K. L. Chung, Farid AitSahlia
This is a really unique and fantastic book on probability, which is friendly enough to novice. - A Course in Probability Theory by Kai Lai Chung
- Probability Theory: The Logic of Science by E. T. Jaynes
- 概率论与数理统计,陈希孺著
Textbook of Computer Science
- Randomized Algorithms by Rajeev Motwani, Prabhakar Raghavan
- Concrete Mathmatics by Ronald L. Graham, Donald E. Knuth, Oren Patashnik
- Concentration of Measure for the Analysis of Randomized Algorithms by Devdatt P. Dubhashi, Alessandro Panconesi
Others
- The Seven Pillar of Statistical Wisdom by Stephen M. Stigler
- 从博弈问题到方法论学科——概率论发展史研究,徐传胜著
- 从掷骰子到阿尔法狗:趣谈概率,张天蓉著
- 机会的数学:统计学入门,陈希孺著
- 大数据时代的统计思想,李勇著
Courses
- Probability and Statistics (Fall 2023) by Prof. Shaoqun Zhang
- Probability Theory and Mathematical Statistics (Spring 2023) by Prof. Yitong Yin and Prof. Jingcheng Liu
- Advanced Algorithms (Fall 2023) by Prof. Yitong Yin, Prof. Shi Li and Prof. Jingcheng Liu
- Combinatorics (Spring 2023) by Prof. Yitong Yin
Many contents including the style of my slides come from Prof. Yitong Yin’s slides. - Probability and Measure by Prof. Adam B. Kashlak
- MIT 6.041 Probabilistic Systems Analysis and Applied Probability, Fall 2010 by Prof. John Tsitsiklis
- MIT 18.650 Statistics for Applications, Fall 2016 by Prof. Philippe Rigollet
- MATH2750 Introduction to Markov Processes by Prof. Matthew Aldridge