Statistics 1000-116bST
The course is an introduction to classical mathematical statistics. Below are the concepts and results that will be covered during the lectures. The material in square brackets is optional and will be presented if time permits.
- Sample counterparts of population quantities: mean, variance, standard deviation, quantiles. Empirical distribution function, Glivenko-Cantelli theorem. Empirical distribution.
- Statistical models: nonparametric, semiparametric, parametric.
Sufficient and minimal sufficient statistics. Factorization criterion and Dynkin-Lehmann-Scheffé theorem. Complete statistics.
- Exponential families and their properties.
- Parameter estimation: method of relative frequency, method of moments, maximum likelihood method. [EM algorithm]
- Bias of an estimator and mean squared error.
- Minimum variance unbiased estimators. Rao-Blackwell theorem. [Lehmann-Scheffé theorem]
- Fisher information, Cramer-Rao information inequality.
Asymptotic properties of estimators. Consistency, "Delta method". Asymptotic normality of the maximum likelihood estimator.
- Confidence intervals.
- Hypothesis testing theory: significance level, power, p-value. Uniformly most powerful test, Neyman-Pearson lemma. [Karlin-Rubin theorem]. Likelihood ratio test. [Wald test]
- Chi-squared test for simple and composite hypotheses. Chi-squared test of independence as a special case. Kolmogorov-Smirnov test, two-sample t-test. [Nonparametric tests: runs test, Fisher’s exact test]
- Gaussian linear models. Maximum likelihood estimators for coefficients, confidence intervals, hypothesis testing of zero coefficients. ANOVA.
- [Ridge regression and LASSO, logistic regression]
- [Elements of Bayesian statistics]
The accompanying computer lab sessions aim to illustrate the concepts introduced in the lecture and teach basic data analysis using the R programming language.
Type of course
Prerequisites (description)
Course coordinators
Learning outcomes
Students will understand the principles of data analysis and their theoretical foundations and be able to perform basic statistical procedures (using the R programming language). They will be able to analyze data using Gaussian linear models and apply these models for prediction.
Assessment criteria
The final grade is determined by a combination of the following points:
Exam (50 points)
Midterm (20 points)
Exercise grade (20 points)
Lab grade (10 points)
Bibliography
[1] W. Niemiro, Notatki do wykładu ze Statystyki:
https://www.mimuw.edu.pl/~pokar/StatystykaI/Literatura/NiemiroBook.pdf
[2] J. Noble, Notatki do wykładu ze Statystyki (ang):
www.mimuw.edu.pl/~noble/courses/Statistics
[3] P. Grzegorzewski, Statystyka matematyczna, PWN 2024
[4] P. Bickel and K. Doksum, Mathematical Statistics: Basic ideas and selected topics, Vol. 1, 2001.
Additional information
Information on level of this course, year of study and semester when the course unit is delivered, types and amount of class hours - can be found in course structure diagrams of apropriate study programmes. This course is related to the following study programmes:
Additional information (registration calendar, class conductors, localization and schedules of classes), might be available in the USOSweb system: