Statistics
Te Tari Pāngarau me te Tatauranga
Department of Mathematics & Statistics

## Papers

The available 200 and 300 level statistics papers are changing over the next 2 years, following a review of the Statistics Programme. By 2020 the revised programme will be in place, and it will provide a clearer structure for students and updated content. However 2019 is an interim year.

Papers available each year (2018 – 2020) are listed first. Then the following section includes short descriptions of all papers, along with the years they are available. From there, clicking on the paper name will take you to a page with details of the paper.

200-level and 300-level papers by year:
 In 2018: STAT210, STAT241, STAT242, STAT251, STAT261, STAT341, STAT342, STAT352, STAT362, STAT380 In 2019: STAT210, STAT260, STAT270, STAT311, STAT341, STAT342, STAT370, STAT372, STAT399 In 2020: STAT210, STAT260, STAT270, STAT310, STAT311, STAT312, STAT370, STAT371, STAT372

See the flowchart of available papers for 2018, their prerequisites and semesters.

Click the paper name below for complete details.

## 100 level

STAT110 Statistical Methods     18 points   First Semester, Summer School

This is a paper in statistical methods for students in the biological and social sciences covering descriptive statistics, probability distributions, estimation, hypothesis testing, regression, analysis of variance and experimental design. At the end of the course you should be able to make use of a wide variety of techniques in the design and analysis of your own research studies. The program R will be used for statistical analysis and data summary throughout the paper.

STAT115 Introduction to Biostatistics     18 points   Second Semester

A paper for students in the health sciences covering an introduction to the research process and study design, measures for describing data, the binomial and normal distributions, estimation and inference for continuous data, estimation and inference for categorical data, regression procedures and statistical issues in study design. The statistical software R will be used throughout the paper to assist with data analysis.

## 200 level

STAT210 Statistical Methods 2     18 points   Second Semester

A service course in Statistics that builds on the introduction that many students receive in either STAT 110 Statistical Methods or STAT 115 Introduction to Biostatistics. Note that this course is intended for students not taking statistics as a major. As such, it cannot be credited to a major or minor in statistics if STAT241, STAT242, STAT251, STAT342, ECON210, FINC203 or FINC308 has been passed previously or is being taken concurrently.

STAT241 Regression & Modelling 1     18 points   First Semester
From 2019 this will be replaced by STAT210

This central paper in applied statistics develops regression models for analysing variability in data. It is the first in a modelling sequence required for a statistics minor, a statistics major or a Diploma for Graduates endorsed in Statistics. It is also useful as a service subject for students majoring in any subject involving the analysis of data. Topics include simple and multiple linear regression, residual analysis and inflence, methods for finding the best model, the importance of qualitative predictors using dummy variables, covariance analysis and experiments involving one and two factors; logistic regression and the comparison of models using deviance differences and AIC; adjusted odds ratios and probability calculations using the chosen best model. Binomial data and overdispersion are discussed. SPSS 22 and R are used.

STAT242 Multivariate Methods     18 points   Second Semester
Not available after 2018

This paper looks at procedures for the analysis and interpretation of data involving several response variables, and has wide application in research methodology in the biological, health and social sciences. The following procedures are all taught: tests of significance for multivariate data, principal component analysis, exploratory and confirmatory factor analysis, methods of discrimination including Fisher discrimination analysis, quadratic discrimination, and the analysis and use of multinomial logistic regression models for discrimination when appropriate, cluster analysis, canonical correlation analysis, multivariate distance measures, multidimensional scaling, correspondence analysis and methods of ordination. SPSS 24 and its add-on AMOS24 are used throughout the paper although some use may be made of R.

STAT251 Design of Research Studies     18 points   First Semester
Not available after 2018

An overview of design principles for scientific research. The topics covered include types of research study, random and systematic sampling, stratified sampling, basic experimental designs, replication, error degrees of freedom, defining sampling and experimental units, blocking, factorial experiments and repeated nested designs.

STAT260 Visualisation and Modelling in R     18 points   Second Semester
Available from 2019

The software R is used to introduce computer skills needed for the statistical sciences. The course covers reproducible research, data wrangling, visualisation, exploratory data analysis, resampling and simulation.

STAT261 Probability and Inference 1     18 points   First Semester
Not available after 2018

A introduction to probability theory and mathematical statistics. Probability and random variables, simulation, sampling distributions, statistical modelling and estimation, hypothesis testing.

STAT270 Probability and Inference     18 points   First Semester
Available from 2019

An introduction to the theory that underlies the statistical methods introduced in STAT110/115. Probability theory, random variables and distributions, expectation and variance, likelihoods, estimators and confidence intervals, hypothesis testing, and Bayesian inference.

## 300 level

STAT310 Statistical Modelling     18 points   First Semester
Available from 2020

Statistical model building, motivated by real applications. Topics include regularisation, lasso, splines, non-linear regression, generalised linear models, model checking and introduction to mixed models.

STAT311 Design of Research Studies     18 points   First Semester
Available from 2019

Design of studies to address different types of research questions. Survey methods, experimental and observational studies, measurement, control of confounding and bias, evaluation of competing designs, determination of study size.

STAT312 Modelling High Dimensional Data     18 points   Second Semester
Available from 2020

An introduction to the statistical learning techniques commonly used to analyse high-dimensional (or multivariate) data. Penalised regression, classification trees, clustering, dimension-reduction, bagging, stacking, boosting, random forests and ensemble learning.

STAT341 Regression and Modelling 2     18 points   First Semester
Not available after 2019

An introduction to generalised linear models, non-linear regression models, and mixed effects models, with a mixture of background theory and practice in applying the methods to real datasets.

STAT342 Multivariate Methods     18 points   Second Semester
Not available after 2019

This paper looks at procedures for the analysis and interpretation of data involving several response variables, and has wide application in research methodology of the biological, health and social sciences. Tests of significance for multivariate data, principal component analysis, exploratory and confirmatory factor analysis, methods of discrimination including the use of multinomial logistic regression models, canonical correlation analysis, cluster analysis, multivariate distance measures, multidimensional scaling, correspondence analysis and methods of ordination. SPSS 23 and its add-on AMOS23 are used throughout the paper although some use may be made of R.

Note that this paper is restricted against STAT 242 which has the same lectures but different assessment involving a project.

STAT352 Applied Time Series     18 points   Second Semester
Not available after 2018

An introduction to the practical aspects of the statistical analysis of time series and its application to the physical sciences and econometrics. Topics include seasonal decomposition, identification and estimation of ARIMA models, seasonal ARIMA models, and forecasting. Time series analysis is the statistical analysis of ordered sequences of data. The distinguishing feature is that these data are correlated. Such data arise in a bewildering range of application areas that include:

* Climatology: Estimation long term changes of climate

* Economics: Analysis of quarterly or monthly CPI, unemployment rates

* Finance: Estimating volatility of stock market returns

* Dendrochronology: Paleoclimate reconstruction based on series of tree ring widths

* Sonar: Detection of underwater signals

as well as communications, speech compression, seismology, control theory, and many many more.

STAT362 Probability and Inference 2     18 points   Second Semester
Not available after 2018

Theory of ordinary least squares, maximum likelihood estimation and inference, hypothesis testing, Bayesian statistics.

STAT370 Statistical Inference     18 points   Second Semester
Available from 2019

A continuation of the theoretical development begun in STAT 270, this paper will cover the theory of ordinary least squares, maximum likelihood estimation and inference, hypothesis testing, and Bayesian inference.

STAT371 Bayesian Data Analysis     18 points   Second Semester
Available from 2020

Introduction to Bayesian methods with an emphasis on data analysis.  Topics include prior choice, posterior assessment, hierarchical modelling and model fitting using R, JAGS and other freely available software.

STAT372 Stochastic Modelling     18 points   First Semester
Available from 2019

Introduction to statistical methods for processes observed over time and space. Poisson processes, renewal processes, Markov chains, hidden Markov models, geostatistics, spatial point processes, model fitting, forecasting and simulation.

STAT380 Statistical Computing     18 points   First Semester
Not available after 2018

A paper that introduces students to computational methods in statistics.

STAT399 Special Topic – Statistical Computing     18 points   Second Semester
Available in 2019

The software R is used to introduce computer skills needed for the statistical sciences. The course covers reproducible research, visualisation, exploratory data analysis, optimization, resampling, simulation and R programming.