PhD programme

A PhD in statistics gives you a very strong foundation in the future labour market, where there is more demand for competency in data analysis.

The PhD programme in statistics at Uppsala University provide both a broadening and deepening in statistics, providing skills in modern statistics methods and research areas.

Within in the PhD programme in statistics, one should also apply statistical methods on practical applications, thereby increasing the proficiency with which to analyse data.

A book on statistical methods in medical research lies on top of two scientific articles on a table

Main research areas

We mainly supervise our PhD students within our four main research areas: high dimensional data, casual inference, structural equation modeling and time series econometrics.

Collecting large amount of data is today a norm rather than an exception – partly dictated by the complexity of modern problems, and partly due to its convenient availability. The field of multivariate statistics has correspondingly grown to cope with the questions associated with such data. Originally motivated by the field of genetics, the investigation of high-dimensional data has now crept into as diverse areas of applications as engineering, psychology and behavioural sciences, biological sciences, and even agriculture and finance. To address the modern challenges, the statistical methodology is currently in the process of unprecedented development on all aspects: theory, computation and application. The field is set to grow unabatedly, and has secure future prospects.

Responsible researcher: Rauf Ahmad

 

Causal inference aims to deepen the understanding of social phenomena, or of analyses of efficiencies of different treatments (such as, medicinal, work-life or environmental treatments). Statistics cannot by itself create knowledge of various phenomena, but is rather used to test theories under various assumptions. Thematic theory, together with statistical theory and knowledge of how data is collected, jointly form the causal analysis. For that reason, it is necessary to understand thematic questions, and that in the development of new methods jointly consider how data together with statistical theory can improve analysis of causal questions, such as testing of theories as well as pure effect evaluations.

Currently, we are working with researchers in medicine, economics, psychology and engineering. Some of our thematic work include: (i) test of theories for how mass media affect voters' ability to hold elected officials accountable, (ii) analysis of the effect of family friendly workplaces on wages and income for men and women, (iii) analysis of the effect of air pollution on childrens' health, (iv) test of gender differences in preferences and (v) analysis of electricity consumption and changes in electricity tariffs and consumer information related to energy savings. Methodologically, we have made contributions to, among others, the design of randomized experiments and identification of causal effects using observational data, and register data in particular. Examples of problems we have investigated concern situations when the timing of a treatment is a choice (as opposed to a randomized experiment when the time for intervention is the same for all treated and untreated individuals) and where there might exist measurement errors, both in control variables and in the timing of treatment.

Responsible researchers: Per Johansson and Ingeborg Waernbaum

 

Structural equation modeling (SEM) is a multivariate statistical analysis technique that simultaneously unites Factor Analysis and Multiple Regression Analysis. It analyses the causal relationships among observed variables and latent constructs, including linear and nonlinear effects. SEM includes two basic types of models. The measurement modelrepresents the theory that specifies how a set of observed variables measure the latent constructs. The structural model represents the theory that shows how latent constructs are causally related to each other.

SEM can apply to various data types, cross-sectional data, longitudinal data, time-series data, or multilevel data. For example, the cross-sectional models help us to assess causal and mediation hypotheses; Latent Growth Curve Models often apply to analyse potential changes in the latent construct of interest over time; Item Response Theory Models usually analyse the patterns of individual behaviours and questionnaire responses, and Multilevel Models can assess the cause of variations between different data levels. SEM has been widely applied in social sciences and spread to other natural sciences in recent decades, e.g., information science and medical research.

The Department of Statistics in Uppsala has a long tradition of structural equation modeling, and it is known as the birthplace of SEM. Professor Emeritus Karl G. Jöreskog is the pioneer in SEM, and the LISREL (linear structural relations) program (Jöreskog and Sörbom) was the first software for analysis of Structural Equation Models. Nowadays, Professor Fan Wallentin and Associate Professor Shaobo Jin with colleagues carry on this rich tradition and actively contribute to the field.

Responsible researcher: Fan Yang Wallentin

 

The Department of Statistics has a long tradition of research within times series econometrics. The tradition started in 1942 when Herman Wold become professor at the department. Econometrics is the field concerning applications of statistical methods on problems within economics, and the methods are developed for applied problems in economics. Time series econometric deal with economic data sorted over time, and the aim is to analyse associations between different economic variables and their development over time. For instance, the government need to know how the gross domestic product (GDP), inflation and unemployment will develop in order to make the national budget.

One research area is to improve forecast to just these variables (GDP, inflation and unemployment). We are living in a ever-changing world affected by various decision. These decisions result in associations between economic variables changing over time. Another research area is how to model structural changes over time using non-linear models. With such models, it is possible to get increased understanding of various economic phenomena.

Responsible researcher: Johan Lyhagen

 

Programme structure

The PhD programme consists of four years of full-time studies, and consists of 90 credits of coursework and a thesis worth 150 credits. Of the 90 coursework credits, 34.5 consists of mandatory courses, with the rest being elective courses.

The mandatory courses are: Inference theory (15 credits), Asymptotic theory (7.5 credits), Philosophy of science (5 credits), Scientific communication (5 credits) and Research ethics (2 credits), or equivalent courses. For PhD students who does not have prerequisite knowledge in probability theory, the course Probability theory (7.5 credits) is also mandatory.

The remaining courses are elective courses but should be within one or more of the areas of statistical methodology and/or applications. Courses both within and outside for the thesis area should be included, with at least 15 credits for each.

Qualifications

Basic and specific qualifications to be admitted to the PhD programme is described in the general study plan. Aside from basic qualifications for a PhD programme, the applicant must have passed results on courses of 90 credits in statistics, of which at least 60 credits at advanced level.

More about PhD studies at the Faculty of Social Sciences

FOLLOW UPPSALA UNIVERSITY ON

facebook
instagram
twitter
youtube
linkedin