The course intends to give knowledge of analysis and visualisation of complex multidimensional data. On completion of the course, the student should be able to
explain the theoretical basic principles of some model families within the subject area and apply these on biomedical/biological data
explain the theoretical basic principles of some general algorithmic modelling technologies within the subject area and apply these on biomedical/biological data
independently choose appropriate methods for given biological/biomedical data and issues
critically analyse, evaluate and compile biological/biomedical results based on the models that are created from complex amounts of data.
The course treats analysis and visualisation of complex multidimensional data where collected measured values for an observation may consist of a combination of different qualitative and quantitative variables that most often come together with measurement errors (technical variation), sometimes are completely unknown (missing data), and sometimes are censored (due to detection thresholds and/or saturation effects in measuring instruments). (1) Unsupervised explorative modelling: Compression and visualisation in the form of e.g., factor analysis, clustering, multidimensional scaling, nonnegative matrix factorisation, fitting to manifolds, autoassociative mappings. (2) Supervised modelling: Different families of methods for prediction e.g., model-based, linear, basis expansion based, local, additive, tree based, rule based, neural network based, support vector based, prototype based. (3) General algorithmic modelling techniques such as ensemble methods, bagging, boosting, Bayesian inference, performance evaluation, model selection, variable selection.
Lectures, seminars, computer exercises and projects.
Theory 5 credits, Implementation and projects 10 credits. For Pass grade, participation in at least 80% of the implementation and project meetings, as well as an approved project report are required. The theory part is examined in a written examination and in compulsory written assignments.
week 27, 2012
The elements of statistical learning : data mining, inference, and prediction