Monthly Archives: September 2012

Week 5. Model building using linear regression

Students learn how to construct a prediction model incrementally using stepwise linear regression. They also learn about data preparation, including converting categorical variables into a set of dummy variables with numerical values of 0 and 1, and about the concept of interaction: 2 …
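The dummy-variable conversion mentioned in this post can be illustrated with a minimal sketch in plain Python. The function name `dummy_encode`, the `is_` column prefix, and the sample library types are all hypothetical, chosen only for illustration; the course itself does not prescribe this code. Dropping the first level as a reference category is also an assumption, though it is the usual choice for regression with an intercept.

```python
def dummy_encode(values, drop_first=True):
    """Convert a list of category labels into 0/1 dummy columns.

    Returns a dict mapping a column name to a list of 0/1 values,
    one entry per input row.
    """
    levels = sorted(set(values))
    if drop_first:
        # The first level (alphabetically) becomes the reference category,
        # so k categories yield k - 1 dummy columns.
        levels = levels[1:]
    return {
        f"is_{level}": [1 if v == level else 0 for v in values]
        for level in levels
    }


# Hypothetical sample data: one library type per survey respondent.
library_type = ["academic", "public", "special", "public"]
dummies = dummy_encode(library_type)

for name, column in dummies.items():
    print(name, column)
```

Here "academic" is the reference category, so a row with 0 in every dummy column represents an academic library.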

Posted in Uncategorized

Week 2. Introduction to statistical analysis

In the lab session, students are introduced to the SPSS statistical software. This is used to carry out bivariate analysis: between a quantitative variable (salary) and a categorical variable (e.g. type of library), using a t-test; and between two quantitative variables (e.g. salary …
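To show what a t-test between a quantitative variable and a two-level categorical variable computes, here is a rough sketch in plain Python. It uses Welch's unequal-variance form of the t statistic, which is an assumption on my part and not necessarily the variant used in the SPSS lab. The function name `welch_t` and the salary figures are hypothetical and are not taken from the Salary Survey data.

```python
from math import sqrt
from statistics import mean, variance


def welch_t(sample_a, sample_b):
    """Return Welch's t statistic for the difference between two group means."""
    # statistics.variance computes the sample variance (n - 1 denominator).
    var_a, var_b = variance(sample_a), variance(sample_b)
    standard_error = sqrt(var_a / len(sample_a) + var_b / len(sample_b))
    return (mean(sample_a) - mean(sample_b)) / standard_error


# Hypothetical salaries (in thousands), grouped by type of library.
academic = [52, 48, 55, 50]
public = [45, 47, 44, 48]

t_stat = welch_t(academic, public)
print(f"t = {t_stat:.2f}")
```

A large absolute value of t suggests the group means differ by more than sampling noise would explain. Turning t into a p-value needs the t distribution, which a statistics package such as SPSS handles for you.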

Posted in Uncategorized

Week 1. Exploratory analysis using Microsoft Excel

The first few labs use Salary Survey data from the Library Association to teach basic statistical analysis. In the first lab, students use MS Excel to explore bivariate relations between annual income (target variable) …

Posted in Uncategorized

Week 1. Difficulty level of the course

Students will tell you that this is a difficult but useful course, possibly the most difficult course in the KM programme. I think students find the course difficult for two reasons: other courses in the KM programme are management-oriented, …

Posted in Uncategorized

Week 1. Reading & homework

Read [Linoff & Berry] chap. 1 & 2. Reading is like text mining: you should not memorise every line. Skim the text to look for “good stuff”, i.e. read purposefully. But what is the purpose of reading [Linoff & …

Posted in general

Week 1. Introduction to the course

There are two data mining courses offered in the School: a technical course offered in the MSc Information Systems programme (CI6227 Data Mining), and a practical course (this course, K6225) using a how-to-do-it, how-does-it-work and how-to-apply-it kind of approach, with a …

Posted in general

Purpose of the blog

This blog is for the course K6225 Knowledge Discovery & Data Mining in the MSc Knowledge Management programme offered in the School of Communication & Information. The blog was set up for three purposes. Pedagogical purpose: to be an additional …

Posted in general