01 and 02: Introduction, Regression Analysis and Gradient Descent
definition: a computer program is said to learn from experience E with respect to some task T and some performance measure P, if its performance on T, as measured by P, improves with experience E. (Tom Mitchell, 1998)
supervised learning:
“right answers” (labels) are given for every training example
regression: predict a continuous-valued output (e.g., house price)
classification: predict a discrete-valued output (e.g., cancer type)
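The difference between the two supervised tasks can be sketched in a few lines of Python. This is only an illustration: the data is invented, and the methods (a closed-form least-squares line for regression, a 1-nearest-neighbour rule for classification) are picked for brevity and are not taken from these notes. The names fit_line and nearest_neighbour are hypothetical helpers.

```python
# Toy data, invented for illustration only.
sizes = [50.0, 80.0, 100.0, 120.0]       # house size
prices = [150.0, 240.0, 300.0, 360.0]    # house price: a continuous label


def fit_line(xs, ys):
    """Least-squares fit of y = a + b * x (closed form, one feature)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    b = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys)) / sum(
        (x - mean_x) ** 2 for x in xs
    )
    a = mean_y - b * mean_x
    return a, b


# Regression: the prediction is a real number.
a, b = fit_line(sizes, prices)
predicted_price = a + b * 90.0

# Toy data, invented for illustration only.
tumour_sizes = [1.0, 1.5, 2.0, 4.0, 4.5, 5.0]
tumour_labels = ["benign", "benign", "benign",
                 "malignant", "malignant", "malignant"]  # a discrete label


def nearest_neighbour(xs, ys, query):
    """Return the label of the training example closest to query."""
    i = min(range(len(xs)), key=lambda k: abs(xs[k] - query))
    return ys[i]


# Classification: the prediction is one of a fixed set of labels.
predicted_label = nearest_neighbour(tumour_sizes, tumour_labels, 4.2)
```

In both cases the "right answers" (prices, tumour labels) are part of the training data; only the type of the predicted output differs.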
unsupervised learning:
data is unlabelled; the algorithm has to find structure in it, e.g. by clustering
examples: Google News, gene expression data, organising computer clusters, social network analysis, astronomical data analysis
cocktail party problem: voices overlap in a recording, how to separate them?
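As a rough sketch of what "clustering" means for unlabelled data, here is a minimal k-means on one-dimensional points. The notes do not name a specific clustering method, so k-means is an assumption made for illustration; the data, the starting centroids and the function name kmeans_1d are all hypothetical.

```python
def kmeans_1d(points, centroids, iterations=10):
    """Plain k-means on 1-D data with caller-supplied starting centroids."""
    centroids = list(centroids)
    assignment = []
    for _ in range(iterations):
        # Assignment step: each point joins its nearest centroid.
        assignment = [
            min(range(len(centroids)), key=lambda c: abs(p - centroids[c]))
            for p in points
        ]
        # Update step: each centroid moves to the mean of its points.
        for c in range(len(centroids)):
            members = [p for p, a in zip(points, assignment) if a == c]
            if members:  # an empty cluster keeps its old centroid
                centroids[c] = sum(members) / len(members)
    return centroids, assignment


# Unlabelled toy data, invented for illustration: no "right answers" given.
points = [1.0, 1.2, 0.8, 5.0, 5.3, 4.7]
centroids, assignment = kmeans_1d(points, centroids=[0.0, 6.0])
```

The algorithm is never told which group a point belongs to; the two groups emerge from the distances between points alone. The starting centroids are fixed here to keep the run deterministic.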