Syllabus

COS 598D:

Data Analysis and Modeling in Science, Engineering and Information Services
Mondays 1:30 pm – 4:00 pm
Computer Science Room 105 (Small Auditorium)
Course Professor: Jaswinder Pal Singh
Coordinator: Steven Kleinstein
TA: Christopher Calderon


Monday, February 6: PCA, SVD and their applications (JP Singh)

Reading Material 1
Reading Material 2
Reading Material 3
Reading Material 4

Slides

 

Monday, February 13: Clustering with Applications
Mixture models, k-means, hierarchical, spectral (Charikar)
Applications of PCA/SVD and Clustering (Charikar and Ned Wingreen)

Slides
Reading Material

Monday, February 20: Advanced Dimensionality Reduction

Random projections and Sketches (Charikar)

Slides

Applications (Funkhouser)

A. Jain. M.N. Murthy, P.J.Flynn. "Data Clustering: A Review". ACM Computing Surveys, Vol. 31, No. 3, September 1999

Reading Material 1

P. Berkhin. "Survey of clustering data mining techniques". Technical report, Accrue Software, 2002.

Reading Material 2

Rui Xu; Wunsch, D., II, "Survey of clustering algorithms," Neural Networks, IEEE Transactions on , vol.16, no.3pp. 645- 678, May 2005

Reading Material 3

Monday, February 27: Classification (Schapire)
SVM, boosting, logistic regression, etc.
Applications (e.g., text classification)
Reading Material 1
Reading Material 2
Reading Material 3
Slides

Monday, March 6: Graphical Models I:
Introduction to Bayesian networks; Parameter learning and inference (Troyanskaya)
Introduction to Hidden Markov Models (M. Singh)
Reading Material
Rabiner and Juang. An Introduction to Hidden Markov Models IEEE ASSP Magazine 3(1): 4--16. January 1986

Monday, March 13: Graphical Models II:
Probabilistic approaches to dimensionality reduction (Blei)
*****Variational methods for approximate inference
Reading Material 1
Reading Material 2

Transcription factors example for Bayesian networks (Tavazoie)
Reading Material

Monday, March 27: Model-based data-analysis I (Shvartsman)
ODE-based models
Parameter estimation, confidence intervals
Application example: model-based mechanism discrimination for EGF receptor trafficking

Monday, April 3: Model-based data-analysisII (Shvartsman)
PDE-based models
Dimensionality reduction (nondimensionalization and scaling)
Parameter estimation
Application example: pattern formation in developing tissues

Monday, April 10: Data Dimensionality Reduction for Dynamic Systems I (Kevrekidis)
Spectral methods
Manifold learning techniques

Monday, April 17: Data Dimensionality Reduction for Dynamic Systems II (Kevrekidis)
Identification of slow variables and dynamically meaningful reaction coordinates

Monday, April 24: Model Dimensionality Reduction (Li)

Monday, May 1: Scalable Computing (JP Singh)