Postbaccalaureate Studies
The Department of Statistics offers courses in the basic principles and techniques of probability and statistics, advanced theory and methods courses, courses in stochastic processes and methods, and courses statistical methods in finance.
Departmental Chair:
David Madigan, 1255 Amsterdam Ave. Room Room 1004 SSW
212-851-2131
madigan@stat.columbia.edu
Departmental Adviser: Regina Dolgoarshinnykh, 1255 Amsterdam Ave., Room 1013 SSW
212-854-2140
regina@stat.columbia.edu
Department Office: Dood Kalicharan, 1255 Amsterdam Ave., Room 1005 SSW
212-851-2130
dk@stat.columbia.edu
Office Hours: Monday-Friday, 9:00 AM-5:00PM
Course scheduling is subject to change. Days, times, instructors, class locations, and call numbers are available on the Directory of Classes.
Fall course information begins posting to the Directory of Classes in February; Summer course information begins posting in March; Spring course information begins posting in June. For course information missing from the Directory of Classes after these general dates, please contact the department or program.
Click on course title to see course description and schedule.
A friendly introduction to statistical concepts and reasoning with emphasis
on developing statistical intuition rather than on mathematical rigor.
Topics include design of experiments, descriptive statistics, correlation
and regression, probability, chance variability, sampling, chance models,
and tests of significance.
Designed for students in fields that emphasize quantitative methods.
Graphical and numerical summaries, probability, theory of sampling
distributions, linear regression, confidence intervals and hypothesis
testing. Quantitative reasoning and data analysis. Practical experience
with statistical software. Illustrations are taken from a variety of
fields. Data-collection/analysis project with emphasis on study designs is
part of the coursework requirement.
Designed for students who desire a strong grounding in statistical concepts
with a greater degree of mathematical rigor than in STAT W1111. Random variables, probability distributions,
pdf, cdf, mean, variance, correlation, conditional distribution,
conditional mean and conditional variance, law of iterated expectations,
normal, chi-square, F and t distributions, law of large numbers, central
limit theorem, parameter estimation, unbiasedness, consistency, efficiency,
hypothesis testing, p-value, confidence intervals, maximum likelihood
estimation. Serves as the pre-requisite for ECON W3412.
A quick calculus-based tour of the fundamentals of probability theory and
statistical inference. Probability models, random variables, useful
distributions, expectations, law of large numbers, central limit theorem,
point and confidence interval estimation, hypothesis tests, linear
regression. Students seeking a more thorough introduction to probability
and statistics should consider STAT W3105 and W3107.
Data Mining is a dynamic and fast growing field at the interface of Statistics and Computer Science. The emergence of massive datasets containing millions or even billions of observations provides the primary impetus for the field. Such datasets arise, for instance, in large-scale retailing, telecommunications, astronomy, computational and statistical challenges.
This course will provide an overview of current research in data mining and
will be suitable for graduate students from many disciplines. Specific
topics covered with include databases and data warehousing, exploratory
data analysis and visualization, descriptive modeling, predictive modeling,
pattern and rule discovery, text mining, Bayesian data mining, and causal
inference.
A fast-paced introduction to statistical methods used in quantitative
finance. Financial applications and statistical methodologies are
intertwined in all lectures. Topics include regression analysis and
applications to the Capital Asset Pricing Model and multifactor pricing
models, principal components and multivariate analysis, smoothing
techniques and estimation of yield curves statistical methods for financial
time series, value at risk, term structure models and fixed income
research, and estimation and modeling of volatilities. Hands-on experience
with financial data.
Theory and practice, including model-checking, for random and mixed-effects
models (also called hierarchical, multi-level models). Extensive use of the
computer to analyse data.
Introductory course on the design and analysis of sample surveys. How
sample surveys are conducted, why the designs are used, how to analyze
survey results, and how to derive from first principles the standard
results and their generalizations. Examples from public health, social
work, opinion polling, and other topics of interest.
Least squares smoothing and prediction, linear systems, Fourier analysis,
and spectral estimation. Impulse response and transfer function. Fourier
series, the fast Fourier transform, autocorrelation function, and spectral
density. Univariate Box-Jenkins modeling and forecasting. Emphasis on
applications. Examples from the physical sciences, social sciences, and
business. Computing is an integral part of the course.
Review of elements of probability theory. Poisson processes. Renewal
theory. Wald's equation. Introduction to discrete and continuous time
Markov chains. Applications to queueing theory, inventory models, branching
processes.
A one semester course covering: Simple and multiple regression, including
testing, estimation, and confidence procedures, modeling, regression
diagnostics and plots, polynomial regression , colinearity and confounding,
model selection, geometry of least squares. Linear time series models.
Auto-regressive, moving average and ARIMA models. Estimation and
forecasting with time series models. Confidence intervals and prediction
error. Students may not receive credit for more than two of STAT W4315, W4437, and W4440.
This course covers the non-stochastic process portions of the MLC/3L exam,
and is about pricing and reserving of life insurance. Topics include
actuarial present value, the equivalence principle, premiums, three methods
of calculating reserves, joint life and multiple hazard.
This course covers portions of the C/4 exam not covered elsewhere in the
curriculum. Topics may include Bayesian statistics, credibility, and risk
measures.
Introduction to the mathematical theory of interest as well as the elements
of economic and financial theory of interest. Topics include rates of
interest and discount; simple, compound, real, nominal, effective, dollar
(time)-weighted; present, current, future value; discount function;
annuities; stocks and other instruments; definitions of key terms of modern
financial analysis; yield curves; spot (forward) rates; duration;
immunization; and short sales. The course will cover determining equivalent
measures of interest; discounting; accumulating; determining yield rates;
and amortization.
A friendly introduction to statistical concepts and reasoning with emphasis
on developing statistical intuition rather than on mathematical rigor.
Topics include design of experiments, descriptive statistics, correlation
and regression, probability, chance variability, sampling, chance models,
and tests of significance.
Designed for students in fields that emphasize quantitative methods.
Graphical and numerical summaries, probability, theory of sampling
distributions, linear regression, confidence intervals and hypothesis
testing. Quantitative reasoning and data analysis. Practical experience
with statistical software. Illustrations are taken from a variety of
fields. Data-collection/analysis project with emphasis on study designs is
part of the coursework requirement.
Designed for students who desire a strong grounding in statistical concepts
with a greater degree of mathematical rigor than in STAT W1111. Random variables, probability distributions,
pdf, cdf, mean, variance, correlation, conditional distribution,
conditional mean and conditional variance, law of iterated expectations,
normal, chi-square, F and t distributions, law of large numbers, central
limit theorem, parameter estimation, unbiasedness, consistency, efficiency,
hypothesis testing, p-value, confidence intervals, maximum likelihood
estimation. Serves as the pre-requisite for ECON W3412.
A quick calculus-based tour of the fundamentals of probability theory and
statistical inference. Probability models, random variables, useful
distributions, expectations, law of large numbers, central limit theorem,
point and confidence interval estimation, hypothesis tests, linear
regression. Students seeking a more thorough introduction to probability
and statistics should consider STAT W3105 and W3107.
Data Mining is a dynamic and fast growing field at the interface of Statistics and Computer Science. The emergence of massive datasets containing millions or even billions of observations provides the primary impetus for the field. Such datasets arise, for instance, in large-scale retailing, telecommunications, astronomy, computational and statistical challenges.
This course will provide an overview of current research in data mining and
will be suitable for graduate students from many disciplines. Specific
topics covered with include databases and data warehousing, exploratory
data analysis and visualization, descriptive modeling, predictive modeling,
pattern and rule discovery, text mining, Bayesian data mining, and causal
inference.
A fast-paced introduction to statistical methods used in quantitative
finance. Financial applications and statistical methodologies are
intertwined in all lectures. Topics include regression analysis and
applications to the Capital Asset Pricing Model and multifactor pricing
models, principal components and multivariate analysis, smoothing
techniques and estimation of yield curves statistical methods for financial
time series, value at risk, term structure models and fixed income
research, and estimation and modeling of volatilities. Hands-on experience
with financial data.
Statistical methods for rates and proportions, ordered and nominal
categorical responses, contingency tables, odds-ratios, exact inference,
logistic regression, Poisson regression, generalized linear models.
Statistical inference without parametric model assumption. Hypothesis
testing using ranks, permutations, and order statistics. Nonparametric
analogs of analysis of variance. Non-parametric regression, smoothing and
model selection.
Least squares smoothing and prediction, linear systems, Fourier analysis,
and spectral estimation. Impulse response and transfer function. Fourier
series, the fast Fourier transform, autocorrelation function, and spectral
density. Univariate Box-Jenkins modeling and forecasting. Emphasis on
applications. Examples from the physical sciences, social sciences, and
business. Computing is an integral part of the course.
Survival distributions, types of censored data, estimation for various
survival models, nonparametric estimation of survival distributions, the
proportional hazard and accelerated lifetime models for regression analysis
with failure-time data. Extensive use of the computer.
Review of elements of probability theory. Poisson processes. Renewal
theory. Wald's equation. Introduction to discrete and continuous time
Markov chains. Applications to queueing theory, inventory models, branching
processes.
This course covers theory of stochastic processes applied to finance. It
covers concepts of Martingales, Markov chain models, Brownian motion.
Stochastic Integration, Ito's formula as a theoretical foundation of
processes used in financial modeling. It also introduces basic discrete and
continuous time models of asset price evolutions in the context of the
following problems in finance: portfolio optimization, option pricing, spot
rate interest modeling.
This course covers the non-stochastic process portions of the MLC/3L exam,
and is about pricing and reserving of life insurance. Topics include
actuarial present value, the equivalence principle, premiums, three methods
of calculating reserves, joint life and multiple hazard.
This course covers portions of the C/4 exam not covered elsewhere in the
curriculum. Topics may include Bayesian statistics, credibility, and risk
measures.
Introduction to the mathematical theory of interest as well as the elements
of economic and financial theory of interest. Topics include rates of
interest and discount; simple, compound, real, nominal, effective, dollar
(time)-weighted; present, current, future value; discount function;
annuities; stocks and other instruments; definitions of key terms of modern
financial analysis; yield curves; spot (forward) rates; duration;
immunization; and short sales. The course will cover determining equivalent
measures of interest; discounting; accumulating; determining yield rates;
and amortization.