STA 6207 -- Regression Analysis

Fall 2017  Syllabus (WORD format)

Fall 2016     Exam 1    Solutions     Scores (209 Points)

Fall 2016     Exam 2    Solutions     Scores   (240 Points)

Fall 2012       Exam 1       Exam 2          Exam 3

Fall 2013       Exam 1       Exam 2          Exam 3 (Version 1)    Exam 3 (Version 2)

Fall 2014        Exam 1       Exam 2         Exam 3

Fall 2015        Exam 1       Exam 2         Exam 3

Fall 2016        Exam 1       Exam 2         Exam 3

Exam 1 Topics - Fall 2015   (Exam @ 7AM, Friday, September 30)     Chapter 3 Portions are for Simple Regression (p=1)

Exam 2 Topics (Exam @ 7AM, Friday, October 27)

Instructor: Larry Winner
Office Hours:  M 9:45-11:15    Tu 12:00-1:30    Th 8:00-9:00

E-mail: winner@stat.ufl.edu
Phone: (352) 273-2995

TA: Xuan Cao,  Office Hours: M 1:30-5:30    Tu 8:30 am - 12:30 pm    E-mail:  caoxuan@ufl.edu   Office: FLO 116D

Regression Notes - New Version (Available at Target Copy. Do not print this out on Department Copier!)         R Programs          References

Very Old Exams

Statistical Tables

Running R on Windows and Macs (Source: Stanford University Social Science Data and Software)

Chapter 1 - Math Stat Review/Introduction

Examples of Common Families of Distributions

Michael Jordan Career Regular Season Stats       EXCEL

Distributions of Functions of Normal RVs         R Program        R Output         Graphics Output

Discrete Bivariate Distribution - Female Curling Scores by End at 2014 World Championships     EXCEL

Chapter 2 - Simple Linear Regression - Scalar Form (RPD Chapter 1)

Practice Problems      WORD       PDF

Carpet Aging            R Program (.r)         R Program (.pdf)         R Output           EXCEL Spreadsheet

Electric Train Supply and Demand       Data       Description

EXCEL   Spreadsheet      Combined EXCEL, R, SAS   Programs/Results

R    Program            SAS    Program

MPA Suspension Concentration and Peak Intensity  EXCEL

Antioxidant Levels and Activity in 40 Varieties of Lager Beer        Data (.csv)         Description      EXCEL Spreadsheet

WNBA Heights and Weights           Data (.csv)       R Program             R Text Output           R Graphics Output                 Regression through Origin (EXCEL)

Orlando Weather Data (EXCEL)

Minneapolis Annual Temperature 1900-2014 (EXCEL)

NBA Over/Under and Total Points 2014/2015 Correlation Analysis        Data.csv       Description        R Program     EXCEL Spreadsheet

Classical Simple Linear Regression Model - Galton Height Data

Chapters 3 & 4 - Matrix Approach to Simple Linear Regression and Distributional Properties  (RPD Parts of Chapter 2 - 4)

Practice Problems      WORD      PDF

Introduction to Matrix Algebra and Simple Linear Regression in Matrix Form (Chapter 2 and part of 3 in RPD)      PDF

Florida Lotto Results - October 24, 1999 - September 16, 2014        R Program

Gravity/Latitude Worksheet        EXCEL Spreadsheet             Data       Description

RPD Problem 4.6     R Program      R Output

Chapter 5 - Problem Areas in Least Squares and Residual Diagnostics/Tests  (RPD Chapters 10-12)

Practice Problems            WORD         PDF

R Program to Simulate Problem Areas in Least Squares

Residuals and Influence Measures  (WORD)

Residuals and Influence Measures (PPT)       Apparent R Guidelines for Identifying Influential Observations

US State Wine Consumption and Population     Data (.csv)        R Program        PDF

Math Score/LSD Concentration      EXCEL       R Program

Residual Analysis of Regression of Argentine Wheat Yields Rainfall and Temperature  (WORD)

Argentine Wheat Yields     Data           Description

Argentine Wheat Yields   SAS Program      SAS Text Output      SAS Graphics Output

Argentine Wheat Yields

Muscle Regression Case Study (PPT)
NFL 2007 Spread and Actual Scores - Regression/Residual Analysis and Tests  (PPT)

Variance Stabilizing Transformations

Box-Cox Transformation Description

Spanish Silver in New World 1720-1800 - Box-Cox Transformation SAS Program (Matrix Form)       SAS Program Graphics Output (Matrix Form)

Spanish Silver in New World 1720-1800 - Box-Cox Transformation R Program (Matrix Form)         R Program Graphics Output (Matrix Form)

Chapter 6 - Multiple Linear Regression

Practice Problems      Word          PDF

Sections 6.1-6.3 - Estimation and Testing  (RPD Chapters 3 and 4)

Using EXCEL for Matrix Form of Multiple Regression Model - Hotel Energy Consumption   PPT      EXCEL      R Program      Data (.csv)      Description

Multiple Linear Regression - Texas January High Temperatures  (Complete/Reduced Models)

Assessed Winning Probabilities in Texas Hold 'Em

Estimating Demand Elasticity for Sugar 1896-1914            PPT                  EXCEL           Data        Description

Texas January High Temps (n=369 Locations)      EXCEL

Association Between Height and Foot and Hand Lengths in Females  (EXCEL)

Hand Length EXCEL

Section 6.4 - General Linear Tests (RPD Section 4.5)

NFL Point Spreads and Actual Scores (PPT)

Height, Hand Length, and Foot Length for 80 Adult Males

Texas Mean January Temperature (EXCEL)

General Linear Test - Cobb-Douglas Production Function

Sections 6.5-6.6 - Models with Qualitative Variables and Interactions (RPD Section 9.6)

Sections 6.7-6.9 - Models w/Curvature, Response Surfaces, and Trigonometric Models (RPD Chapter 8)

Heat Capacity and Temperature for Solid Hydrogen Bromide         Data          Description

Ice Cream Sensory Evaluations                     EXCEL             Data           Description

Yarn Count and Output for Early 20th Century New England Textile Mills      Data            Description

2013 NBA Player Height and Weight  EXCEL

Container Ship Speed and Fuel Consumption for ship_leg = 1        EXCEL     Data(All ship_legs)      Description

Sine and Cosine Plots

Trigonometric Regression - Tampa Bay Monthly Hotel Revenues

Trigonometric Regression - Shipping Container Throughput by Month

Response Surface Model - Top NASCAR Qualifying Speeds by Track

Response Surface Relating Sugarcane Wine Rating to 3 Factors      Data          Description

R Program       R Text Output    R Graphics Output
SAS Program      SAS Output

Response Relating Mango Wine Ethanol Level to 3 Factors      Data        Description

EXCEL        SAS Program         SAS Output          R Program       R Output

Section 6.10 -  Model Building (RPD Chapter 7)

Mortgage Rates for 18 Cities - Worksheet

R Program for k-fold Cross Validation

Cruise Ship Model Building (Updated)             R Program (Updated)

NBA Odds 2014/2015              Data (.csv)          Description           R Program

Section 6.11 -  Multicollinearity (RPD Chapter 13)

Shaq O'Neal Points/Rebounds - Eigenvalues, Eigenvectors, Principal Components      R Program

Multiple Linear Regression - Standing Heights and Other Stature Attributes for Female Police Officer Candidates (Multicollinearity/Principal Component Regression)

Multiple Linear Regression - China Carbon Emissions and Population Factors 1978-2008 (Multicollinearity/Ridge Regression)

Cruise Ship      Ridge Regression  R  Program         Principal Components R Program

Sections 6.12-6.13 - Models with Heteroskedastic and Correlated Errors (RPD Section 12.5)

Weighted Least Squares Case Study -- Cholesterol Reduction (PPT)

Estimated Weighted Least Squares - Profits and Market Structure for High Advertising Firms

US Wine Sales and Population      Data    Description    SAS Program

Chapter 7 - Nonlinear Regression (RPD Chapter 14)

Practice Problems        Word         PDF

Intrinsically Linear Regression - Cobb-Douglas Production Function

Orlistat Case Study       Data       Description

R Program     R Text Output    R Graphics Output    SAS Program      SAS Text Output      SAS Graphics Output

Salmonella Weighted Nonlinear Least Squares    R Program         R Text Output          R Graphics Output

Solomon  Island Bird Species    Data      Description     R Program     Worksheet

Kentucky Derby Winning Times    Data (.csv)    R Program

Estimated Generalized Least Squares Matrix Algorithm for AR(q) Errors

Chapter 8 - Random Coefficient Regression/General Mixed Linear Models  (RPD Chapter 18)

Practice Problems        Word         PDF

Airline Revenues for 10 Markets 1996-2000 Case Study - PPT         Updated 1/29/2017

Airline Revenues for 10 Markets         Data     Description

SAS Program (proc mixed)                                      SAS Output

WNBA Example (EXCEL)

Chapter 9 - Models Based on Non-Normal Distributions

Logistic Regression - NFL Field Goal Attempts (2003)

Logistic Regression - Pre-Challenger Field-Joint O-Ring Failures and Temperature

Poisson Regression - NASCAR Crash Data (1975-1979)

Poisson Regression with Rates - Traffic Accidents in Finland on Friday the 13th versus Other Fridays by Gender (1971-1997)

Negative Binomial Regression - NASCAR Lead Changes (1975-1979)

Gamma Regression - Napa Valley Marathon Speeds by Age and Gender  (2015)

Beta Regression - Proportion of Prize Money by Race for Ford - NASCAR Races (1992-2000)