STA 6166
Statistical Methods in Research I



Spring 2020 Syllabus

Instructor: Larry Winner
Office: 228 FLO
Office Hours: M 1:30-3:00  Tu 11:15-12:30  W 12:00-1:15  Th 7:45-8:45
E-mail: winner@stat.ufl.edu
Phone: (352) 273-2995

TA: Jin Zhumengmeng (Jin)
Office: FLO 116D
Office Hours: MF 11:50-12:50, 4:00-6:00, Tu 10:30-12:30
E-mail: z.jin@ufl.edu

R Program 1/17/2020


ESP Experiment Data (EXCEL)
 
Applied Statistical Methods Text and Computer Programs


Downloading and Installing R, RStudio, MikTex

Course Notes     R Programs  

Chapter 1 Slides

Chapter 2 Slides

Chapter 3 Slides

Chapter 4 Slides

Chapter 5 Slides

Chapter 6 Slides

Section 7.1 Slides        Section 7.2  Slides

Chapter 8 Slides

Chapter 9 Slides


 
Statistical Tables

Fall 2016 Exam 1 Solutions (Includes 2 Versions)

Fall 2016 Exam 2 Solutions (Includes 2 Versions)       Fall 2016 Exam 3 Solutions (Includes 4 Versions)

Fall 2015 - Exam 1 Solutions       Spring 2016 - Exam 1 Solutions           Spring 2016 - Exam 2 Solutions

Z, t Distributions (Upper Tail Probabilities)   Note: this link is currently broken. Please get the table from the STA 6126 page.

Z-table (Upper Tail Probabilities)      Z-table (Lower Tail Probabilities)

Exam 1 Topics - Spring 2020

Exam 2 Topics - Fall 2017
   

Categorical Practice Problems for Exam 2: 6,8,11,13,15a-c,19,30,33,39,44,46,47,53,55,56

t, Chi Square, F Distributions (Upper .025 Tail Probabilities)

t, Chi Square, F Distributions (Upper .05 Tail Probabilities)

Combined Studentized Range and Bonferroni t-table
           

Exam 3 Topics and Instructions - Fall 2017


Categorical Practice Problems for Exam 3: 1-5,7-10,12,14,15d,16,18,20-29,31,32,34-38,40,42,43,45,48,49,51,52,54,57

t, Chi Square, F Distributions (Upper .05 Tail Probabilities)



Statistical Tables

Critical Values for Wilcoxon Rank-Sum Test

Critical Values for Wilcoxon Signed-Rank Test

t, Chi Square, F Distributions (Upper .025 Tail Probabilities)

t, Chi Square, F Distributions (Upper .05 Tail Probabilities)

Combined Studentized Range and Bonferroni t-table

Bonferroni t-Table

t, Chi Square, F Distributions

Studentized Range Distribution



Practice Problems by Topic


Probability Problems         PDF       Solutions

Inference Concerning a Mean Problems       PDF        Solutions

Inference Concerning 2 Means Problems       PDF        Solutions   

Inference Concerning Variance Problems       PDF        Solutions

1-Way ANOVA Problems          PDF         Solutions

Randomized Block Design Problems        PDF          Solutions   

Categorical Data Problems        PDF           Solutions

Regression Problems             PDF        Solutions 


Old Exams

Practice Exam 1          
Practice Exam 2        Practice Problems for Fall 2008 Exam 3

Fall 2008 - Exam 1          
Fall 2008 - Exam 1 Solutions (One Version)       Fall 2008 - Exam 1 Solutions (Other Version)

Fall 2009 - Exam 1             Fall 2010 - Exam 1           Fall 2011 - Exam 1        Fall 2012 - Exam 1     Fall 2013 - Exam 1    Fall 2015 - Exam 1

Spring 2016 - Exam 1

Fall 2008 - Exam 2         Fall 2009 - Exam 2       Fall 2010 - Exam 2    Fall 2011 - Exam 2   Fall 2012 - Exam 2   Fall 2013 - Exam 2     Fall 2015 - Exam 2  

Spring 2016 - Exam 2


Fall 2008 - Exam 3         Fall 2009 - Exam 3       Fall 2010 - Exam 3     Fall 2011 - Exam 3   Fall 2012 - Exam 3    Fall 2013 - Exam 3    Fall 2015 - Exam 3

Spring 2016 - Exam 3          Fall 2016 - Exam 3            



Fall 2012 - Exam 4         Fall 2013 - Exam 4       Fall 2015 - Exam 4    Spring 2016 - Exam 4

Fall 2006 - Exam 2 Solutions


Introductory Statistics with R
Author: Peter Dalgaard
Published: New York: Springer, 2008 (E-book)
SpringerLink: http://dx.doi.org/10.1007/978-0-387-79054-1














Chapter 1 Slides (PPT)

Chapter 2 Slides (PPT)

Chapter 3 Slides (PPT) - Updated 1/4/2016

NHL Players Body Mass Indexes   (R Program)

Rock and Roll Marathon Speeds              R Program

NFL 2017 Preseason Rosters Height/Weight     R Program

Gasoline Prices March 2005 (EXCEL)

NFL Combine Example               R Program        EXCEL

Chapter 4 Slides (PPT) - Updated 1/4/2016

Common Families of Probability Distributions (PPT)

Philly Rain Data

Philly Rain Random Samples (EXCEL) - First 100 Samples

Philly Rain Sample Means - All 1000 Sample Means, SDs, CIs

Philly Rain Sample Means - Summary

Philly Rain Summary Statistics and Sampling -  R Program

Philly Rain Summary Statistics and Sampling - Text Output

Philly Rain Summary Statistics and Sampling - Graphics Output

Philly Rain Histogram with Smooth Density Plot            R Program       Graphics Output

Cell Phone Radiation          Data       R Program      Worksheet

Las Vegas Casino Square Footage by Activity         Data (.csv)       R Program

Rock and Roll Marathon Speeds              R Program

Cold Fish Data     Description


Cross-Tabulations for Cold Fish Data - R Program      Text Output     Graphics Output
Cross-Tabulations for Cold Fish Data - SAS Program     Text Output     Graphics Output


Chapter 5 Slides (PPT)

Power Calculation for Test for a Single Mean Based on Z-distribution   (EXCEL)

NHL BMI - Z- and t-distributions   (R)

Application of the Bootstrap - Average Film Shot Lengths  (PPT)

Chapter 6 Slides (PPT)

R Program for Sampling Distribution of Mean Differences (NHL & EPL BMI) - Standard Error Corrected

NHL/NBA Body Mass Index     Worksheet    R Program  (Updated to include unequal variance case for Comparing means)

NHL/NBA Body Mass Index (Includes Rank-Sum Test)   R Program     Worksheet

Female/Male Marathon Velocities (Includes Rank-Sum Test)   R Program

Permutation Tests - WNBA/NBA Body Mass Indexes and English Premier League 2012 Home Field Advantage

Permutation Test (2 Independent Samples) - 2013 WNBA/NBA Body Mass Indexes       Data (.csv)         R Program                

Permutation Test (Paired Samples)  - 2012 English Premier  Soccer League Home Field Advantage         Data (.csv)           R Program  


Problem 6.18

Experimental Design Issues (PPT)

Chapter 7 Slides  (PPT)

NHL BMI   R Program               Rock and Roll Marathon    R Program       PGA/LPGA 2008 Driving Distance   R Program

EXCEL Spreadsheet for Levene's Test, Based on Medians (Brown-Forsythe version) (<=6 Groups, N<=1000)

SAS Program for Levene's Test, Based on Medians (Brown-Forsythe version)    Output

R Program for Levene's Test, Based on Medians (Brown-Forsythe version) - Must Download lawstat library       Output


Chapters 8 & 9 Slides  (PPT)

EXCEL Spreadsheet for 1-Way ANOVA based on Summary Statistics

NHL/NBA/EPL Body Mass Indices    EXCEL    R Program

Drumstick Weight              Data    Description     R Program      EXCEL (WIP)

Amoeba Analysis (PPT)

Amoeba Analysis (EXCEL)          Amoeba Orthogonal Contrasts (EXCEL)

Connection Between Independent Sample t-test and 1-Way ANOVA when t=2  (EXCEL)

Iron Depth  (EXCEL)

Iron Depth  (Data)

Iron Depth  (Description)

Amoeba Data

Amoeba Data Description

Amoeba Dataset in EXCEL (formatted as described in the EXCEL instructions for the Data Analysis ToolPak)

SAS Code for Amoeba ANOVA    Graph1      Graph2

R Code for Amoeba ANOVA        Graphs       R Code for Amoeba Bonferroni Simultaneous CI's    Output

Excel Spreadsheet for Amoeba Completely Randomized Design

Arsenic Worksheet (WORD)

Chapter 14 Slides  (PPT)

Caffeine Effect on Endurance (PPT)

Caffeine    Data     Description

Caffeine Dataset in EXCEL (formatted as described in the EXCEL instructions for the Data Analysis ToolPak)

SAS Code for Caffeine Analysis    Graph1    Graph2

R Code for Caffeine Analysis         Graphs       R Code for Caffeine Bonferroni Simultaneous CI's      Output

Excel Spreadsheet for Caffeine Randomized Block Design

Chopstick Length Experiment   Data    Description     Worksheet    EXCEL       R Program

Chapter 10 Slides (PPT)

Categorical Data Analysis Examples    PDF    WORD       EXCEL    R Program    R Output

NBA 2014/15 Point Spread and Over/Under Results     Data     Description       R Program

EXCEL Spreadsheet for Comparison of 2 Proportions

Multinomial Distribution - Soccer Game Outcomes for 5 European Premier Leagues

Goodness of Fit - Poisson Distribution for Brazil Soccer League Total Goals per Game - 2013

EXCEL Spreadsheet for Chi-Square Test

Relative Risks & Odds Ratios - John Snow's Cholera Investigations


Chapter 11 Slides  (PPT)

NBA Height and Weight    PPT      EXCEL 

Orlando July Mean High Temperature 1960-2015  (EXCEL)

Hair Growth/Temperature EXCEL Worksheet

Simple Regression Simulation  (EXCEL)

Variance Stabilizing Transformations  (PPT)

Math Score/LSD Concentration Data

Math Score/LSD Concentration Description

Math Score/LSD Concentration SAS Code

Math Score/LSD Concentration R Code

Paddy Field Data Worksheet       EXCEL        Data      Description

Lack of Fit F-Test (EXCEL)

Berry Chewiness Lack-of-Fit Computations (EXCEL)

Chapter 12 Slides (PPT)

Texas Weather Complete/Reduced Models  (PPT)

Texas Weather        Data      Description       SAS Program     SAS Output     R Program      R Output

Texas Weather EXCEL Regression Models (n=369 Sites)

NHL Weight, Height, and Age Model    R Program

Worksheet       EXCEL Spreadsheet

Texas January Mean Temperature (n=369)  EXCEL

Geomechanical Properties of Rocks in Tunneling    Data (.csv)   R Program

Egyptian Cotton   Worksheet     EXCEL Spreadsheet       R Program

NASA Propellant Mixture/Nozzle Area Experiment (EXCEL)

Movie Revenue Data (EXCEL)

Textbook Datasets    www.stat.ufl.edu/~winner/sta6166/datasets/Excel/


Assignments

Assignment 1: Due Wednesday, September 13, 2017 (Section 15GH); Thursday, September 14, 2017 (Section 5842)

Cell Phone Radiation          Data       R Program   EXCEL Spreadsheet    


Assignment 2: Due Wednesday, October 4, 2017 (Section 15GH); Thursday, October 5, 2017 (Section 5842)

Airline Dataset (EXCEL file)

Airline Dataset   Text File    Description

R Program to Read in Airline Dataset and Create Percent Change Variable and More

R Program for Parts 2 and 3

Animal Feed Dataset (Text file)

Animal Feed Dataset (EXCEL File)


LPGA 2003 Data (Golfer (Length=25), US Open Total, British Open Total)





Atlanta - October 2004 Flight times to Western cities (City, City ID, Flight Time, Distance)

Pests Effects on Sugar-Cane Yields  (Treatment, Weight Yield, Juice Yield)



Assignment 3: Due Thursday, November 2, 2017 (Section 5842); Friday, November 3, 2017 (Section 15GH)

Abdominal Leakage in Breast Reconstruction Surgery           Data (.csv)        R Program  

Polyphenols in Beer              Data (.csv)      R Program 

Theophylline Dataset  (Text file)             R Program        SAS Program          

Theophylline Dataset  (EXCEL File)

Theophylline Dataset for ease of plotting  (EXCEL)

Theophylline Description File

Hominy Sales Data  (Text File)

Hominy Sales Data (EXCEL File)




Assignment 4: Due Monday, December 4, 2017 (Section 15GH); Tuesday, December 5, 2017 (Section 5842)

R Program for Parts 1, 2, 4, 5       EXCEL Spreadsheets for Part 3

Shrinkage Data (Text File)

Shrinkage Data (EXCEL File)


Mice Growth Data  (Text File)

Mice Growth Data  (EXCEL File)

Hong Kong Building Cost       Data          Description




Practice Problems (Do not turn in)
Chapter 1: 6
Chapter 2: 9,14,26,28
Chapter 3: 4,7,8,12,15,21,29,33,35,39,41
Chapter 4: 1,27,29,31,33,35,43,45,73,75,77,83,85,89,99,101,103
Chapter 5: 13,21,24,27,33,39,41,52,54,62,64,66,68,76
Chapter 6: 5,7,11,27,35,43,47,51,57,59
Chapter 7: 7,19,20,22,24
Chapter 8: 7,27,29,31,33,35,37,39,41,43
Chapter 9: 
Chapter 10: 
Chapter 11: 
Chapter 12: 





Computing Resources/Information

Summary Statistics and Simple Plots in SAS/EXCEL/JMP

Generating Random Samples in SAS/EXCEL/JMP/SPSS

SAS

Program to obtain summary statistics, scatterplot, and boxplots  (.sas)

Description of SAS Procedures for Common Statistical Analyses


R

Running R on Windows and Macs (Source: Stanford University Social Science Data and Software)

Running Programs in R (Change all references to STA6167 to STA6166)

R Batch File

Introduction to R (Work in Progress)


SPSS

SPSS General Instructions for Procedures

EXCEL

Some Useful EXCEL Functions

Obtaining Summary Statistics and Plots by Groups (Factor Levels)  in EXCEL

EXCEL General Instructions for the Data Analysis ToolPak

Generating Histograms in EXCEL

EXCEL Program for Summary Statistics/Boxplot (n<=1000)

EXCEL Program for Normal Probability Calculations/Plot (n<=1000)

EXCEL Program for 2-sample t-test/CI's (and test of equal variances) based on summary statistics

EXCEL Program for 1-Way ANOVA based on summary statistics

EXCEL Program for 2-sample z-test and CI's for proportions

EXCEL Program for Chi-Square Test

EXCEL Program for Simple Linear Regression Including CI's/PI's for Means and Individuals (n<=1000)

EXCEL Program for Levene's Test (Up to 6 Groups, 1000 Total Observations)