Clarence (Lance) Gravlee's Home Page

Home > Teaching > SYG 3300 > Assignments > Lab 3

Clarence Gravlee, Instructor
Office hours: Wed., 12:30-2:00p, 11/2125
Class: Mon. and Wed., 9:00-12:00, 3/1371
Lab: Mon., 12:40-2:20, 2/2082


Announcements

Syllabus

Assignments

Lab & Class Notes

Lab 3: Measures of Central Tendency and Dispersion; Graphs

Also available as PDF document (requires free Acrobat Reader)

Note: You may submit this lab as an email attachment,
or fax it to (603) 963-0087, by 9:00 a.m. Monday, May 29

1.  For which of the following variables would it be appropriate to obtain a histogram?

  1. Miles driven to work each week
  2. Attitude toward affirmative action
  3. Blood pressure
  4. Annual household income
  5. Happiness of marriage
  6. Region of country
  7. Subjective social class identification
  8. Age in years

2.  For each of the following variables, indicate which measure(s) of central tendency would be permissible (place an X in the appropriate blank).

Mode Median Mean
  a.   Political party identification _____ _____ _____
  b.   Level of occupational prestige _____ _____ _____
  c.   Annual number of deaths from AIDS _____ _____ _____
  d.   Religious identification _____ _____ _____
  e.   Violent crime rate _____ _____ _____
  f.    Age in years _____ _____ _____
  g.   Level of support for US foreign aid _____ _____ _____
  h.   Number of consecutive alcoholic drinks _____ _____ _____

For questions 3-5, use the Explore procedure to produce descriptive statistics and plots.  You are required to print out and hand in the output from your work.

3.  Find the variable AGE in the dataset named CH10END.

  1. Examine the stem-and-leaf plot.  How would you describe the distribution of this variable?  Is it approximately normal, or is it skewed to the left or right?
  2. Report the mean and the median for AGE.  Which of these measures would you use to describe central tendency?  Why did you make this choice?
  3. Report the range, interquartile range, and standard deviation for AGE.  Based on the standard deviation, roughly two-thirds of all cases fall between which ages?

4.  Find the variable CHILDS in the dataset named FAMILY96.

  1. Examine the boxplot.  How would you describe the distribution of this variable?  Is it approximately normal, or is it skewed to the left or right?  Are there any outliers?
  2. Report the mean and median for CHILDS.  Which of these measures would you use to describe central tendency?  Why did you make this choice?
  3. Report the range, interquartile range, and standard deviation for CHILDS.  Which of these measures best summarizes the variability in number of children?

5.  Find the variable TVHOURS in the dataset named CH10END.

  1. Examine the boxplot.  How would you describe the distribution of this variable?  Is it approximately normal, or is it skewed to the left or right?  Are there any extreme outliers?
  2. Report the mean and median for TVHOURS.  Which of these measures would you use to describe central tendency?  Why did you make this choice?
  3. Report the range, interquartile range, and standard deviation for TVHOURS.  Which of these measures best summarizes the variability in number of hours watching television?

6.  You can use data from previous years of the GSS to answer questions about change over time.  First, find the variable EDUC in the dataset CH10END.  Calculate the mean and standard deviation, and report both statistics below.  Then, find the same variable in the dataset from six years earlier, GSS90a.  Calculate and report the mean and standard deviation again.  Is there any difference between 1990 and 1996?  If we used GSS data from different years in this way, which type of longitudinal design would it be?

 

 
home | cv | papers | research | teaching | links | contact
updated 10.18.02