EXERCISES AND REVIEW QUESTIONS
PART ONE: GETTING STARTED
Before attempting these questions read through Chapters 1, 2 and 3 of the SPSS Survival Manual.
Designing a study
1.1 When choosing a scale for use in research, what are the two main characteristics you need to be aware of?
1.2 What are the two main types of reliability of a scale?
1.3 What measure is often used to indicate the internal consistency of a scale?
1.4 If you read that a scale had a Cronbach alpha value of .4 what would you think?
1.5 There are many different types of validity of a scale. Describe three.
1.6 If you were designing a questionnaire and wanted to measure respondents' ages, which of the following formats, (a) or (b), would be better? Explain your choice.
(a) Please tick one of the following categories to indicate your age: |
|
|
|
18-30 |
___ |
||
31-45 |
___ |
||
46-60 |
___ |
||
61-80 |
___ |
||
81+ |
___ |
||
(b) Please indicate your age in years: _______ |
|
|
Preparing a codebook
1.7 There are a number of rules that must be obeyed when choosing variable names to use in SPSS (see Chapter 2). Use the following questions to review some of these rules.
(a) Can a variable name start with a number?
(b) What is the maximum number of characters that a variable name can have?
(c) Can variable names contain spaces?
1.8 For each of the following, indicate which is a suitable variable name. If not suitable, explain why.
(a) *q1
(b) and
(c) religion
(d) martialstatus
(e) a110q
(f) incom.hous
(g) 5optim
(h) optim5
Getting to know SPSS
1.9 In Chapter 3 of the SPSS Survival Manual you are taken on a guided tour of the basics of SPSS. The best way to learn this program is by using it. Open the data file survey.sav. To get you familiar with the program, try some of the activities below.
(a) Using the Data Editor window, go to the bottom of the file (using the scroll bar) and find out the ID number of the last case in the file.
(b) Explore the different menus available in SPSS.
Click on Graphs and find out what types of graphs are available.
Click on Analyze and discover the wide range of statistics available.
(c) Practise using the dialogue and sub-dialogue boxes by clicking on Analyze and then on Frequencies. Next, highlight the following variables and move them into the Variables box: sex, age, marital, educ, op1, op2, op3, op4, op5, op6. Then, move these variables out of the Variables box and use Cancel to escape from the dialogue box.
PART TWO: PREPARING THE DATA FILE
Before attempting these questions read through Chapters 4 and 5 of the SPSS Survival Manual.
Creating a data file and entering data
2.1 Before you start using SPSS to prepare a data file and run analyses, it is important to check the SPSS Options. This is covered at the start of Chapter 4 of the SPSS Survival Manual. Use the following questions to review this material.
(a) How do you change the order in which your variables are displayed (e.g. alphabetical order instead of file order)?
(b) How do you change the number of decimal places that are used for a particular variable?
(c) If you wished to change the format used to display the output tables, where in the Options would you do this?
2.2 The best way to learn how to set up an SPSS data file is to actually work through each of the steps yourself. At the back of the SPSS Survival Manual you will find a codebook for suver.sav (the file that was used to generate some of the output throughout the book) and the questionnaire that this came from. Use this codebook to set up a data file from scratch. Follow the procedures described in Chapter 4 to define each of the variables listed in the codebook. When you have finished, enter some pretend data—you can generate this data yourself by completing the survey presented in the appendix at the back of the Manual. Save the file as mydata.sav.
Screening and cleaning the data
It is very important that you check your data file for errors before beginning statistical analyses. These exercises give you practice with the process of screening your data and correcting errors.
2.3 Using the instructions provided in Chapter 5 of the SPSS Survival Manual, check the following categorical variables for out-of-range cases. You will need to open the survey.sav data file. Use the codebook provided in the appendix at the back of the Manual to guide you.
(a) sex
(b) marital status (marital)
(c) children (child)
(d) major source of stress (source)
(e) do you smoke? (smoke)
2.4 Using the instructions provided in Chapter 5 of the SPSS Survival Manual, check the following continuous variables for out-of-range cases. You will need to open the survey.sav data file. Use the codebook provided in the appendix at the back of the Manual to guide you.
(a) each of the items in the Optimism scale (op1 to op6)
(b) each of the items in the Life Satisfaction scale (lifsat1 to lifsat5)
(c) each of the items in the Perceived Stress scale (pss1 to pss10)
PART THREE: PRELIMINARY ANALYSES
Before attempting these questions read through Chapters 6, 7, 8, 9 and 10 of the SPSS Survival Manual.
Descriptive statistics
The first step in the analysis of any data file is to obtain descriptive statistics on each of your variables. These can be used to check for out-of-range cases, to explore the distribution of the scores, and to describe your sample in the Method section of a report.
3.1 Use the instructions in Chapter 6 and Chapter 7 of the SPSS Survival Manual to answer the following questions concerning the variables included in the survey.sav data file.
(a) What is the mean age of the sample? What is the age range of the sample (minimum and maximum values)?
(b) What is the percentage of males and females in the sample? Did any of the sample fail to indicate their gender?
(c) What percentage of the sample were smokers?
(d) Assess the distribution of scores on the age variable. How normal is the distribution?
(e) Inspect the distribution of scores on the Total Negative Affect scale. How normal is the distribution? Are there any cases that you would consider outliers?
(f) Following the instructions in Chapter 7 generate a histogram to explore the distribution of scores on the Optimism scale (Total Optimism). How closely does it resemble the normal curve?
(g) Generate a scatterplot to explore the relationship between scores on the tpcoiss and tposaff. Describe and interpret what this scatterplot tells you about the relationship between these two variables.
Using graphs to describe and explore the data
3.2 Using the data file survey.sav, follow the instructions in Chapter 7 of the SPSS Survival Manual to obtain the following graphs.
(a) histogram of scores on the Total Self-esteem scale (tslfest)
(b) bar graph of scores on the Total Self-esteem scale (tslfest) for males and females (sex), across the three age groups (agegp3)
(c) scatterplot of scores on age and total scores on the Optimism scale (toptim)
(d) boxplot of scores on the Total Negative Affect scale (tnegaff) for males and females
(e) line graph of the Total Self-esteem scale (tslfest) for males and females (sex), across the three age groups (agegp3)
Manipulating the data
This section includes a number of activities to help you review, and to apply, the material covered in Chapter 8 of the SPSS Survival Manual. You should read through this chapter before attempting these questions.
3.3 One of the things that many students initially find difficult is being able to identify when items in a scale need to be `reversed' before being added to give a total score. It is essential that this is done properly, otherwise the values obtained for the total scale do not mean anything. To give you some practice at this we will use the Perceived Control of Internal States scale (an 18-item scale on p. 276 of the Manual). The aim of this exercise is to identify which items to reverse (not to actually carry out the reversals on the items in the survey.sav data file as these have already been correctly reversed).
(a) Identify which items in the scale would need to be reversed so that high total scores would indicate high levels of perceived control.
As discussed in Chapter 8 of the SPSS Survival Manual the next step is to calculate total scores by adding together the items that make up each scale. The following two exercises give you some practice with this process.
3.4 Use the procedures covered in Chapter 8 to create (compute) the following new total scale scores. Create new total subscale scores for the Perceived Control of Internal States scale (this scale is shown on p. 276).
(a) To calculate the Emotion subscale, add items pc1 to pc6. Call this new variable pcemot.
(b) To calculate the Thoughts subscale, add items pc7 to pc12. Call this new variable pcthou.
(c) To calculate the Physical subscale, add items pc13 to pc18. Call this new variable pcphys.
3.5 In this exercise a variable with eight different responses will be recoded into another variable which has only two possible values. You will need to use information from the questionnaire and codebook provided in the appendix of the Manual concerning the marital status question. You will also need to open the survey.sav data file.
(a) Run Frequencies on the variable marital status (marital) to find out how many people fall into each of the categories.
(b) Follow the instructions in Chapter 8 to create a new variable (relship) from the variable in the data file (marital). The new variable will only have two values, indicating whether a person is or is not in a relationship.
In the first group include people who are not in a relationship (single, separated, divorced, widowed). These will be coded 1.
In the second group include people who are in a relationship (steady relationship, living with partner, married for the first time, remarried). These will be coded 2.
(c) Run Frequencies on the new variable (relship) and compare this with the results of the Frequencies on the original variable (marital). Are there the correct number of cases in each of the new groups?
Checking the reliability of a scale
If you use scales or standardised measures in your research (this is common in psychological research) it is important to assess the reliability (internal consistency) of the scores on the scale in your sample. The following exercise gives you some practice in this process.
3.6 Follow the procedure in Chapter 9 of the SPSS Survival Manual to assess the reliability of the following scales. You will need to refer to the codebook in the appendix to identify the items that make up each of the scales.
(a) Optimism scale (op1 to op6)
(b) Perceived Stress scale (pc1 to pc18)
(c) Self-esteem scale (sest1 to sest10)
Choosing the right statistic
Many students find it difficult to identify which statistical technique to use to address their research questions. Chapter 10 of the SPSS Survival Manual will help you with this process.
3.7 For each of the following research situations identify which statistical technique could be used.
(a) Ann is interested in exploring the possibility of gender differences in levels of perceived stress.
(b) Ann would also like to explore the relationship between optimism and perceived stress. She suspects that higher levels of optimism would be associated with lower levels of perceived stress.
(c) Bill is interested in exploring the effect of both sex and age group on self-esteem scores. He is interested in the effect of each variable individually, and any interaction that may exist.
(d) Celia would like to know which is a better predictor of negative affect: optimism or self-esteem.
(e) If Celia were also concerned that age may be a confounding variable, how would she go about controlling for this variable in the analyses?
(f) David is interested in the question: Are younger people (18-29yrs) more likely to be smokers than older people (30-44yrs or 45+yrs)?
(g) Ellie conducts a study to find out if there is a significant change in depression levels across three time periods (prior to an intervention, after the intervention and at a three-month follow-up).
(h) Frank, a lecturer, wants to know who performed better on a statistics exam, males or females?
(i) Frank is also interested to find out if there was a relationship between age and exam score.
3.8 Review each of the situations listed in Exercise 3.7 and consider what non-parametric technique you would use if it was not appropriate to use a parametric test. (Hint: Not all will have a non-parametric alternative.)
PART FOUR: STATISTICAL TECHNIQUES TO EXPLORE RELATIONSHIPS AMONG VARIABLES
You should review the material in the introduction to Part Four and in Chapters 11, 12, 13 and 14 of the SPSS Survival Manual before attempting these exercises.
Correlation
4.1 Using the data file survey.sav follow the instructions in Chapter 11 to explore the relationship between age and life satisfaction (tlifesat). Present the results in a brief report.
4.2 Use the instructions in Chapter 11 to generate a full correlation matrix to check the intercorrelations among the following variables.
(a) age
(b) perceived stress (tpstress)
(c) positive affect (tposaff)
(d) negative affect (tnegaff)
(e) life satisfaction (tlifesat)
4.3 Gill, a researcher, is interested in exploring the impact of a number of variables on the experience of positive affect (tposaff), negative affect (tnegaff) and perceived stress (tpstress). In particular, he is interested in the impact of age, self-esteem (tslfest), optimism (toptim), control of internal states (tpcoiss) and mastery (tmast).
(a) Follow the instructions in Chapter 11 of the SPSS Survival Manual to generate a condensed correlation matrix which presents the correlations between positive affect, negative affect and perceived stress with each of the other variables listed.
(b) Repeat the analysis in (a), but first split the sample by sex. Compare the pattern of correlations for males and females. Remember to turn off the Split File option after you have finished this analysis.
Partial correlation
4.4 Follow the procedures detailed in Chapter 12 of the SPSS Survival Manual to calculate the partial correlation between optimism (toptim) and perceived stress (tpstress) while controlling for the effects of age. Compare the zero order correlations with the partial correlation coefficients to see if controlling for age had any effect.
Multiple regression
4.5 There are three main types of multiple regression analyses. What are they? When would you use each approach?
4.6 You wish to run a standard multiple regression to explore the impact of three independent variables on one dependent variable. What would be the minimum sample size that you should have? (Hint: Use the formula presented in Chapter 13 of the SPSS Survival Manual.)
4.7 As part of the preliminary screening process it is recommended that you inspect the Mahalanobis distances produced by SPSS. What do these tell you?
4.8 The example used in the SPSS Survival Manual to demonstrate the use of standard multiple regression compares two control measures (PCOISS and Mastery) in terms of their ability to predict perceived stress. Repeat this analysis, this time using negative affect (tnegaff) as your dependent variable. Use the output to answer the following questions.
(a) Overall, how much of the variance in negative affect is explained by these two variables?
(b) Which of the independent variables (tpcoiss, tmast) is the best predictor of negative affect?
(c) Do both variables make a statistically significant contribution to the prediction of negative affect?
4.9 Follow the instructions on p. 146 of the SPSS Survival Manual to perform an hierarchical multiple regression, this time using negative affect as the dependent variable.
Factor analysis
4.10 There is some controversy in the literature concerning the underlying factor structure of one of the scales included in the questionnaire presented in the appendix of the SPSS Survival Manual. The Optimism scale was originally designed as a one-dimension (factor) scale which included some positively worded items and some negatively worded items. Recent studies suggest that it may in fact consist of two factors representing optimism and pessimism. Conduct a factor analysis using the instructions presented in Chapter 14 to explore the factor structure of the optimism scale (op1 to op6).
PART FIVE: STATISTICAL TECHNIQUES TO COMPARE GROUPS
Before attempting these questions read through the introduction to Part Five and Chapters 15-21 of the SPSS Survival Manual.
T-tests
5.1 Using the data file survey.sav follow the instructions in Chapter 15 of the SPSS Survival Manual to find out if there is a statistically significant difference in the mean score for males and females on the Total Life Satisfaction Scale (tlifesat). Present this information in a brief report.
5.2 Using the data file experim.sav apply whichever of the t-test procedures covered in Chapter 15 of the SPSS Survival Manual that you think are appropriate to answer the following questions.
(a) Who has the greatest fear of statistics at time 1, males or females?
(b) Was the intervention effective in increasing students' confidence in their ability to cope with statistics? You will need to use the variables, confidence time1 (conf1) and confidence time2 (conf2). Write your results up in a report.
(c) What impact did the intervention have on students' levels of depression?
One-way analysis of variance
For exercises 5.3, 5.4 and 5.5, you will need to open the data file survey.sav.
5.3 Perform a one-way between-groups ANOVA to compare the levels of perceived stress (tpstress) for the five different age groups (agegp5), 18-24yrs, 25-32yrs, 33-40yrs, 41-49yrs and 50+yrs.
5.4 Perform post-hoc tests to identify where the significant differences between the groups occur.
5.5 Perform planned comparisons to compare the Perceived Stress scores for people aged 18-29yrs with those 45+yrs. Use the agegp3 variable, groups 1 and 3.
For the following exercise you will need to open the data file experim.sav.
5.6 Use one-way repeated measures ANOVA to compare the Fear of Statistics scores for the three time periods (time1, time2 and time3). Inspect the means plots and describe the impact of the intervention and the subsequent follow-up three months later.
Two-way between-groups ANOVA
5.7 For this exercise you will need to open the data file survey.sav. Follow the instructions in Chapter 17 of the SPSS Survival Manual to conduct a two-way ANOVA to explore the impact of sex and age group on levels of perceived stress. The three variables you will need are sex, agegp5 and tpstress.
(a) Interpret the results. Is there a significant interaction effect? Are the two main effects significant?
(b) Write up this analysis and the results in a report. (Don't forget to report the means and standard deviations for each group.)
Mixed between-within subjects analysis of variance
5.8 In Chapter 18 of the SPSS Survival Manual we explored the impact of two different intervention programs (maths skills/confidence building) on participants' fear of statistics. We found that both interventions were equally effective in reducing participants' fear—that is, we found no differences between groups—but a significant difference across the three time periods. Repeat these analyses, but this time use confidence scores as the dependent variable. You will need to use the following variables: group, conf1, conf2 and conf3.
(a) Is there a significant interaction effect between type of intervention (group) and time?
(b) Is there a significant main effect for the within-subjects independent variable, time?
(c) Is there a significant main effect for the between-subjects independent variable, group (maths skills/confidence building)?
Multivariate analysis of variance
5.9 How does MANOVA differ from ANOVA?
5.10 In Chapter 19 of the SPSS Survival Manual it is recommended that you check the Mahalonobis distances before proceeding with MANOVA. What does this allow you to check for?
5.11 Multicollinearity can be a problem when you are intending to use MANOVA. What does this word mean and how can you check for it?
5.12 Which assumption is Box's Test used to assess?
5.13 Follow the procedure detailed in Chapter 19 of the SPSS Survival Manual to perform a MANOVA to explore positive and negative affect scores for the three age groups (18-29yrs, 30-44yrs, 45+yrs). The three variables you will need are tposaff, tnegaff, agegp3. Remember to check your assumptions first.
5.14 Write a brief report of your results of the procedure you conducted in Exercise 5.13.
Analysis of covariance
5.15 Under what circumstances would you want to consider using analysis of covariance?
5.16 What issues do you need to consider when you are selecting possible covariates?
5.17 Using the experim.sav data file, perform the appropriate analyses (including assumption testing) to compare the confidence scores for the two groups (maths skills, confidence building) at time 2, while controlling for confidence scores at time 1. The variables you will need are group, conf1, conf2.
5.18 Perform a two-way analysis of covariance to explore the question: Does gender influence the effectiveness of the two intervention programs designed to increase participants' confidence in being able to cope with statistics training? You will need to assess the impact of sex and type of intervention (group) on confidence at time 2, controlling for confidence scores at time 1.
Non-parametric statistics
5.19 What is the difference between parametric techniques and non-parametric techniques?
5.20 What factors would you consider when choosing whether to use a parametric or a non-parametric technique?
5.21 For each of the following parametric techniques indicate the non-parametric alternative (if one exists).
(a) one-way between-groups ANOVA
(b) Pearson's product-moment correlation
(c) independent samples t-test
(d) multivariate analysis of variance
(e) one-way repeated measures ANOVA
(f) paired samples t-test
(g) partial correlation
(h) one-way repeated measures ANOVA
5.22 Choose and perform the appropriate non-parametric test to address each of the following research questions.
(a) Using the survey.sav data file find out whether smokers are significantly more stressed than non-smokers. The variables you will need are smoke and total perceived stress (tpstress).
(b) Using the survey.sav data file compare the self-esteem scores across the three different age groups (18-29yrs, 30-44yrs, 45+yrs). The variables you will need are tslfest and agegp3.
(c) Using the survey.sav data file explore the relationship between optimism and negative affect. The variables you will need are toptim and tnegaff.
(d) Using the survey.sav data file explore the association between education level and smoking. The variables you will need are educ2 and smoke. Check the codebook and the questionnaire in the appendix of the SPSS Survival Manual for details on these two variables.
(e) Using the experim.sav data file compare the depression scores at time 1 and the depression scores at time 2. Did the intervention result in a significant change in depression scores? The variables you will need are depress1 and depress2.
(f) Using the experim.sav data file compare the depression scores for the three time periods involved in the study (before the intervention, after the intervention and at the three-month follow up). The variables you will need are depress1, depress2 and depress3.
CONSOLIDATION EXERCISES
The remaining questions give you the opportunity to not only practise using SPSS, but also to choose which analyses can be used to address different research questions. Run whatever analyses or procedures you think appropriate to answer the following questions. Wherever appropriate, support your conclusions with graphs. Consider how you would present your answers to these questions in a report.
Part I: For this set of questions you will need to open the survey.sav data file.
(a) Obtain the information necessary to describe, in a Methods section of a report, the demographic characteristics of the sample used in the data file survey.sav. In particular describe:
the number (and percentages) of males and females in the sample
the age range of the sample and the mean and standard deviation for age
the number (and percentages) of married, single and divorced participants
(b) Are scores on the Perceived Control of Internal States scale (tpcoiss) normally distributed?
(c) What is the ID number of the respondent(s) with the lowest score on the self-esteem scale (tslfest)?
(d) How many people in the sample smoke?
(e) Who is most stressed: males or females (use the Total Perceived Stress scale scores, tpstress)?
(f) Do older people have higher levels of self-esteem (use the variable tslfest)? (Hint: there are two different ways that this question can be addressed.)
(g) Are optimists happier? Or, rephrased: Is there a relationship between a person's level of optimism (toptim) and their scores on the Positive Affect scale (tposaff)?
(h) Do smokers and non-smokers differ in terms of their perceived stress (tpstress) or their self-esteem (tslfest)?
(i) How strong is the relationship between perceived stress (tpstress) and overall life satisfaction (tlifesat)?
(j) Are younger people more likely to smoke than older people?
(k) In this sample, how many people fell into the age category 50+ (use agegp5)? Did these people have higher or lower mean scores on the Optimism scale (toptim) than people in the 18-24 yr age group?
(l) Is there a linear (straight-line) relationship between control of internal states (tpcoiss) and perceived stress (tpstress)? Is this relationship positive or negative? How strongly are these two variables related?
Part II: For the following questions you will need to open the experim.sav data file.
(a) Using the experim.sav data file find out whether there is a significant change in depression levels across the three time periods, and whether this pattern was the same for males and females.
(b) Who performed better on the statistics exam—males or females?
(c) Is there a relationship between age and exam score? Describe the nature of this relationship.
(d) Review each of the questions listed in both Part A and Part B above and consider what non-parametric technique you would use if it was not appropriate to use a parametric test. (Hint: Not all will have a non-parametric alternative.) Follow the instructions provided in Chapter 21 to carry out any non-parametric analyses that are appropriate. Compare the conclusions you obtain using the parametric and non-parametric alternatives.