PROGRAM ON RESEARCH HARVARD UNIVERSITY

` Introduction to Survey Research ` The Survey Process ` Survey Resources at Harvard ` , Coverage, and Nonresponse ` Thinking About Modes April 14, 2009 ` Question Wording

`Representation ◦The make inference to a larger population ◦Various population characteristics are represented in research data the same way there are present in population ◦Example: General population survey to estimate population characteristics Reporting `Realism Research Survey ◦A full picture of subjects emerges And ◦Relationships between multiple variables and multiple ways of looking at Methods the same variables can be studied Theories Analysis ◦Example: A qualitative case study to evaluate the nature of democracy in a small town with community meetings

` ◦Variables which are not important in model are completely randomized ◦Effects of non-randomized variables can be tested ◦Example: Randomized clinical trial to test effectiveness of new cancer drug

Integrated Example: Surveys and the Research Process

Theories About Things `Good Measures: ◦Questions or measures impact your ability to study concepts ◦Think carefully about the underlying concepts a survey is trying to measure. Do the survey questions do a good job of capturing this? ◦The PSR Tip Sheet on Design contains good ideas on Concepts Population External Validity how to write survey questions.

Specification Error ` Measures Good Samples: ◦Samples give you the ability to generalize your findings Survey Errors ◦Think carefully about the population you are trying to generalize

Internal Validity Internal your findings to. Does your sample design do a good job of Data Respondents representing these people? ◦The PSR Tip Sheet on Survey Sampling, Coverage, and Nonresponse contains thinks to think about in designing or evaluating a sample. Relationships Between Things ` Systematic method of ` Usually use samples •Getting Started: ` Designed to measure things http://psr.iq.harvard.edu/getting_started ◦ Attitudes ◦ Behaviors •General Resources: ` Create statistics •http://psr.iq.harvard.edu/internet_resources ◦ Descriptive ◦ Analytic •Tips and Tools: •http://psr.iq.harvard.edu/type_psr_resource/psr_survey_toolbox

Chase H. Harrison

CGIS N-407 Office hours: Tuesday 2:30 – 4:30 And by Appointment [email protected] (617) 384-7251

`Question-Level (Micro) Data

`Roper Center IPoll

`Survey-Level Data `Polling the Nations ICPSR `National Election Studies Roper Center

` Odum Institute European Union Guide to Survey Data Archives ` and Centers

` Survey Design Perspective ◦ Steps to use in implementing a survey

` Survey Quality Perspective ◦ Sources of error in surveys

` Good designs take quality into account ` Good reports focus on error `A attempts to collect data from all members Population of a population. `Random samples let you use collect data from a portion of a population and use sampling statistics to Sample Frame generalize your findings to a large population. Nonresponse Error

Respondents

`Random (Probability) Samples: •Based on ◦Based on probability theory ◦Allow generalization ◦Sample statistics can be calculated xSample records are drawn from a well-specified frame • xSample records are drawn according to random procedures Describes Variability xEach sample record has a known probability of selection

`Non-Random Samples: • ◦Do not meet the above criteria Applies From Respondents to Sample Frame ◦Inference can only be made to itself ◦Sample error cannot be calculated xFrame is not well specified xBenefits: xConvenient and cost effective xCan be used for idea generation

`List or a set of procedures `People excluded from sample frame `Records Don’t Have to Lead to Respondents `Typical sampling statistics assume no coverage `Sometimes Requires Two Stages of Selection error `: ◦Proportion Excluded ◦Differences Between Excluded and Included •Sample Members Who Do Not Respond •Up Front •Reasons: •Call Backs xUnable •Refusal Conversions x Unavailable •Incentives xUnwilling

◦Bias: •Back End xProportion Excluded •Weighting xDifferences Between Excluded and Included •Statistical Adjustments

•Self Administered •Internet •Mail •Interviewer Administered •Telephone •In-Person

` ` Traditional Modes: CAPI Computer Assisted Personal Interviewing ◦ Face-to-face ` CASI Computer Assisted Self Interviewing

◦ Mail x SAQ Self Administered Questionnaire x ◦ Telephone CSAQ Computerized Self-Administered Questionnaire ◦ ACASI Audio Computer Assisted Self Interviewing ◦ TCASI Text Computer Assisted Self Interviewing ` Web ◦ VCASI Video Computer Assisted Self Interviewing ◦ Is it a mode or a method of delivery? ` CATI Computer Assisted Telephone Interviewing ◦ Sample frame issues confuse mode issue ◦ IVR Interactive Voice Response ◦ T-ACASAI Telephone Audio Computer Assisted Self Administered ◦ TDE Touch Tone Data Entry ` Multiple Modes Within Instrument ` Multiple Modes For Different Respondents ◦ All respondents receive same questionnaire ◦ Contact method varies by respondent preference administered the same way ◦ Contact method varies by difficult of contact ◦ Administration uses different modes: ◦ Sub-sampled respondents with expensive modes ◦ For example: x FTF –VCASI –CATI Follow-Up ` x Phone – Mail – Phone Multiple Modes For Recruitment or Follow Up ` Multiple Modes For Different Sample Frames ◦ Mail pre-notification letters for phone ◦ Telephone contact with WWW SAQ ◦ Telephone reminders for mail or web ◦ Telephone contact with CATI ◦ For example: ◦ Face-to-Face oversample x FTF –VCASI –CATI Follow-Up x Phone – Mail – Phone

` Nature of population ◦ Are all population members available through mode?

` Nature of unit ◦ Screening for respondents is better with interviewers

` Nature of record ◦ If sample record corresponds to unit, less screening required

Identification of Valid sample records ` Standardization ◦ ◦ To what degree does the survey experience need to be the Does this require an interviewer? same for all respondents ◦ Can respondents do this themselves through screening? ` Supervision ◦ How much supervision do interviewers or respondents need ` Clustering ` Clarification ◦ Are records clustered geographically ? ◦ If questions arise during the survey process, how will they ◦ i.e. face to face interviews may be feasible be addressed x ◦ Respondent questions Otherwise face-to-face reduces sample efficiency x Interviewer questions ` Item Nonresponse ` Primacy Effects ◦ Higher without interviewers x Respondents answer first thing they see x More prevalent in SAQ’s ` Unit Nonresponse ` Recency Effects ◦ Higher without interviewers x Respondents answer the last thing they heard x More prevalent in interviewer-administered Q’s ` Randomization or Rotation of Response ` Bias with sensitive questions choices ◦ Lower without interviewers ◦ Reduces primacy and recency effects in answer categories ◦ Usually requires computer administration

` Randomization or Rotation of Response ` Nature of Information ◦ choices Some information is available from administrative records ◦ Sometimes, multiple respondents are needed to answer ◦ Reduces primacy and recency effects in answer categories ◦ Rotate if there is an order to the question ` Nature of Respondents ◦ Randomize if no order ◦ Literacy and sophistication have implications for SAQ’s

` Nature of Measures ◦ Is audio important? ◦ Visual stimulus? ◦ Reading?

Surveys and the Research Process

Concepts Population External Validity

Measures Sample Internal Validity Internal Data Respondents • `Measure underlying concept Existing Questions Often Preferable •Trend Data Requires Identical Questions ` Don’t measure other concepts •Sources of existing Questions `Mean the same thing to all respondents ◦The General Social Survey ◦The National Election Survey ◦IPoll (database of 500,000 polls) ◦Polling the Nations:

•Administer Questionnaire to Small Sample •Keep the Questionnaire SHORT •Convenience Sample is Okay •Keep Questions SIMPLE •Have Preceptor Review Your Questionnaire •Keep Question Order in Mind •Question Understanding Aid •Earlier Questions Can Influence Later Questions • http://mnemosyne.csl.psyc.memphis.edu/QUAID/quaidinde •Sensitive Questions Are Best at Back x.html •Provide Clear Instructions •Filter and Branch

•Open End `Five to Seven Categories `Provide Middle Category • Closed End `Label Scale With Clear Unambiguous Words `Agree/Disagree Scales Can be Problematic •Rating Scale `Technical Terms and Jargon `Scale Categories are Ordinal `Vague or Imprecise Terms `Categories Are Mutually Exclusive `Complex Sentences `Categories Anticipate All Answers `Double-Barreled Wording `Leading or Emotional Language

Questions and Discussion