2016 Surveytech Guide X Dec
User Manual:
Open the PDF directly: View PDF
.
Page Count: 17
| Download | |
| Open PDF In Browser | View PDF |
DIGITAL ENGAGEMENT RESEARCH WORKING GROUP Survey Technical Guidance: Samples, Design and Analysis November 2015 Paper authored by Dr Grant Blank, Oxford Internet Institute, University of Oxford With support from the Digital Engagement Research Working Group page 1 Table of Contents Executive summary……………………………………………………….…………………. 3 Technical guidance…………………………………………………………………………...4 Sampling…………………………………………………………………………………….4 Sample size and sample error…………………………………………………………….4 Total survey error…………...……………………………………………………………...6 Question wording…………………………………………………………………………...7 Question order……………………………………………………………………………...8 Instructions for respondents………………………………………………………………9 Response rates…………………………………………………………………………...10 Pretesting your questionnaire…………………………………………………………...10 Administering the survey………………………………………………………………....11 Checking your data……………………………………………………………………….12 Weights…………………………………………………………………………………….12 Analysis…………………………………………………………………………………….13 Reporting your methodology…………………………………………………………….16 For additional information………………………………………………………………….17 page 2 Executive summary A simple survey can be done relatively easily and it can produce persuasive, informative results. There are, however, a number of technical considerations that will improve the quality and reliability of the results. High quality surveys produce evidence that stands up to scrutiny and can inform policy. It follows the principles of ethics, transparency, accountability and auditability. This document briefly outlines many of the key issues and supplies guidance on how to deal with them. Sampling and question wording Ensure that the sample is randomly selected so that it is representative of the population that you want to study. If you want do statistical analysis, make the sample size large enough. This probably means a minimum of 400 respondents and a minimum of 800 is much better. Write simple, easily understood questions to avoid biased results. If you use questions that have been used in other surveys, you can compare your results to theirs. This is valuable if you want to compare your results to a national survey. Pretest your questions with a small number of people from the population of interest. In your cover letter, tell respondents who is conducting the survey, why, and give contact details if they want further information. Analysis and reporting Know how large the difference between percentages must be in order to say that it is statistical significant given your sample size. Do not claim that smaller differences are important. Do not confuse correlation with causation (e.g. if higher Internet use is associated with lower marks this may not mean that Internet use reduces marks). Clearly label all graphs and tables, as well as all axes on graphs. To the extent possible, graphs and tables should be selfexplanatory. Report percentages with a clear statement of the sample or subsample they refer to (e.g. Internet users age 16 and over; Users of social network sites age 2534). This should be part of the label of each table or graph. Accountability Report funding sources and any potential conflicts of interest. Explain how the respondents were selected. Report the exact wording of the questions and possible responses. Report what organization conducted the survey along with the dates and circumstances of data collection. If you summarize results for the press, provide access to the percentage tables from which you drew your reported findings. Ensure the full research report is available or provide contact details so that interested people can ask researchers about the data or the analysis page 3 Technical guidance Surveys have been conducted, in their modern form, for at least 75 years, and many books have been written about them. This brief document can’t hope to duplicate the detail you would get from any one of a number of good books. This document is an overview. The goal of this document is to introduce you to crucial issues and give you relatively straightforward suggestions. It is not a substitute for indepth knowledge and, at the end of the document, I suggest some books you can turn to for more detailed information. Sampling Collecting data on whole populations is often too expensive and it is not necessary. You can answer almost all questions by selecting a sample, and it will be faster and cheaper. A sample is just a collection of people from the population that you wish to learn about. How these people are selected matters a great deal. A sample is valuable when it accurately represents the underlying population. To do this people in the sample must be selected randomly . “Random” has a very specific meaning: the selection is designed so that each person in the population has an equal chance of being selected. These samples are called “probability samples” because each member of the population has an equal probability of selection. The payoff from a probability sample is that the sample will be representative of the underlying population; that is, that the results you get from the sample can be generalized to the population. Random selection is the most important single element in the success of your survey. If you don’t do this well, the rest doesn’t matter because the results from the sample can’t be generalized. Results from nonprobability samples cannot be generalized. Nonprobability samples include selfselected or “convenience samples”, including Internet optin surveys, callin samples, person in the street surveys, and nonprobability mailin or telephone samples. Sample size and sample error Probability samples depend on two mathematical axioms: the law of large numbers and the central limit theorem. The central limit theorem says that if we draw a sample to measure something, say Internet use, the measurements will fall in a predictable, stable pattern around the true value in the population. This predictable pattern is the familiar bellshaped curve, which is called a “normal distribution”. The centre of a normal page 4 distribution is likely to be at the true value in the population. The law of large numbers says that the more people you have in the sample the closer your distribution will be to a bellshaped curve. That is with a large sample, your results will be very close to the underlying population. How close your results come to the population is called the error in the sample or the “sampling error”. Sampling error has a simple relationship to sample size: as sample size goes up, sampling error gets smaller. The relationship between sample size and sampling error can be described mathematically, as shown in the figure below. The figure shows the size of the sampling error in percent for sample sizes between 100 and 1600 respondents. For example, it says that a sample of 400 respondents will have a sampling error of 5%. If you took a sample of 400 and found that 80% of them were Internet users, this says that the proportion of Internet users in the population would be 80% ±5%. Another way to say this is that your survey shows the proportion of Internet users is between 75% and 85%. The figure shows why the typical national poll samples about 11001200 people: For that sample size the margin of error is ±3%. Sampling error has several characteristics that you should know to help understand it. First, it is a theoretical minimum. It assumes that you have drawn a truly random sample, where all members of the population had an equal probability of being included. Second, it is only one kind of error, but it is quantifiable and so it is often reported. Other errors will be discussed below. Third, it is not possible to calculate the page 5 sampling error of a nonprobability sample. It is worth emphasizing again that all Internet polls in Britain are nonprobability samples. Finally, you often want to compare percentages; for example, voter preferences between Conservative and Labour in a political poll. In these cases it is important to remember that sampling error applies not to the gap between estimated percentages but to each estimate . Below is an example from the 2009 OxIS survey. This was a survey of 2,013 respondents, which had a margin of error of 2.2%. The table below shows that 71% of men are likely to use the Internet compared to 68% of women, a difference of three percentage points. This is outside the 2.2% margin of error. However, the possible range for men is 69% to 73%; for women it is 66% to 70%. Since the minimum estimate for men, 69%, is less than maximum estimate for women, 70%, these two ranges overlap. This says that men and women have about the same Internet use. Two groups are only different from each other if the difference between them is more than twice the sampling error . Internet use by gender 2009 Oxford Internet Survey Men 71% Women 68% Men 71% ± 2.2% Women 68% ± 2.2% Men 69% 73% Women 66% 70% Total survey error Sampling error is one source of error. Total survey error comprises four sources: ● Sampling error. The sample may differ from the population. ● Coverage error. The sample may not map to the population. ● Nonresponse error. Many refuse to be surveyed. ● Measurement error. Question wording and order problems. I have already discussed sampling error. I will discuss the other three in succession. The “sampling frame” is the name for list of all the people in the population from which you intend to draw the sample. The sampling frame should include every person in the population. Coverage error occurs when some people in the population were not included in the sampling frame. Because they were not included in the sampling frame they could not have been included in the sample. The problem with coverage error is that people excluded from sampling frame are almost never a random selection from page 6 the population. Instead they are typically people who are more marginal and harder to reach. In national samples they are poorer, less welleducated, possibly homeless or without a permanent address. People excluded from the sampling frame have zero probability of being included in the sample, thus the sample will not be a true reflection of the population. To the extent that the sample deviates from the population, the sample is said to be “biased”. A second source of bias is nonresponse error. People may refuse to participate in the survey for many reasons; the problem is that typically they do not refuse at random. The people who refuse are systematically different from the people who agree to participate. For example, most surveys are biased toward people who have more free time to answer questions: often housewives, unemployed, retired, or students. The ratio of the number of people who participate in the survey over to the number who were asked is called the “response rate”. The response rate is an important measure of the quality of the survey. Response rates vary a lot depending on the mode of the survey. Typical response rates for inhome interviews are around 50%; for telephone polls, 1020%, for Internet surveys 14%. Question wording Finally, there are errors due to measurement problems. This is a complex subject that will not be covered in detail. One source of measurement problems is badly worded questions. For example, if you want to know how people felt about a training programme don’t ask “Was the training programme useful?,” instead ask “Was the training useful or not useful?” because you don’t want to emphasize one answer or the other. You want to offer options that represent the basic choice: useful or not useful; approval or disapproval. Sometimes it could be a range, like “How confident are you that you will use the training?” You can be extremely confident, you can be not confident at all, or you can be somewhere in the middle. Second, consider the question: “Do you want to see less money spent on defence and more on the NHS?” The problem is that regardless of whether a respondent agrees or disagrees it is not clear what they are agreeing/disagreeing to. They may want to spend more on the NHS but that doesn’t mean that they want less spent on defense, and vice versa. These are called “doublebarreled questions” because they are really two separate questions. Another example of a bad question is: “Do you favor killing unborn babies?” This sort of question is loaded with emotional or red flag words that make it very hard to disagree with, regardless of the respondent’s true feelings about abortion. None of these types of questions will produce accurate answers. page 7 Finally, questions need to be understood by everybody who is answering the survey. They need to be understood by people who have a Ph.D. and people who have no educational qualifications. Good questions: ● ● ● ● ● ● ● Are concise, short, simple and avoid jargon Are easily understood by all respondents Don’t presume information Don’t tax a respondent’s memory or cognitive ability Use balanced, neutral wording to avoid bias Ask about only one thing Avoid negative terms like “not”, “none” or “no” Question wording matters a lot. It is important to provide the complete text of questions and responses so that readers can judge for themselves whether you have avoided these problems. The questions we provide to measure digital engagement have been professional written to avoid these and other problems. It is very important that you keep the exact wording and do not change a single word. This ensures that your results are comparable to other surveys using the same questions. Question order The context of the questions matters a lot. This is an issue of questionnaire design and format. Several suggestions for the questionnaire are: ● Start with simple and interesting questions. You want to grab respondents at the beginning with something that they know and enjoy thinking about. Possibly pleasurable, enjoyable experiences online. ● After several warm up questions, ask the most important questions in the survey. This is the place where respondents are least likely to be affected by respondent fatigue. ● End with sensitive questions like age, qualifications, marital status, gender, employment status, and ethnicity. In many countries, income is among these questions, but the British people are very sensitive to talking about their income so you may want to omit it. If you do include it be prepared for a high rate of nonresponse. ● Include the date and time when the survey was distributed as well as a respondent ID number. To help protect confidentiality the respondent’s name should not appear on the questionnaire. page 8 Pay attention to the order of the questions. People respond to questions by telling you about what is on top of their heads. Consider the following sequence of four questions: 1. How important is the NHS to you? 2. Are you worried about possible privatization of part of the NHS? 3. Do you think that high quality NHS care will be available to you when you need it? 4. What is the most important problem facing the United Kingdom? The first three questions will prompt people to think about the NHS. These questions push the NHS to the top of their minds. When respondents reach the fourth question and need to retrieve from their minds the most important problem facing Britain, the NHS will be easy to retrieve. A much higher proportion of people would say the NHS is an important problem than they would if the question about the most important problem had been placed first instead of last. The point is that the answers are influenced by preceding questions. If you want accurate, unbiased results you need to be careful about question order. Instructions for respondents You need a cover letter, email or instruction sheet for any survey. There are usually four important pieces of information to include in the instructions. 1. The purpose of the survey. This is very important. The more respondents see the survey as personally important to them, the more likely they are to complete it. 2. Who is sponsoring the survey and who is administering it. 3. How confidentiality will be protected. A strong assurance of anonymity is an important way to increase the likelihood of honest responses. 4. Who the respondent can call, write or email if they have questions, concerns, or want a copy of the survey results. If possible, the instructions, email or cover letter should be personalized with the respondent’s name. This will improve your response rate. page 9 Response rates Response rates to survey vary enormously. There are a number of actions that you can take to improve response rates. Salience refers to the significance or value of a topic to the respondent. Salient questionnaires receive much higher response rates than nonsalient surveys. Your cover letter should always explain how the respondent will benefit personally from completing the questionnaire. One approach is for the cover letter to be signed by a person who the respondent knows, either personally, by reputation or by affiliation (like a CEO), a politician, or another wellknown person. Salience is usually the single most important factor in improving your response rate. Public health surveys can achieve response rates of over 80%. Marketing surveys often get 20% Followup contacts . When potential respondents don’t respond to your initial contact, what is the effect of followup contacts? The first followup adds about 1520% more respondents; the second followup adds another 1015%; the third followup adds perhaps 5%. The fourth and later followups seem to add little. Reachable population . If the population is easily reachable, for example, students, employees or clients, they are more likely to return the questionnaire than if the survey is of the general population. Adding a monetary incentive increases the response rate. Typical incentives include £5 or £10 or inclusion in a drawing for a prize. Advance contact , such as sending a letter or an email telling respondents that the questionnaire is coming, seems to have about the same effect as followup. Length does not seem to have a big effect. It only appears when salience and followup are controlled. Obviously at some point length matters, but not for moderately long questionnaires of 1020 minutes. Pretesting your questionnaire By the time you have written the questionnaire you may be too close to it to see potential problems. You may want to ask several people to critique it before you pretest it. page 10 Pretesting is important because you may find that your understanding of the questions differs from that of potential respondents. To pretest, choose a small sample of people who are similar to the people you will be surveying. You may want to include people with different qualifications or different income, with or without children, different ages, different marital status, depending on what factors you think will affect a respondent’s ability and willingness to complete the survey. If possible time how long the pretesters need to complete the survey. Encourage pretest respondents to make comments on each question, as well as on the order and format of the questions. This is often best done by interviewing them immediately after they complete the questionnaire. If several people complete the survey at once, then discussing it in a small group may be helpful. Pay close attention to any questions that pretesters refuse to answer, misunderstand or answer incorrectly. These questions may be poorly worded, too difficult to answer, or too sensitive to answer. These questions should be revised, and pretested again, if possible. Administering the survey There are usually seasonal differences in availability of willing respondents. You want to avoid times like August and December because they will produce higher nonresponse rates. School term breaks may also be bad times if your population includes large numbers of households with children. If you are conducting the survey yourself, there are several administrative issues to consider. Keep lists of all people contacted for the survey (including addresses, email addresses, and phone numbers) and when they completed the survey. For respondents who refuse to complete the survey, try to learn why and record the reasons. This may help you understand ways you can modify your data collection methods. Keep track of any additional contacts, like follow up phone calls or emails. Make sure that all personally identifiable information like names and addresses are kept strictly confidential. Tabulate the number of people contacted and the number who actually completed the survey so you can calculate a response rate. This is an important number that you should report whenever you report any results from the survey. page 11 Checking your data Once you have data, the first step is to check it for errors. For example if the possible codes for gender are 0 = Male and 1 = Female, then be sure that you don’t have any 3s or 4s. This is a check for impossible codes. You need to do this check for every single question in your questionnaire. A related check is for unusually large or small values, for example a respondent who claims to spend over 100 hours per week on the Internet. You need to decide what to do when you encounter implausible values like this. There are many possibilities. Some respondents may have special characteristics that make an unusually large or small number plausible, or they may have misunderstood the question, or this may be evidence they are not taking your survey seriously. You should check to see if this is a data entry error that you can fix. If this occurs in a small number of instances you may want to remove it from the dataset. If you remove data, you need to report it in the methodological section of the results. You may also want to report the data with and without these cases. You will need special codes for nonresponse. There are many kinds of nonresponses, such as people who were never asked certain questions. For example, in a survey of Internet use, people who don’t use social media would not be asked social media questions. People who don’t own smartphones would not be asked smartphone questions. Households without children would not be asked questions about their children’s Internet use. Because these questions were skipped they are called “legitimate skips”. They could be coded 1. Also, some people will refuse to answer certain questions. In Britain questions about income have high rates of refusal. Refusals can be coded as 2. Some people may be asked questions but they don’t know the answers. People who respond “don’t know” can be coded as 3. Weights If your survey is for a local project you don’t need to use weights and you don’t need to read this. However, all national surveys need to be weighted. Weighting corrects for the problem of not including groups in the sample in the correct proportion to their size in the population. This is called “poststratification” weighting. It is one way to correct for sampling error and nonresponse error. The table below contains a simple example: page 12 Qualifications No qualifications GCSE Further education University graduate Population % 10 25 35 30 Sample % 5 10 35 50 Weight 2.0 2.5 1.0 0.6 The second column in the table shows the percent of the population who have four educational qualifications. The third column shows hypothetical survey results. The survey received responses from too few people with no qualifications or GCSEs and too many university graduates. We can correct for this by weighting each respondent. Respondents who report no qualifications receive a weight of 2.0. Respondents reporting GCSEs will be weighted 2.5. Further education respondents will be weighted 1.0 and university graduates will be given a weight of 0.6. The software you use to analyze the survey has to be able to use weights correctly. Any standard statistical software will be able to handle weights. Look in the documentation or help files for “weights” or “poststratification weights” or “probability weights”. Spreadsheets like Excel and most database software will not handle weights and this is a good reason to avoid them for statistical analysis. What the software will do is count the respondent in the results in proportion to the weight. Thus a person with no qualifications and a weight of 2.0 will be counted twice as heavily in the results compared to a person who has further education and a weight of 1.0. The value of weighting is that it adjusts the results from the sample so that they more closely reflect the results from the population as a whole. Weights have an important limit that you should know. You can only weight to known population values. In national samples this usually means the population values reported by the British census. Although the census collects data on everyone, it does not collect very much data. Weights will probably be limited to gender, age, region, urbanrural status and a few others. Analysis Once you have collected your data they have to be analyzed. Analysis of survey data is a very large topic. I will just touch on a few central issues; for further information, see the suggested readings at the end. page 13 The oldest and still most common method of analysis is percentage tables. They remain an excellent technique for gaining insights into your data. You are unlikely to need anything more sophisticated. Simple frequency tables are an excellent way to describe your data. Below is a frequency table of the social grade of respondents from the 2013 wave of the Oxford Internet Survey: Social Grade of Respondents Valid Missing Total A Upper middle B Middle class C1 Lower middle C2 Skilled working D Working class E Subsistence Total valid Legitimate skip Refused Total Frequency 25 332 720 427 504 274 2,282 365 10 375 2,657 Percent 0.9 12.5 27.1 16.1 19.0 20.3 85.9 13.7 0.4 14.1 100 Valid % 1.1 14.6 31.6 18.7 22.9 12.0 100 The table has three columns of data: the frequency in each category, the percent in each category and the valid percent in each category. Looking first at the column titled “Frequency”, the bottom row, labelled “Total” shows that there were 2,657 respondents in the survey, but the “Total valid” rows indicates that only 2,282 have valid data on social grade. Among the 375 missing, 365 were missing because they were not asked (social grade is based, in part, on occupation, so these would be students, unemployed or retired) and 10 refused to give the information. The “percent” column contains percents of all respondents, regardless of whether they have data on social grade. The “valid %” column contains percents of only those cases where there are valid responses. In general, the valid % column is the one you want to pay attention to. You can see in the first row of the table that 25 respondents, or 1.1%, were in social grade A, compared to 274 respondents, or 12.0%, in social grade E. The graph below shows the same percentage table in a more visual form. This form is probably better for presentation to most audiences. page 14 Twoway percentage tables tell you how different variables are related to each other. For example, to show how social grade is related to Internet use, we crosstabulate social grade and Internet use. We put social grade in the rows and Internet use in the columns and ask for row percents. The result is below: Social Grade by Internet Use A Upper middle B Middle class C1 Lower middle C2 Skilled working D Working class E Subsistence Total Non user 0.0% 6.3 12.6 28.7 30.8 51.8 23.2 User 100.0 93.8 87 71.4 69.2 48.2 76.8 Total 100 100 100 100 100 100 100 Frequenc y 21 314 676 465 478 239 2,195 For each category of social grade the table contains four data columns. The first data column is the percentage of nonInternet users in that social grade. The second column is the percentage of Internet users in that social grade. These two columns add to 100%, which is the third column. The final column on the right gives the number of respondents in each row. Analysis consists of comparing the percentages down the columns. For example, looking at the percentage of users, we notice that 100% of social grade A are Internet users. As we go down the list of social grades the percentage of Internet users declines steadily. At the lowest level, only 48.2% of the people social grade E are Internet users. From this analysis we would conclude that page 15 social grade has a major impact on Internet use. You could do the same analysis of the nonuser column, but since the percent nonusers plus the percent users adds up to 100% an analysis of nonusers would just be a mirror image of the analysis we just did on users. Since we are looking at the influence of social grade on Internet use, we are thinking of social grade as a causal variable. It causes (some of the) variation in Internet use. We say that the proportion of Internet users depends (in part) on social grade. When we think about causes and effects in this way we have a simple rule for setting up a twoway table: Ask for percentages in the direction of the causal variable. If the causal variable is in the rows, calculate row percents. Then do your comparisons down the columns, like we did in the previous paragraph. This rule can be stated concisely: For a twoway table, calculate percentages in the direction of the causal variable and compare in the other direction. Reporting your methodology There are minimum standards for reporting the methodology you use in your survey. Reporting this information allows readers to judge the quality of your work. This information can be on a web page and it should be included as a “Methodological Appendix” in any report. The minimum standard information is: ● Name of the survey sponsor ● Name of the organization that conducted the survey ● The exact wording of the questions being analyzed and the possible responses ● The definition of the population under study. ● A description of the sampling frame used to represent the population under study ● An explanation of how respondents were selected ● Total number of potential respondents contacted, total completed surveys returned and the response rate. ● The mode of data collection; e.g. paper, telephone, Internetbased, email, etc. ● The dates and location(s) of data collection page 16 ● Estimates of sampling error ● A description of the weighting procedure (if used) For additional information Many books have been written about the conduct and analysis of surveys. Textbooks are most accessible for a novice. Two good choices are: th Babbie, Earl. (2010) The practice of social research . 12 ed. Wadsworth. This has excellent chapters on sampling, writing questions, and analysis of surveys. th De Vaus, David. (2014) Surveys in social research . 6 ed. Routledge. Has excellent chapters on all phases of the survey process. page 17
Source Exif Data:
File Type : PDF File Type Extension : pdf MIME Type : application/pdf PDF Version : 1.4 Linearized : No Producer : Skia/PDF Page Count : 17EXIF Metadata provided by EXIF.tools