In This Chapter
^ Deciphering the difference between qualitative and quantitative variables ^ Choosing appropriate statistical techniques for the task at hand ^ Evaluating bias and precision levels ^ Interpreting the results properly
Ne of the most critical elements of statistics and data analysis is the ability to choose the right statistical technique for each job. Carpenters and mechanics know the importance of having the right tool when they need it and the problems that can occur if they use wrong tool. They also know that the right tool helps to increase their odds of getting the results they want the first time around, using the "work smarter not harder" approach.
In this chapter, you look at the some of the major statistical analysis techniques from the point of view of the mechanics and carpenters — knowing what each statistical tool is meant to do, how to use it, and when to use it. You also zoom in on mistakes some number crunchers make in applying the wrong analysis or doing too many analyses. Knowing how to spot these problems can help you avoid making the same mistakes, but it also helps you to steer your way through the ocean of statistics that may await you in your job and in everyday life.
If many of the ideas you find in this chapter seem like a foreign language to you and you feel like you need more background information, don’t fret. Before continuing on in this chapter, head to your nearest intro stats book or check out another one of my books, Statistics For Dummies (Wiley).
Qualitative Versus Quantitative Variables in Statistical Analysis
After you’ve collected all the data you need from your sample, you want to organize it, summarize it, and analyze it. Before plunging the data in to do all the number crunching though, you need to first identify the type of data you’re dealing with. The type of data you have points you to the proper types of graphs, statistics, and analyses you’re able to use.
Before I begin, here’s an important piece of jargon: Statisticians call any quantity or characteristic you measure on an individual a Variable; The data collected on a variable is expected to vary from person to person (hence the creative name).
The two major types of variables are the following:
Qualitative: A qualitative variable classifies the individual based on categories. For example, political affiliation may be classified into four categories: Democrat, Republican, Independent, and other; gender as a variable takes on two possible categories: male and female. A person may be categorized as a female Republican, which means that, regarding the gender variable, she falls into the female category, and regarding the political affiliation variable, she falls into the Republican category. Another name for a qualitative variable is a Categorical variable.
Quantitative: A quantitative variable measures or counts a quantifiable characteristic, such as height, weight, number of children you have, your GPA in college, or the number of hours of sleep you got last night. The quantitative variable value represents a quantity (count) or a measurement and has numerical meaning. That is, you can add, subtract, multiply, or divide the values of a quantitative variable, and the results make sense as numbers. This characteristic isn’t true of qualitative variables, which can take on numerical values only as placeholders.
Because the two types of variables represent such different types of data, it makes sense that each type has its own set of statistics. Qualitative variables, such as gender, are somewhat limited in terms of the statistics that can be performed on them. For example, suppose you have a sample of 500 classmates classified by gender — 180 of them are male, and 320 are female. How can you summarize this information? You already have the total number in each category (this statistic is called the Frequency). You’re off to a good start, but frequencies are hard to interpret because you find yourself trying to compare them to a total in your mind in order to get a proper comparison. In the previous example, you may be thinking "One hundred and eighty males out of what? Let’s see, it’s out of 500. Hmmm. . . what percentage is that? I can’t think."
The next step is to find a means to relate these numbers to each other in an easy way. You can do this by using what is called a relative frequency. The Relative frequency Is the percentage of data that falls into a specific category of a qualitative variable. You can find a category’s relative frequency by dividing the frequency by the sample total (500, using this example) and multiplying
By 100. In this case, you have i80 = 0.36 * 100 = 36 percent males and
500
320 = 0.64 *100 = 64 percent females. 500
You can also express the relative frequency as a proportion in each group by leaving the result in decimal form and not multiplying by 100. This statistic is called the Sample proportion. If you continue with the same example, the sample proportion of males is 0.36, and the sample proportion of females is 0.64.
You mainly summarize qualitative variables by using two statistics — the number in each category (frequency) and the percentage (relative frequency) in each category.
Statistics for Qualitative Variables
The types of statistics done on qualitative data may seem to be limited; however, the wide variety of analyses you can perform using frequencies and relative frequencies offers answers to an extensive range of possible questions you may want to explore.
In this section, you see that the proportion in each group is the number-one statistic for summarizing qualitative data. Beyond that, you see how you can use proportions to estimate, compare, and look for relationships between the groups that compose the qualitative data.
Comparing proportions
Researchers, the media, and even everyday folk like you and me love to compare groups (whether you like to admit it or not). For example, what proportion of Democrats support oil drilling in Alaska, compared to Republicans? What percentage of women watch college football versus men? What proportion of readers of Intermediate Statistics For Dummies Pass their stats exams with flying colors, compared to nonreaders? To answer these questions, you need to compare the sample proportions using a hypothesis test for two proportions (see Chapter 3 or your intro stat textbook).
Suppose you’ve collected data on a random sample of 1,000 United States voters. You may want to compare the proportion of female voters to the proportion of male voters and find out whether they’re equal. Suppose in your sample you find that the proportion of females is 0.53, and the proportion of males is 0.47. So for this sample of 1,000 people, you have a higher proportion of females than males. But here’s the big question: Are these sample proportions different enough to say that the entire population of U. S. voters has more females in it than males? After all, sample results vary from sample to sample. The answer to this question requires comparing the sample proportions by using a hypothesis test for two proportions. I demonstrate and expand on this technique in Chapter 3.
Estimating a proportion
You can also use relative frequencies (check out the section "Qualitative versus Quantitative Variables in Statistical Analysis") to make estimates about a single population proportion.
Say, for example, you want to know what proportion of females in the United States are Democrats. According to a sample of 29,839 female voters from the U. S. conducted by the Pew Research Foundation in 2003, the percentage of female Democrats was 36. Now because the Pew researchers based these results on only a sample of the population and not on the entire population, these results may vary from sample to sample. The amount of variability is measured by the Margin of error (the amount that you add and subtract from your sample statistic), which for this sample is only about 0.5 percent. (To find out how to calculate margin of error, explore Chapter 3.) That means that the estimated percentage of female Democrats in the U. S. voting population is estimated to be somewhere between 35.5 percent and 36.5 percent.
The margin of error, combined with the sample proportion, forms what statisticians call a confidence interval for the population proportion. Recall from intro stats that a Confidence interval Is a range of likely values for a population parameter, formed by taking the sample statistic plus or minus the margin of error. (For more on confidence intervals, see Chapter 3.)
Looking for relationships between qualitative Variables
Suppose you want to know whether two qualitative variables are related (for example, is gender related to political affiliation?). Answering this question requires putting the sample data into a two-way table (using rows and
Columns to represent the two variables), and analyzing the data by using a Chi-square test (see Chapter 14). By following this process, you can determine whether two categorical variables are independent (unrelated) or whether a relationship exists between them. If you find a relationship, you can use percentages to describe it.
Table 2-1 shows an example of data organized in a two-way table. The data was collected by the Pew Research Foundation.
|
Table 2-1 |
Gender and Political Affiliation for 56,735 U. S. Voters |
|
Gender |
Republican Democrat Other |
|
Males |
32% 27% 41% |
|
Females |
29% 36% 35% |
Notice that the percentage of male Republicans in the sample is 32 and the percentage of female Republicans in the sample is 29. These percentages are quite close in relative terms. However, the percentage of female Democrats seems much higher than the percentage of male Democrats (36 percent versus 27 percent); also, the percentage of males in the "Other" category is quite a bit higher than the percentage of females in the "Other" category (41 percent versus 35 percent). These large differences in the percentages indicates that gender and political affiliation are related in the sample. But do these trends carry over to the population of all U. S. voters? This question requires a hypothesis test to answer. The particular hypothesis test you need in this situation is a Chi-square test, which I discuss in detail in Chapter 14.

To make a two-way table from a data set by using Minitab, first enter the data in two columns, where column one is the row variable (continuing with the previous example, this variable would be gender) and column two is the column variable (in this case, political affiliation). For example, suppose the first person is a male Democrat. In row one of Minitab, enter M (for male) in column one and D (Democrat) in column two. Then go to Stat>Tables>Cross Tabulation and Chi-square. Highlight column one and click Select to enter this variable in the For Rows line. Highlight column two and click Select to enter this variable in the For Columns line. Click on OK.

People often use the word Correlation To discuss relationships between variables, but in the statistical world, you can use correlation only to discuss the relationship between two quantitative (numerical) variables, not two qualitative (categorical) variables. Correlation measures how closely the relationship between two quantitative variables, such as height and weight, follows a
Straight line and tells you the direction of that line as well. In total, for any two quantitative variables, X And Y, The correlation measures the strength and direction of their linear relationship. As one increases, what does the other one do?
Because qualitative variables don’t have a numerical order to them, they don’t increase or decrease in value. For example, just because male = 1 and female = 2 doesn’t mean that a female is worth twice a male. (Although some women may want to disagree.) These numbers represent categories, not values. Therefore, you can’t use the word Correlation To describe the relationship between, say, gender and political affiliation. The appropriate term to describe the relationships of qualitative variables is Association. You can say that political affiliation is associated with gender, and explain how. (For full details on association, see Chapter 13. For more information on correlation, see Chapter 4.)
Building models to make predictions
You can also build models to predict the value of a qualitative variable based on other related information. In this case, building models is more than a lot of little plastic pieces and some irritatingly sticky glue. When you build a model, you look for variables that help explain, estimate, or predict some response you’re interested in (the variables that do this are called Explanatory variables). You sort through the explanatory variables and figure out which ones do the best job of predicting the response, and you put them together into a type of equation like Y = 2x + 4 where X = shoe size and Y = length of your calf. That equation is a Model.
For example, what if you want to know which factors or variables can help you predict someone’s political affiliation? Is a woman without children more likely to be a Republican or a Democrat? What about a middle-aged man who proclaims Hinduism as his religion? In order for you to compare these complex relationships, you must build a model to evaluate each group’s impact on political affiliation (or some other qualitative variable). This kind of model building is explored more in-depth in Chapter 8, where I discuss the topic of logistic regression.
Logistic regression builds models to predict the outcome of a qualitative variable, such as political affiliation. If you want to make predictions about a quantitative variable, such as income, you need to use the standard type of regression (check out Chapters 4 and 5).
In 2003, the Pew Research Foundation studied the following variables in terms of their relationship with political affiliation: gender, race, state of residence, income level, age, education, religion, marital status, and whether or not you have children. While you can do individual Chi-square analyses to examine possible connections between each of these variables and political affiliation separately, you can’t find out which combinations of these variables increase the likelihood of someone being a Democrat, Republican, or other.
For example, the Foundation found that women are more likely to be Democrats than men, but age is also a factor. Younger people tend to be more inclined to be Republican, and older people lean toward being Democrat. However, if you look at the combination of gender and age, you can see mixed results; males who are older are more likely than young females to be Democrat rather than Republican, for example. This kind of result is called an Interaction effect Between gender and age group. An interaction effect occurs when certain combinations of variables produce different results than other combinations. The only way to look for these kinds of more-complex relationships is to do model building, which allows you to examine the combinations of variables and their impact on political affiliation. The Pew Foundation was able to make conclusions about the United States population based on its model linking political affiliation, age and gender, as well as their interactions.
Statistics for Quantitative Variables
Quantitative variables, unlike qualitative variables, have a wider range of statistics that you can do, depending on what questions you want to ask. The main reason for this wider range is that Quantitative data Are numbers that represent measurements or counts, so it makes sense that you can order, add or subtract, and multiply or divide them — and the results all have numerical meaning. Examining quantitative date opens up a whole world of possibilities for analysis. In this section, I present the major data-analysis techniques for quantitative data. I further expand each technique in later chapters of this book.
Making comparisons
Suppose you want to look at income (a quantitative variable) and how it relates to a qualitative variable, such as gender or region of the country. Your first question may be: Do males still make more money than females? In this case, you can compare the mean incomes of two populations — males and

Females. This assessment requires a hypothesis test of two means (oftentimes called a /-test for independent samples). I present more information on this technique in Chapter 3.
When comparing the means of More Than two groups, don’t simply look at all the possible /-tests that you can do on the pairs of means, because you have to control for an overall error rate in your analysis. Too many analyses can result in errors — adding up to disaster. For example, if you conduct 100 hypothesis tests, each one with a 5 percent error rate, then 5 of those 100 tests give wrong results on average, just by chance.
If you want to compare the average wage in different regions of the country (the East, the Midwest, the South, and the West, for example), this comparison requires a more sophisticated analysis, because you’re looking at four groups rather than just two. The procedure you can use to compare more than two means is called Analysis of variance (ANOVA), and I discuss this method in detail in Chapters 9 and 10.
Finding connections
Suppose you’re an avid golfer and you want to figure out how much time you should spend on your putting game. The question is this: Is the number of putts related to your total score? If the answer is yes, then spending time on your putting game makes sense. If not, then you can slack off on it a bit. Both of these variables are quantitative variables, and you’re looking for a connection between them. You collect data on 100 rounds of golf played by golfers at your favorite course over a weekend. Table 2-2 shows the first few lines of your data set.
|
Table 2-2 |
First Ten Golf Scores (ordered) |
|
Number of Putts |
Total Score |
|
23 |
76 |
|
27 |
80 |
|
28 |
80 |
|
29 |
80 |
|
30 |
80 |
29
82
|
Number of Putts |
Total Score |
|
30 |
83 |
|
31 |
83 |
|
33 |
83 |
|
26 |
84 |
The first step in looking for a connection between putts and total scores (or any other quantitative variables) is to make what is called a Scatterplot Of the data, which graphs your data set in two-dimensional space by using an X And Y Plane. You can take a look at the scatterplot of the golf data in Figure 2-1. Here, X Represents the number of putts, and Y Represents the total score. For example, the point in the lower-left corner of the graph represents someone who had only 23 putts and a total score of 75. (For instructions on making a scatterplot by using Minitab, see Chapter 4.)
Figure 2-1:
A scatterplot is a two-dimensional graph you can use to look for relationships in data.

According to Figure 2-1, it appears that as the number of putts increases, so does total score. The relationship seems pretty strong — the number of putts plays a big part in determining the total score.
Now you need a measure of how strong the relationship is between X And Y And whether it goes uphill or downhill. Correlation is the number that measures how close the points follow a straight line. Correlation is always between -1.0 and +1.0, and the more closely the points follow a straight line,
The closer the correlation is to -1.0 or +1.0. A positive correlation means that as X Increases on the x-axis, Y Also increases on the y-axis. Statisticians call this type of relationship an Uphill relationship. A negative correlation means that as X Increases on the X-axis, Y Goes down. Statisticians call this type of relationship — you guessed it — a Downhill relationship.
For the golf data set, the correlation is 0.896 = 0.90, which is extremely high as correlations go. This strong correlation (close to +1.0) is a good thing because it means number of putts can do a great job of predicting total score. Because the sign of the correlation is positive, it means as you increase number of putts, your total score increases (an uphill relationship). For instructions on calculating a correlation in Minitab, see Chapter 4.
Making predictions
If you want to predict some response variable (y) Using one explanatory variable (x), And you want to use a straight line to do it, you can use Simple linear regression (see Chapter 4 for all the fine points on this topic). Linear regression finds the best-fitting line that cuts through the data set, called the Regression line. After you get the regression line, you can plug in a value of X And get your prediction for Y. (For instructions on using Minitab to find the best-fitting line for your data, see Chapter 4.)
To use the golf example from the previous section, suppose you want to predict the total score you can get for a certain number of putts. In this case, you want to calculate the linear regression line. By using the data set shown in Table 2-2, and running a regression analysis, the computer tells you that the best line to use to predict total score using number of putts is the following:
Total score = 39.6 + 1.52 * Number of putts
So if you have 35 putts in an 18-hole golf course, your total score is predicted to be about 39.6 + 1.52 * 35 = 92.8, or 93. (Not bad for 18 holes!)
Notice that the slope of the regression line tells you what you really want to know — how much does your total score increase with every additional putt? In other words, how much damage is done when you miss the hole on your first, or second, or third putt? The slope of the regression line for the golf data set is 1.52. Because the slope of a line is the ratio of the change in Y (total score) to the change in X (number of putts) this means that every additional putt you need results in an overall increase in total score by 1.52. Maybe that’s why Tiger Woods spends so much time on his short game.
Don’t try to predict Y For x-values that fall outside the range of where the data was collected; you have no guarantee that the line still works outside of that range, or that it will even make sense. For the golf example, you can’t say that if X (the number of putts) = 0 the total score would be 39.6 + 1.52 * 0 = 39.6 (unless you just call it good after your ball hits the green). This mistake is called Extrapolation.
You can discover more about simple linear regression, and expansions on it, in Chapters 4 and 5.
AvoIDINg BIAs
Bias is the bane of a statistician’s existence; it’s easy to create and very hard to deal with, if not impossible in most situations. The statistical definition of Bias Is the systematic overestimation or underestimation of the actual value. In language the rest of us can understand, it means that the results are always off by a certain amount in a certain direction. For example, a bathroom scale may always report a weight that’s five pounds more than it should be (I’m convinced this is true of my doctor’s office scale); this consistent adding of five points to every outcome represents a systematic overestimation of the actual weight.
The most important idea when dealing with bias is prevention, or at least minimizing it. Bias is like weeds in a garden: After they’re present, they’re very hard to deal with, and it’s always better to eliminate them from the start. In this section, you see ways bias can creep into a data set, or even into a statistic, and what you can do about it.
LookINg at bIAs through statIStICal glasses
Bias can show up in a data set a variety of different ways. Here are some of the most common ways bias can creep into your data:
Selecting the sample from the population: Bias occurs when you leave some intended groups out of the process, and/or give certain groups too much weight.
For example, TV surveys (the ones where they ask you to phone in your opinion) are biased because no one has selected a prior sample of people to represent the population — people call in on their own. When people participate in a survey on their own, they’re more likely to have stronger opinions than those who don’t choose to participate. Such samples are called Self-selected samples And are typically very biased.

Designing the data-collection instrument: Poorly designed instruments (including surveys) can result in inconsistent or even incorrect data.
For example, a survey question’s wording plays a large role in whether or not results are biased. A leading question can make people feel like they should answer a certain way. For example: "Don’t you think that the president should be allowed to have a line-item veto to prevent government spending waste?" Who would feel they should say No To that?
Collecting the data: In this case, bias can infiltrate the results if someone makes errors in the recording of the data or if interviewers deviate from the script.
Ur Deciding how and when the data is collected: The time and place you collect data can affect whether your results are biased. For example, if you conduct a telephone survey during the middle of the day, people who work from nine to five aren’t able to participate. Depending on the issue, the timing of this survey could lead to biased results.
Bias can creep into a data set very easily. The best way to deal with bias is to avoid it in the first place. You can do this in two major ways:
Use a random process to select the sample from the population. The
Only way a sample is truly random is if every single member of the population has an equal chance of being selected. Self-selected samples aren’t random.
Make sure that the data is collected in a fair and consistent way. Be
Sure to use neutral question wording and time the survey properly.
SeTtliNg ThE VARiAnce conTrOveRSy: The battle of n-1 Versus n
Not all statistical formulas are free of bias. In other words, some statistics have good characteristics (like offering great precision) and some not-so-good characteristics (like not giving the best possible result in all situations). Statisticians definitely prefer statistics that are both precise and unbiased, and the techniques you find in this book have both qualities. However, precise and unbiased statistics doesn’t always happen naturally; sometimes the basic idea requires a little tweaking to get a statistic that actually meets the standards of the statistical powers that be (of which I am not one). The classic example of this need to fine-tune is the formula for the variance of a data set, which I describe in the following section.
The problem
Statistics textbooks sometimes show two formulas for the variance of a data
! _ Xi – X)
Set. One formula shown for the variance is S2 = ——n-, where N Is the
Sample size, the values of X Are the data values, and the sample mean (or the
N
Average of all the values of the data set) is X = ■L=n—. This formula for variance, you may note, contains an N All by itself in the denominator. The fact that the denominator is N And not N - 1 makes a teacher’s job of explaining variance a whole lot easier, because it represents the average squared distance from the mean. In this case, the values being squared are the differences between the data values and their mean. You get the average of these squared values by summing them up and dividing by N, The sample size.
However, this version of a formula for variance, as it’s written, is biased. That means in a statistical sense, you know that in the long term, the results are always off by a very small amount from their target value. If you take repeated samples, find the variance, and do this over and over, the results on average are a little smaller than they should be. (Statisticians can prove this, but you don’t have to worry about that. I’m sure you have better things to do.)
The Solution
Because statisticians prefer results being correct to results that can be more easily explained, they decided to do something about this bias problem in the formula for the sample variance. A group of stat big wigs figured out that dividing by N Was the problem, and if you divide by N - 1 rather than N, You can get answers that are right on target. That’s how the following commonly used formula for sample variance came into being:
N
! _Xi – X)
S 2 = —-
N-1
Notice that an N - 1 rather than an N Is now in the denominator. However, trying to explain why the formula isn’t dividing by N Does tend to open up a can of worms for statistics professors (and explains why biased statistics are a topic left for the intermediate-level students, like you!).
Because statistics can be biased too, in terms of the results they create through their formulas alone, it’s always a good idea to check with a statistician or someone else in the know whether a particular statistic is unbiased before you use it.

An animal science researcher came to me one time with a data set he was so proud of. He was studying cows and the variables involved in helping determine their longevity. He came in with a super-mega data set that contained over 100,000 observations. He was thinking "Wow, this is gonna be great! I’ve been collecting this data for years and years, and I can finally have it analyzed. There’s got to be loads of information I can get out of this. The papers I’ll write, the talks I’ll be invited to give. . . the raise I’ll get!" He turned his precious data over to me with an expectant smile and sparkling eyes.
But after looking at his data for a few minutes I made a terrible realization — all of his data came from exactly one cow. With no other cows to compare with and a sample size of just one, he had no way to even measure how much those results would vary if he wanted to apply them to another cow. His results were so biased toward that one animal that I couldn’t do anything with the data. After I summed up the courage to tell him, it took a while to peel him off the floor. The moral of the story, I suppose, is to find a statistician and check out your big plans with her before you go down a cow path like this guy did.
GettINg Good PREcISIOn
Precision Is the amount of movement you expect to have in your sample results if you repeat your entire study again with a new sample. Low precision Means that you expect your sample results to move a lot (not a good thing). High precision Means you expect your sample results to remain fairly close in the repeated samples (a good thing). In this section, you find out what precision does and doesn’t measure, and you see how to measure the precision of a statistic in general terms.
Understanding precision from a statistical point of view
You may think that precision means the level of correctness you have in your statistical results. But precision only measures the Level of consistency In the results from sample to sample. Your results can be consistently correct or consistently incorrect.
For example, a field-goal kicker on a football team may consistently kick the ball two feet to the right of the goalposts every single time. Even though he’s consistent, he never gets to score, because his results are systematically off by the same amount each time. In other words, his results are biased, even though they’re precise.
IBE# A statistic can be precise with or without bias, and vice versa. The best situation is when your results are both precise (consistent) as well as unbiased (on target). That goal is what statisticians always strive for. How often does it happen? You can have a lot of control of the precision part by simply taking a larger sample. However, the goal of completely unbiased results is rarely achieved, but that doesn’t stop statisticians from trying. And you do have ways to minimize it (keep reading).
Measuring precision with margin of error
You can measure precision by the margin of error. The Margin of error Is the amount that you expect your statistical results to change from one sample to the next. While you always hope, and may even assume, that statistical results shouldn’t change much with another sample, that’s not always the case. It’s like a commercial that tries to sell a weight-loss product by showing a person who lost 50 pounds in a single weekend; then in small letters at the bottom of the screen, you see the words "results will vary." Before you report or try to interpret any statistical results, you need to have some measurement of how much those results are expected to vary from sample to sample.
The following sections show how to calculate the precision of your statistic and how to come up with a margin of error.
Calculating precision
The exact formulas for margin of error differ depending on the type of data that you’re analyzing; however, they all contain two major components:
Confidence coefficient Standard error of the statistic
The general structure of a formula for margin of error is the following, where standard error is the standard deviation of the population divided by the square root of the sample size (you can see all the details on margin of error in Chapter 3):
Margin of error = ± Confidence coefficient * Standard error
The big idea is that the confidence coefficient tells you the number of standard errors you’re willing to add and subtract in order to have a certain level of confidence in your results. If you want to be more confident in your results, you add or subtract more standard errors. If you don’t have to be as confident, you don’t have to add or subtract as many standard errors. Typically, you add and subtract about two standard errors if you want to be 95 percent confident and three standard errors if you want to be more than 99 percent confident. This rule of thumb follows a statistical result called the Empirical Rule, Also known as the 68-95-99.7 Rule.

The Standard error Is the average amount of movement in the statistic you’re using. It’s a function of two quantities:
Sample size: Sample size is perhaps the most important factor in controlling margin of error. The sample size is in the denominator of the standard error, meaning that as your sample size increases, the standard error goes down, and that’s why the margin of error goes down.
This result makes sense, because having a larger sample means having more information in your analysis, which should lead to greater precision. As the sample size decreases, the margin of error goes up, because you have less information to work with and that makes for less-precise results.
Standard deviation in the population: Standard deviation is close to the average distance from the mean. If the population you took your sample from has a large amount of variability, the standard deviation is large, and the margin of error for your statistic goes up (because standard deviation is in the numerator of the margin of error). If the population is more homogeneous, your sample results are more homogeneous as well, and the margin of error goes down (because the standard error gets smaller).
The Gallup Organization states its survey results in a universal, statistically correct format. Using a specific example from a recent survey it conducted, you can see the language it uses to report its results:
"These results are based on telephone interviews with a randomly selected national sample of 1,002 adults, aged 18 years and older, conducted June 9-11, 2006. For results based on this sample, one can say with 95% confidence that the maximum error attributable to sampling and other random effects is ±3 percentage points. In addition to sampling error, question wording and practical difficulties in conducting surveys can introduce error or bias into the findings of public opinion polls."
The first sentence of the quote refers to how the Gallup Organization collected the data, as well
As the size of the sample. As you can guess, precision is related to the sample size, as seen in the section "Calculating precision."
The second sentence of the quote refers to the precision measurement: How much did Gallup expect these sample results to vary? The fact that Gallup is 95 percent confident means that if this process were repeated a large number of times, in 5 percent of the cases the results would be wrong, just by chance. This inconsistency occurs if the sample selected for the analysis doesn’t represent the population — not due to biased reasons, but due to chance alone (more on this in Chapter 3).
(Check out the section "Bias not included" to get the info on why the third sentence is included in this quote.)
For more details on how to calculate margin of error in various statistical techniques, see Chapter 3.
Interpreting margin of error
Finding the margin of error is one thing — figuring out what it means is a whole other ball o’ wax. But don’t fear; it’s actually not so bad. To interpret the margin of error, just think of it as the amount of play you allow in your results to cover most of the other samples you could have taken.
Suppose you’re trying to estimate the proportion of people in the population who support a certain issue, and you want to be 95 percent confident in your results. You sample 1,002 individuals and find that 65 percent support the issue. The margin of error for this survey turns out to be plus or minus 3 percentage points (you can find the details of this calculation in Chapter 3). That result means that you can expect the sample proportion of 65 percent to change by as much as 3 percentage points either way if you took a different sample of 1,002 individuals. In other words, you believe the actual population proportion is somewhere between 65 – 3 = 62 percent and 65 + 3 = 68 percent. That’s the best you can say.

Bias not included!
Realizing that the margin of error measures the consistency (precision) of a statistic only, not its level of bias is extremely important. In other words, a margin of error can appear on paper to be very small yet actually be way off target because of bias in the data that was collected. (In the nearby sidebar, you can see that Gallup discusses margin of error and bias separately.)
Any reported margin of error was calculated on the basis of having zero bias in the data. However, this assumption is rarely true. Before interpreting any margin of error, check first to be sure that the sampling process and the data-collection process don’t contain any obvious sources of bias. If a great deal of bias exists, you should ignore the results, or take them with a great deal of skepticism.
Making Conclusions and Knowing Your Limitations
The most important goal of any data analyst is to remain focused on the big picture — the question that you or someone else is asking — and make sure that the data analysis used is appropriate and comprehensive enough to answer that question correctly and fairly.
Here are some tips for analyzing data and interpreting the results, in terms of the statistical procedures and techniques that you may use — at school, in your job, and in everyday life. These tips are implemented and reinforced throughout this book:
Be sure that the research question being asked is clear and definitive.
Some researchers don’t want to be pinned down on any particular set of questions because they have the intent of mining the data (looking for any relationship they can find, and then stating their results after the fact). This can lead to overanalyzing the data, making the results subject to skepticism by statisticians.
Double-check that you clearly understand the type of data being collected. Is the data qualitative or quantitative? The type of data used drives the approach that you take in the analysis.
Make sure that the statistical technique you use is designed to answer the research question. If you want to make comparisons between two groups and your data is quantitative, use a hypothesis test for two means. If you want to compare five groups, use analysis of variance (ANOVA). You can use this book as a resource to help you determine the technique you need.
Look for the limitations of the data analysis. For example, if the researcher wants to know whether negative political ads affect the population of voters, and she bases her study on a group of college students, you can find severe limitations here. For starters, student reactions to negative ads don’t necessarily carry over to all voters in the population. And even if the population were limited to all student voters, the students from this particular class don’t represent all students. In this case, it’s best to limit the conclusions to college students in that class (which no researcher would ever want to do). Ultimately what needs to be done is design the study so the sample contains a representation of the intended population of all voters in the first place (a much more difficult task, but well worth it).
One of the hardest parts of my job as a statistical consultant is dealing with analyses after the design was already done — and done incorrectly. It’s much better to put in a little work to get a good design together first, and then the analysis will take care of itself.



















