The role of classically trained Statisticians is to answer questions with data and communicate the logic behind the results. Rarely does a statistician attempt to bridge the gap between statistical logic and practical interpretation unless there is a content expert working closely with the team. The typical method for communicating statistical findings follows a seven step process called Hypothesis Testing. There are many great places online to learn more about Hypothesis Testing (http://stattrek.com/hypothesis-test/hypothesis-testing.aspx).
Step 1 – General Question: Someone asks a question and wants an answer based on numerical evidence, and expects the closest thing to fact that is humanly possible. The questions may sound like this. Is there an HR problem in the Company? Do I need to hire new people? Why are sales higher in the Northeast? What does the public think of our new product? How can we improve our public image? None of these questions are statistically measurable until translated into research questions.
Step 2 – Research Question: This step involves translating general questions into a series smaller, measurable questions. General Question: Is there an HR problem in the Company? Research Question: How trustworthy are the employees in Company X as measured by the Employee Trustworthiness Scale? Research Question: Is trustworthiness different between genders in Company X using the same measure?
Step 3 – Hypotheses: Statisticians use data to answer questions. Since 100% certainty is not possible, statistical answers are given within a degree of measurable certainty, and written as Hypotheses. Hypotheses are “plausible” explanations among many. For example, “There is no significant difference in Trustworthiness between genders” is a plausible Hypothesis to consider. (I will write more about the mechanics of Hypothesis testing in a future article).
Step 4 – Analysis Plan: You may have many Hypotheses to test. Each Hypothesis may require a unique calculation. And, each calculation may have a unique set of assumptions to consider. A well written analysis plan is essential to understanding and communicating the statistical findings in a way that is relevant to the audience.
Step 5 – Calculate a Statistic: The Hypothesis, type of data, and sample/population size dictates the appropriate statistical test. With hundreds of test to choose from, there really is no magic for knowing what test to use. However, there are several “cheat sheets” available online (I will write more later about the mechanics of Hypothesis testing and how to use calculated statistics).
Step 6 – Compute Probability: The calculated value of a statistical test “alone” is not very informative. The Hypothesis testing process uses the calculated value to make inferences. The values are compared to computed probabilities that form the basis of the conclusion (I will write more about the mechanics of Hypothesis testing and probability in future articles).
Step 7 – Present Results: Presenting statistical results is very different from interpreting results. Presenting results follow a structure that may vary slightly depending on the statistic, but generally looks like this:
1. Chose a Test: ie: t-test
2. Calculate a Result: ie: t(df) = t-value, p = p-value
3. Significant? Yes / No
4. Null Hypothesis: Reject or Not Reject
5. Therefore: There IS or IS NOT a significant difference between two means
6. Conclusion: Make a statement that summarizes all previous steps