Abstract
This full-day tutorial tries a new approach to teaching statistics to
CHI practitioners. The approach avoids two errors common in statistics
pedagogy: 1) snowing students with mathematics and 2) handing them
"recipes" to apply without understanding. Instead, this tutorial
focuses on building intuition and common-sense understanding.
Keywords
Statistics, experimental design, probability, intuition,
common sense
INTENDED AUDIENCE
The target audience of this tutorial is CHI
practitioners -- human factors engineers, user-interface designers,
product testers -- who conduct usability tests and other empirical
studies, and need to use statistical analysis. It is intended for those
who have never taken a statistics course, and also for those who have
but would like to improve their understanding. People who entered the
CHI field from computer science, thereby missing college statistics
courses, will find it especially useful. The tutorial will focus
primarily on experimental methods, rather than methods for analyzing
observational or ethnographic data. This tutorial is not intended for
those who are already facile in experimental design and the use of
statistics.
THE TUTORIAL
This tutorial tries a new approach to teaching statistics
to CHI practitioners. It avoids the problems of traditional approaches
to teaching statistics, instead focussing on: building intuition and a
common-sense understanding of statistical concepts; avoiding
mathematics (beyond high-school algebra); learning when various kinds
of analyses are appropriate and inappropriate; and keeping analyses
simple so as to facilitate comprehension.
The instructors help participants develop intuitions by beginning with
simple probability (using concrete examples such as drawing marbles
from urns and tossing coins), moving on to what it means to take
measurements, and discussing what it takes to convince others that your
results mean what you say they mean. From this basis, the tutorial
proceeds slowly to statistical tests, always keeping the simple
probability and measurement examples and the notion of "convincing
evidence" in sight. In so doing, participants will develop intuitive
notions of: probability, measures of central tendency and deviation,
probability distributions, statistical power, how to design empirical
studies, and how to recognize faulty designs and analyses. The
instructors will also describe typical statistical "bloopers," using
examples culled from the popular press and the CHI literature as well
as made-up examples. Tutorial participants will be given an opportunity
to take and summarize measurements, design hypothetical experiments
plan the corresponding data analyses, and, if time permits, evaluate
experimental analyses reported by others.
The tutorial will cover the following topics:
- Why conduct experiments? Why use statistics?
- Simple examples of using statistics.
- Simple probability, with concrete examples.
- Summary statistics.
- Simple statistical tests.
- Common errors in using statistics.
In each case we will relate the statistical principle or test to real
world situations commonly faced by CHI practitioners. We will also
cover when to use particular designs and analyses, practical aspects of
how to use them (e.g., data collection tips) and when to find a
statistical consultant.
After completing this tutorial, participants will:
- Know why statistics are used, when to use them, and when not to.
- Have an intuitive grasp of statistical concepts, e.g.: probability,
central tendency and deviation, probability distributions, hypothesis
testing, statistical power.
- Understand at a common-sense level how to design simple but convincing
experiments.
- Understand a small set of simple statistical tests that will be useful
to them as CHI practitioners.
- Be able to evaluate the methods used by other practitioners in published
reports; in particular, be able to recognize common errors in applying
statistics.
- Be a more critical reader of supposedly "scientific" results reported
in the popular press.
PROBLEMS OF TRADITIONAL STATISTICS PEDAGOGY
Design and statistical
analysis of empirical experiments is usually taught in one of two ways,
neither of which is ideal for human-computer interaction professionals
who conduct experiments to resolve design and/or research issues.
Some statistics courses use a highly mathematical approach, deriving --
or asking students to derive -- the formulae that underlie probability
distributions or that define decision statistics. We call this the
"derivational" approach. The derivational approach is supposed to build
deep understanding of statistical analysis and experimental design. A
typical homework assignment in a class that used this approach would
not be to perform a statistical analysis on some data, but rather to
prove, say, that analysis of variance is a special case of linear
regression. While dragging students through lengthy and abstract
derivations may work for students who plan to become statisticians or
mathematicians, it doesn't work for those who just want to know how to
use statistics in their job-domain. In such courses, the majority of
students spend much of the time with their eyes glazed over. Since much
of what is presented passes over their head, they fail to develop the
deep understanding that the derivational approach is supposed to produce, and what little they do learn does not prepare them for designing real-world experiments and performing statistical analyses on the results.
The other common approach is commonly referred to as the "cookbook"
approach. Instructors who use the cookbook approach skip the
derivations and cut to the chase, i.e., the formulae needed to compute
various decision statistics. Formulae are given in forms that expedite
computation and are accompanied by examples that show how the formulae
are used. This approach is obviously intended for practitioners, not
statisticians. Deep understanding is definitely not the goal of the
cookbook approach. Indeed, the basic assumption is that one need not
understand statistics in order to use them. While it is true that one
need not understand statistics as thoroughly as people who develop new
statistical analyses do, some understanding is required to know what is
appropriate for the problem at hand, how to interpret results
correctly, and how to avoid violating assumptions underlying the
formulae. These days, the cookbook approach is often accompanied by
instruction in the use of the instructor's favorite statistical analysis
software, so students can even avoid learning the formula for a statictical test, the conditions and assumptions underlying its correct use, etc. All they "need" learn is how to operate the software. Students often emerge from such courses believing that any set of numbers entered into the software will yield a meaningful result. But, of course, garbage in, garbage out. So whereas students who learned statistics by the derivational approach tend to avoid statistics for the rest of their lives, students who learned it by the cookbook approach produce a lot of garbage.
In addition to having their own specific problems, the derivational and
cookbook methods of teaching statistics share the problem of
overemphasizing complex, high-power analyses that are rarely needed by
CHI practitioners. Statistical software exacerbates this by fostering
analytic overkill: applying a statistical cannon to a gnat-sized
problem. This may be appropriate for the domains of psychology and
communications, where the problems encountered often require complex
designs and analyses, and the effects sought are often subtle. However,
CHI practitioners -- those of us who conduct usability tests of
products or prototypes -- rarely, if ever, need complex analyses like
multiple regression, analysis of co-variance, factor analysis,
auto-correlation, or time-series analysis. Even three-way analysis of
variance and non-linear regression are borderline. Furthermore, we
usually don't care about small effects: if effects and differences
aren't strong enough to practically jump out of the data, they are
unimportant. Therefore, we can stick to analysis that detect only strong effects, which are, by and large, simpler. The bottom line is that for most of what we do, we can get by with a small repertoire of simple, easy-to-understand tests, so we don't need much of what is taught in statistics classes.
INSTRUCTOR BACKGROUNDS
Jeff Johnson has worked in the field of
Computer-Human Interaction since 1978. Before entering the CHI field,
he earned B.A. and Ph.D. degrees in experimental psychology from Yale
and Stanford Universities, respectively. Since then, he has worked at
Cromemco as an applications software developer, user interface
designer, and manager; at Xerox as a designer and implementor of
document systems; at US West and Hewlett-Packard Laboratories as a CHI
researcher; and at Sun/FirstPerson as a designer and evaluator of user
interfaces. He has published articles on topics including: the desktop
metaphor, the Xerox Star user interface, user-interface modes, document
editors, application development tools, user-interface responsiveness,
and task-specific user interfaces. He is active in Computer
Professionals for Social Responsibility and from 1991 to 1994 served as
Chair of its Board of Directors. He taught statistics in graduate
school, designed, co-implemented, and wrote the manual for an early
statistical package for PCs, and co-taught an early version of this tutorial at HP labs.
Robin Jeffries has degrees in mathematics, computer science, psychology
and Russian. She has studied and taught statistics both as a
statistician and a psychologist. She did her graduate work in
psychology at the University of Colorado and Post-doctoral work both at
Colorado and at Carnegie-Mellon University. She was a researcher in HCI
at Hewlett-Packard Laboratories for 11 years, and is currently User
Interface Architect for the Developer Products Business Unit of Sun
Microsystems. She has published in a variety of areas, including
comparisons of user interface evaluation methods and cognitive studies
of computer programers. She has taught statistics to students from
grade schoolers to college graduates, including co-teaching an earlier
version of this tutorial at HP Labs.