Design Principles for Data Presentation: Translating Analysis Into Visualization
Total Page:16
File Type:pdf, Size:1020Kb
Design principles for data presentation: Translating analysis into visualization Andrew Eppig, UC Berkeley CAIR Annual Conference 8 November 2012 Overview • Design Principles • Data Sample and Summary • Why Charts Matter • Visualization Process • Chart Selection • Layout • Aesthetics • Self-Sufficiency Check • Institutional Examples and Applications CAIR 2012 2 Andrew Eppig, 8 November 2012 Guiding Design Principles • Good visualizations start with good data and detailed analysis • Know your data • Good visualizations directly answer specific, focused questions • Know what question(s) you are asking • Good visualizations get out of the way of the data • Let the data tell its story without excess clutter or distraction “Too often we pay more attention to ‘pretty’ than to the most important element: information.” -- Dona Wong, The Secrets of Graphics Presentation CAIR 2012 3 Andrew Eppig, 8 November 2012 Data Sample, Summary, and Metrics Name Weight (lb.) Height (in.) BMI Batman 210 74 27.0 Michael Phelps 165 75 20.6 Wonder Woman 130 72 17.6 Hope Solo 140 69 20.7 weight[lb] BMI = 703 x height[in]2 Gender Group N BMI Mean BMI Std. Dev. Comics 1,239 26.0 4.2 Male Athletes 403 23.8 3.5 Models 493 21.6 2.3 Comics 505 20.3 3.4 Female Athletes 254 22.0 3.4 Models 489 18.2 2.7 CAIR 2012 4 Andrew Eppig, 8 November 2012 Default Excel Charting Average BMI by Group and Gender 35.000 30.000 25.000 20.000 Female 15.000 Male 10.000 5.000 0.000 Models Athletes Comics CAIR 2012 5 Andrew Eppig, 8 November 2012 Excessive Chart Junk Average BMI by Group and Gender 27.000 26.000 26.000 25.000 24.000 23.800 23.000 22.000 22.000 BMI 21.600 Models 21.000 20.300 Athletes 20.000 Comics 19.000 18.200 18.000 17.000 Female Male CAIR 2012 6 Andrew Eppig, 8 November 2012 Improved Excel Charting Average BMI by Group and Gender Comics 26! Male! Athletes 24! Models 22! Comics 20! Female! Athletes 22! Models 18! CAIR 2012 7 Andrew Eppig, 8 November 2012 Visualization Checklist • What stories does the data tell? Which story do you want to tell? • What visualization will best aid the story? • Who is the audience? • What metric should you use? • Which type of chart should you use? Junk Charts What is the layout of the visualization? Kaiser Fung, Fung, Kaiser • • How can details enhance the chart? Source: • Font, color, lines/shading, and text “What are the content-reasoning tasks that this display is supposed to help with?” -- Edward Tufte, Beautiful Evidence CAIR 2012 8 Andrew Eppig, 8 November 2012 Super humans - or - Analysis, Writing, Art, and Lettering by: Andrew Eppig Super Models? A nalysis Question: Are Comic Book Superheroes’ bodies more like Top Athletes’ or Top Models’ bodies? Are comparisons the same for both men and women? To account for differences in height and weight, Body Mass Index (BMI) is used: BMI = 703 x weight / height². Comparing the BMI distributions will reveal similarities or differences. Blue Shaded area shows middle 50% of superhero BMI distributions Chart SelectionBody Mass Index Distributions Male Superheroes FeMale Superheroes (N = 1,239) (N = 505) 50% of distribution is 50% of distribution is INSIDE shaded area INSIDE shaded area Median BMI = 25.2 Median BMI = 19.7 Probability Probability 0.0 0.1 0.0 0.0 0.1 0.2 15 20 25 30 35 15 20 25 30 35 Body Mass Index Body Mass Index “Meaningful quantitative information always involves relationships. When displayed in graphs, these relationshipsMale Athletes always boil down to one or more of feMale Athletes eight specific relationships: time series,(N = ranking, 403) part-to-whole, deviation, (N = 254) distribution, correlation, geospatial, nominal39% of comparison.” distribution is 31% of distribution is -- Stephen Few, Designing Effective TablesINSIDE and Graphsshaded area INSIDE shaded area Median BMI = 23.2 Median BMI = 21.4 CAIR 2012 9 Andrew Eppig, 8 November 2012 Probability Probability 0.0 0.1 0.0 0.1 0.2 15 20 25 30 35 15 20 25 30 35 Body Mass Index Body Mass Index Male Models feMale Models (N = 493) (N = 489) 17% of distribution is 28% of distribution is INSIDE shaded area INSIDE shaded area Median BMI = 21.6 Median BMI = 17.7 Probability Probability 0.0 0.1 0.0 0.1 0.2 15 20 25 30 35 15 20 25 30 35 Body Mass Index Body Mass Index S ummary: Male superheroes Tend to have a higher bmi than top MAle Athletes and a much higher bmi than top male models. So Male superheroes are beyond super human - but more like top male athletes than like top male models. Female superheroes tend to have a lower BMI than top female athletes and a higher bmi than top female models. So female superheroes are Neither super humans nor Super models but between Top female athletes and top female models. Female superheroes also have less variation in their BMI than male Superheroes or Top athletes of either gender. Sources: DC Comics (dc.wikia.com); marvel comics (Marvel.com/universe); 2008 US Olympic Team (www.2008.nbcolympics.com); Models.com (models.com) Super humans - or - Analysis, Writing, Art, and Lettering by: Andrew Eppig Super Models? A nalysis Question: Are Comic Book Superheroes’ bodies more like Top Athletes’ or Top Models’ bodies? Are comparisons the same for both men and women? To account for differences in height and Visualizationweight, Body Mass Index (BMI) is used: BMI = 703 x weight Layout / height². Comparing the BMI distributions will reveal similarities or differences. Blue Shaded area shows middle 50% of superhero BMI distributions Body Mass Index Distributions Male Superheroes FeMale Superheroes (N = 1,239) (N = 505) 50% of distribution is 50% of distribution is INSIDE shaded area INSIDE shaded area Median BMI = 25.2 Median BMI = 19.7 Probability Probability 0.0 0.1 0.0 0.1 0.2 15 20 25 30 35 15 20 25 30 35 Body Mass Index Body Mass Index Male Athletes feMale Athletes (N = 403) (N = 254) 39% of distribution is 31% of distribution is INSIDE shaded area INSIDE shaded area Median BMI = 23.2 Median BMI = 21.4 Probability Probability 0.0 0.1 0.0 0.0 0.1 0.2 15 20 25 30 35 15 20 25 30 35 Body Mass Index Body Mass Index Male Models feMale Models (N = 493) (N = 489) 17% of distribution is 28% of distribution is INSIDE shaded area INSIDE shaded area Median BMI = 21.6 Median BMI = 17.7 Probability Probability 0.0 0.1 0.0 0.0 0.1 0.2 15 20 25 30 35 15 20 25 30 35 Body Mass Index Body Mass Index S ummary: Male superheroes Tend to have a higher bmi than top MAle Athletes and a much “Tufte’s (1990)higher bmirecommendation than top male models. So Male superheroes of ‘small are beyond super multiples human - but more like’ [...] uses the top male athletes than like top male models. replicationFemale in superheroes the display tend to have ato lower facilitate BMI than top female athletes comparison and a higher bmi than to the top female models. So female superheroes are Neither super humans nor Super models but implicitbetween modelTop female athletes of and topno female change models. Female superheroes between also have lessthe variation displays.” in their BMI than male Superheroes or Top athletes of either gender. -- AndrewSources: Gelman, DC Comics (dc.wikia.com); marvel Exploratory comics (Marvel.com/universe); 2008 Data US Olympic Team Analysis (www.2008.nbcolympics.com for); Models.com Complex (models.com) Models CAIR 2012 10 Andrew Eppig, 8 November 2012 Aesthetic Considerations • Font: How do you increase legibility and decrease distraction? • Color: Which color palette is appropriate? • Line/Shading: Which weight, color, and style will enhance the final product? • Text: Can adding labels and narrative Plate Corp. ChemicalColor Source: provide useful context? “Hue contrast is easy to overuse to the point of visual clutter. A better approach is to use a few high chroma colors as color contrast in a presentation consisting primarily of grays and muted colors.” -- Maureen Stone, Choosing Colors for Data Visualization CAIR 2012 11 Andrew Eppig, 8 November 2012 Final Infographic Super humans SuperHeroes - or - Analysis, Writing, Art, and Lettering by: Andrew Eppig Super Models? A nalysis Question: Are Comic Book Superheroes’ bodies more like Top Athletes’ or Top Models’ bodies? Are comparisons the same for both men and women? To account for differences in height and weight, Body Mass Index (BMI) is used: BMI = 703 x weight / height². Comparing the BMI distributions will reveal similarities or differences. Blue Shaded area shows middle 50% of superhero BMI distributions Body Mass Index Distributions Male Superheroes FeMale Superheroes (N = 1,239) (N = 505) 50% of distribution is 50% of distribution is INSIDE shaded area INSIDE shaded area Median BMI = 25.2 Median BMI = 19.7 Probability Probability 0.0 0.1 0.0 0.1 0.2 15 20 25 30 35 15 20 25 30 35 Body Mass Index Body Mass Index Male Athletes feMale Athletes (N = 403) (N = 254) 39% of distribution is 31% of distribution is INSIDE shaded area INSIDE shaded area Median BMI = 23.2 Median BMI = 21.4 Probability Probability 0.0 0.1 0.0 0.0 0.1 0.2 15 20 25 30 35 15 20 25 30 35 Body Mass Index Body Mass Index Male Models feMale Models (N = 493) (N = 489) 17% of distribution is 28% of distribution is INSIDE shaded area INSIDE shaded area Median BMI = 21.6 Median BMI = 17.7 Probability Probability 0.0 0.1 0.0 0.0 0.1 0.2 15 20 25 30 35 15 20 25 30 35 Body Mass Index Body Mass Index S ummary: Male superheroes Tend to have a higher bmi than top MAle Athletes and a much higher bmi than top male models.