<<

Journal of Clinical 60 (2008) 311e317

Radar plots: a useful way for presenting multivariate health care M. Joan Saarya,b,* aGage Occupational and Environmental Health Unit, Department of Occupational and Environmental Health, St. Michael’s Hospital, Toronto, ON, Canada bDepartment of Medicine, University of Toronto, Toronto, ON, Canada Accepted 14 April 2007

Abstract Objective: Health care researchers have not taken full advantage of the potential to effectively convey meaning in their multivariate data through graphical presentation. The aim of this paper is to translate knowledge from the fields of analytical chemistry, toxicology, and marketing research to the field of medicine by introducing the radar , a useful graphical display method for multivariate data. Study Design and Setting: Descriptive study based on literature review. Results: The radar plotting technique is described, and examples are used to illustrate not only its programming language, but also the differences in tabular and bar approaches compared to radar-graphed data displays. Conclusion: Radar graphing, a form of radial graphing, could have great utility in the presentation of health-related research, especially in situations in which there are large numbers of independent variables, possibly with different measurement scales. This technique has particular relevance for researchers who wish to illustrate the degree of multiple-group similarity/consensus, or group differences on mul- tiple variables in a single graphical display. Ó 2008 Elsevier Inc. All rights reserved.

Keywords: Statistical data interpretation; Multivariate analysis; Computer ; dissemination methods; Outcome assessment; Consensus

1. Graphing multivariate data when the number of variables and groups to be displayed is large, or when the measurement scales are different. For ex- There are some data for which meaning is not necessar- ample, potential difficulties in data communication emerge ily clear when presented in tabular or text formats. As the when there are multiple variables for each of several number of variables measured increases, so does the diffi- groups. The challenge for the reader is to comprehend the culty in describing their meaning. Survey data is a case meaning of multiple-group differences on a list of vari- in point. Survey data are increasingly used to gather infor- ables. We may know the average of all cases or the percent- mation about a of topics including, but not limited to, age of explained by an independent variable, but symptoms, quality of life, perceptions of care, and satisfac- we do not find out if a particular case fits a pattern [1]. Fur- tion. When such data are gathered on multiple groups and thermore, despite the existence of numerous books that de- then compared, the usual method of comparison is to per- tail various ways of graphing multivariate data, the field of c2 form statistical tests, be they , ANOVA, multivariate medicine continues to rely heavily on text, tabular, and sim- , or t-tests and then describe the numer- ple graphical description of data. The purpose of this paper ical data in text or format. However, although data ta- is to introduce radar graphing as a useful technique for the bles are sometimes excellent methods for storing data, they presentation of multivariate health care data. are often difficult media for communicating and analyzing Most graphs in popular media present limited amounts it [1]. of information. The and the trend graph are exam- In addition to text or tables, research results can also be ples of graphical forms that usually present limited data. presented in graphical formats. Graphical displays are par- Other graphical forms that limit the sophistication of the ticularly suitable for illustrating relationships and trends graphical information displayed include pie that concisely. However, graphs can become visually ‘‘difficult’’ are only capable of breaking a whole into its component parts, and bivariate plots that show [only] the relationship * Corresponding author. Occupational Health Clinic, St. Michael’s Hospital, 30 Bond Street, 4th Floor, Shuter Wing, Toronto, ON, Canada between two variables, the ‘‘norm’’ for the relationship, M8V 3E3. Tel.: 416-709-6737; fax: 416-255-0689. and [1]. Displaying multivariate relationships in E-mail address: [email protected] two dimensions requires ‘‘some ingenuity’’ [2,3].

0895-4356/08/$ e see front matter Ó 2008 Elsevier Inc. All rights reserved. doi: 10.1016/j.jclinepi.2007.04.021 312 M.J. Saary / Journal of Clinical Epidemiology 60 (2008) 311e317 method and have a series of spokes or rays projecting from What is New? a central point, with each ray representing a different vari- able label. The values of the variables are encoded into the What this paper adds to existing literature: lengths of the rays, and the values so plotted are sometimes 1. Introduces readers of clinical medical literature to connected to form an enclosed figure [5]. They are akin to a useful form of graphing more common to the profile plots that have been wrapped about a central point. fields of engineering, chemistry, and marketing. Dominant perceptual properties often include size and 2. Illustrates how to produce a with shape of the resulting figure [3]. In some situations, charac- standard, widely available software. teristic shapes such as stars are used to represent optimal configurations [6]. In other cases, symmetrical polygons are also used as indicators of the normal or nominal condi- tions [5]. In the 1950s, during his investigation of variation in nat- Multivariate data can be represented visually in a variety of ural populations, Anderson [7] used the metroglyph as two-dimensional ways such as glyphs, profiles, Andrews’ a semigraphical method of analysis, illustrating the data curves, Chernoff faces, and boxes [4]. In searching for al- from four individuals for whom ratings on a 3-point scale ternative ways to examine and display patterns in data, for five different qualities were available. Although a bota- a form of graphing called the radar plot was identified. This nist himself, Anderson proposed that his semigraphical type of graph is practical and profoundly useful for a wide glyphs could be useful in fields as diverse as physiology, variety of data present in the current medical literature. My morphology, psychology, and linguistics [7]. Later in the interest in radar graphs developed when I sought to suc- 1970s, Kiviat graphs were introduced by Kiviat and Ko- cinctly express stakeholders’ views on numerous variables lence as a for monitoring computer system hardware concurrently. I had gathered an enormous amount of rich performance [8,9]. Kiviat plots are now commonly used in interview data and wanted to describe the degree of consen- a wide range of fields, predominating in business manage- sus or discord among the groups. I sought to determine ment as a tool for comparing performance metrics [10], and ways to present the data that would enhance the patterns in engineering, computing, and information technology for found among groups and that would supplement text-based monitoring system development trends, etc. [11]. In fact, presentation. There are abundant applications for this versa- many fields use radial plots in one form or another. Star tile graphing method in both clinical and basic science charts [12], with their side-by-side glyph arrangement for health care research. different groups, are commonly seen in the fields of analyt- For clinical studies, examples of uses of radar graphs ical chemistry, toxicology, and petroleum geology [13e16]. might include the following: Alternate names for similar plots include circle , 1. Changes over time in multiple variables polygons, and snowflakes [4]. a. In a single individual b. In different groups 3. Radar plots in medical research 2. Multipleetreatment group differences on multiple- outcome measures Radial plots in any of their various forms have not been 3. Difference between disease conditions on multiple widely used in the field of medicine. Scant examples of their variables. use include distinguishing metabolic profiles of different cancer classes with star plots [17], and using star charts to Radar graphing could also be considered an effective display EEG (electroencephalogram) spectral values [18]. mechanism for graphing basic science data; some examples Of particular interest for this paper is the radar plot, so of which might include named because of the way it resembles a Position In- 1. Comparative compositions of pharmaceuticals dicatordthe typical two-dimensional layout of radar return 2. Molecular or genetic similarity between cases or from a radial trace commonly used in air traffic control, groups weather forecasting, onboard ships, etc. As with other 3. Biologic marker composition/variability between dif- forms of radial plotting, the use of radar charts in the med- ferent tissues or tumors. ical literature is rare, but present. Yamaguchi et al. [19] used radar charts for monitoring multiple-item self-ratings of sleep disturbance over time; Kosaka et al. [20] used radar 2. Radial plots charts to express quality-of-life factors in breast surgery pa- tients; Minemawari and Kato [21] used the radar chart as Radial plots have existed for years as important descrip- a geriatric assessment system; and Jette and Jette [22] pres- tive tools for multivariate data. In general, the common fea- ent comparisons of health-outcome data measured using ture of radial plots is that they are a circular graphing various with radar charts. M.J. Saary / Journal of Clinical Epidemiology 60 (2008) 311e317 313 The radar plot, when used as a method to display data, is much simpler than when it is associated with statistical Town A MD Knowledge Town B analyses, which can sometimes be complex because of 5 Town C the multidimensionality. This is perhaps one key reason why the clinical science of medicine has not adopted radar Availability MD Attitude 2.5 graphing; at the outset, it might appear to be too complex a mathematical technique to be practical. Another chal- lenge with such plots, as with others, is that the order of ar- 0 rangement of variables can impact perception of the data Cost Simplicity [3]. Clearly though, graphs can be arranged along a contin- uum from descriptive to analytical. Motivating the audience to access the data is an important purpose for graphs. A Wait Time Access graph can break up extensive narrative and encourage the reader to continue to engage the material [1]. Fig. 1. Radar chart comparing three groups on seven variables. I became enthusiastic about the radar chart as a way to graphically illustrate stakeholder consensus. Typically, con- other two. In addition, it is also clear that all three patient sensus is presented as percentage agreement. Correlations groups were satisfied with MD Attitude, compared to Ac- or kappas can also be used, but are not appropriate for cer- cess, Cost, and Availability where there is greater disper- tain types of data, or with numerous groups where multiple sion. Patients in Town C give poorer ratings (the inner correlations require follow-up, and sample sizes large ring) than the other two groups on all but two variables. enough to support statistical power for such analyses may Now, comparing the radar chart to the same data pre- be difficult to obtain. Percentages are often plotted as bar sented in a typical bar chart (see Fig. 2), the radar chart charts, but each variable must then be either presented on is notably less cluttered. It provides an immediate sense a separate chart, or listed separately such that the degree of the big picture, as well as the detail for each individual of overlap or agreement is not obvious. However, when variable. The radar chart form of graphical presentation is each stakeholder is represented by a particular shape on a more efficient way to display a wide variety of data in a radar graph that emerges from their results on each of a single picture. In addition, the radar chart can accommo- several measured variables, the overlap becomes clear. date additional information without becoming overwhelm- To illustrate, a fictitious scenario was created in which ing, simply by adding spokes to the wheel. patients from three different towns were asked to rate their An example of the use of radar graphs for large volumes level of satisfaction with their with the health of information is seen in the work of El Ansari and Phillips care system on a scale from 1 (dissatisfied) to 5 (totally sat- [23]. Data from four different stakeholder groups are over- isfied) on seven different variables: physician knowledge laid on a graph of 16 variables each ranging from 1 to 7. (MD Knowledge), physician attitude (MD Attitude), sim- Data from multiple groups on multiple variables can be eas- plicity of the system (Simplicity), access to emergency care ily summarized on a single intelligible graph, the text ex- (Access), wait time for tests (Wait Time), costs incurred if planations of the same taking pages to elaborate. This is any (Cost), and availability of specialists (Availability). not to say that the graph oversimplifies the data, but only Group results are presented in Table 1 using the that patterns are more easily ascertained. traditional approach of providing means for all the vari- ables. Although informative, the table does not give an overall sense of each group’s opinions. 4. Generating the radar plot d Comparing the radar chart (see Fig. 1 generated in Ex- Radar plots can be produced by a variety of commercially cel, Microsoft ExcelÓ 2006 Microsoft Corporation) to the available computing programs. The output from two of such tabular data in Table 1, total satisfaction is represented by the outermost ring. In other words, the ideal situation is rep- 6 Town A resented by the perimeter. Clearly, from this chart it is ap- Town B 5 Town C parent that patients in Town B are more satisfied than the 4 3 Table 1 Means for satisfaction variables 2 1 Mean satisfaction ratings 0 MD MD Wait Knowledge Attitude Simplicity Access Time Cost Availability Cost Access Simplicity Wait Time Town A 1 5 4 4 5 3 3 MD Attitude Availability Town B 5 5 4 5 5 5 4 MD Knowledge Town C 2 5 3 2 3 2 1 Fig. 2. Bar chart comparing three groups on seven variables. 314 M.J. Saary / Journal of Clinical Epidemiology 60 (2008) 311e317

Table 2 1 Data for SASÓ chart production examplednumber of deaths 40 by location, gender, and treatment group Age Gender Treatment Deaths 29 11 1 10 11 2 4 11 3 7 11 4 65 19 2 11 5 15 12 1 2 12 2 14 12 3 2 12 4 1 12 5 19 21 1 14 1 - Control 21 2 2 2 - 0.25 mg 21 3 5 3 - 0.50 mg 4 - 0.75 mg 21 4 7 5 - 1.00 mg 21 5 2 22 1 1 4 3 22 2 20Fig. 3. Number of deaths in 1 year for each treatment: results from SASÓ. 22 3 7 22 4 5 22 5 4and one continuous variable was created. For this (see Table 2), the outcome measure was the number of deaths over a 1-year period in a control and four treatment programs will be presented as the reader is guided through the levels of a new drug (1 5 control, 2 5 0.25 mg, process of producing the charts: the SASÓ statistical soft- 3 5 0.50 mg, 4 5 0.75 mg, and 5 5 1.00 mg) stratified by ! ware package (SAS Institute, Cary, NC, USA), and Microsoft gender (1 5 male, 2 5 female) and age (1 5 50 years, ExcelÓ 2006, Microsoft Corporation. 2 5>50 years). When using the GRADAR procedure in SASÓ, the data An example of code to generate the simple SASÓ radar typically consists of one or more group variables and one or plot in Fig. 3 comparing only the five treatment levels more outcome variables for which there is a count or fre- would be: quency for each level of the group variable (see Table 2). proc gradar data=example; The vertices of the radar plot are determined by the levels of a single variable that is given in the CHART statement. chart treat / freq=count ; The spokes in the chart are positioned much like a clock run; quit; starting at the 12-o’clock position and moving in a clock- wise direction. For this example, vertices are created for each level of In Excel, the radar plot is generated by using the Insert the treatment group variable, and a star is created with each function that allows the option to insert one of a variety of spoke representing one of the five levels of treatment, and charts. Radar plots are among a number of other less com- monly used graphing styles available on this menu includ- ing surface, doughnut, bubble, and stock plots. In this Control example, Excel does not prepare the chart by performing 40 calculations on the raw data; hence, the sums (or means) 30 of the raw-data groups must be represented in a separate new table (see Table 3). 20 1.00 mg 0.25 mg To demonstrate the production of radar charts in SASÓ, 10 a fictitious data set including several categorical variables 0 Table 3 Data for ExcelÓ chart production exampledsums of number of deaths for each level of treatment derived from same raw data in Table 2 Treatment group Total deaths

Control 27 0.75 mg 0.50 mg 0.25 mg 40 Total Deaths 0.50 mg 21 0.75 mg 19 Fig. 4. Number of deaths in 1 year for each treatment: results from 1.00 mg 40 ExcelÓ. M.J. Saary / Journal of Clinical Epidemiology 60 (2008) 311e317 315 each point on the spoke reflecting the magnitude of the var- had a higher number of deaths among males, whereas treat- iable COUNT, which in this case would be the number of ment with 0.25 mg has markedly a greater number of deaths in a year for each of the control and four treatment deaths for females. The gender difference in numbers of categories. Fig. 4 illustrates the same radar chart generated deaths is less substantial for treatments with 0.50 mg, from the data in an ExcelÓ spreadsheet. 0.75 mg, and 1.00 mg. Group differences could also be en- If another group variable is also considered, say gender, hanced in this type of graphing by removing the radial grid then the values of COUNT are aggregated across gender lines altogether and focusing on the resulting shapes of the within each level of treatment. In this example, to display graphed data. the deaths in each treatment group on separate charts for The simple examples in this paper demonstrate the util- each gender as in Fig. 5 (Fig. 6 for ExcelÓ), use the ity of radar graphing for illustrating group differences. In ACROSS5!variableO option in the CHART statement: this case, if the number of treatment options was to in- crease, or if the other grouping variable (i.e., age) was also proc gradar data=example; overlaid, the radar graph would remain an appealing pre- sentation method compared to tabular or other graphing chart treat / freq=count across=gender; formats because the graph would remain uncluttered. run;

quit; 5. Conclusion Figs. 7 and 8, however, are more efficient radar charts because one gender is overlaid on top of another. In ExcelÓ Researchers must both convince audiences of their anal- specifying the data ranges for each series will allow multi- ysis and allow audiences to examine the data and determine ple groups’ results to be displayed on a single chart. In the meaning for themselves. Often, however, tables and ba- SASÓ, use the OVERLAY5!variableO option in the sic figures are the means relied upon to convey information. CHART statement: Although the examples in this paper are relatively simplis- tic for explanatory purposes, it is clear that additional vari- proc gradar data=example; ables that increase the number of spokes, and additional group variables that increase the number of overlays, do chart treat / freq=count overlay=gender; not hinder the ease of interpretation the way such additional run; information complicates bar charts or text. This paper has described the usefulness and simplicity of quit; a radial plotting technique called radar plotting for graphing multivariate data. Although commonly used in other fields Although it is clear from the resulting shapes in Figs. 5 such as business management and engineering, it has not and 6 that the genders appear to have differences in death found a stronghold in the presentation of health-related rates for the various treatment options, the overlaid Figs. research results. Herein lies an important opportunity for 7 and 8 much more distinctly indicate that the control group knowledge translation to the medical community.

1 34 Male34 Female

18 18

5 3 2 3

1 - Control 2 - 0.25 mg 3 - 0.50 mg 4 - 0.75 mg 5 - 1.00 mg 4 3

Fig. 5. Gender differences in death rate by treatment separately from SASÓ. 316 M.J. Saary / Journal of Clinical Epidemiology 60 (2008) 311e317

Males Number of Deaths Male Control Control Female 30 40 Deaths 30 20 20 1.00 mg 0.25 mg 1.00 mg 10 0.25 mg 10

0 0

.075 mg 0.50 mg 0.50 mg .075 mg Fig. 8. Gender differences in death rate by treatment overlaid from ExcelÓ. Females

Control Acknowledgments 40 Deaths I would sincerely like to acknowledge Michael Manno, 30 MSc (Gage Occupational and Environmental Health Unit, 20 Department of Occupational and Environmental Health, 1.00 mg 0.25 mg St. Michael’s Hospital) for his technical help in preparing 10 the graphs, and his assistance with commentary relating 0 to programming language. No compensation was received for such contributions. I would also like to acknowledge the help of my PhD supervisor, Dr. Linn Holness, MD, MHSc, FRCPC (Gage Occupational and Environmental Health Unit, Department .075 mg 0.50 mg of Occupational and Environmental Health, St. Michael’s Hospital; Department of Medicine, University of Toronto; Fig. 6. Gender differences in death rate by treatment separately from Department of Public Health Sciences, University of Toron- ExcelÓ. to), and my thesis committee members Dr. Monique Gignac, PhD (Department of Public Health Sciences, University of Toronto; Division of Outcomes and Popula- tion Health, University Health Network) and Dr. Cameron Mustard ScD (Department of Public Health Sciences, Uni- 1 versity of Toronto; Institute for Work and Health) for their 34 supervision and constructive commentary about the manu- script content. No compensation was received for such contributions. 18 Joan Saary has been supported by a fellowship award from CIHR (Canadian Institutes of Health Research). Fund- 5 3 2 ing was in the form of salary support. The funder had no role in the design and conduct of the study; collection, management, analysis, and interpretation of the data; and preparation, review, or approval of the manuscript. Male 1 - Control Female 2 - 0.25 mg 3 - 0.50 mg 4 - 0.75 mg References 5 - 1.00 mg [1] Henry GT. Graphing data: Techniques for display and analysis. Thou- sand Oaks, CA: Sage; 1995. 4 3 [2] Friendly M. for multivariate data. SAS SUGI Conference, Accessed June 15, 2006. Available at www.math. Fig. 7. Gender differences in death rate by treatment overlaid from SASÓ. yorrku.ca/SCS/sugi/sugi16-paper.html 1991. M.J. Saary / Journal of Clinical Epidemiology 60 (2008) 311e317 317

[3] Friendly M, Kwan E. Effect ordering for data displays. Comput Stat [14] Halpern HI. Development and applications of light-hydrocarbon- Data Anal 2003;43:509e39. based star diagrams. Am Assoc Pet Geol Bull 1996;79:801e15. [4] Du Toit SHCD, Steyn AGW, Stumpf RH. Graphical exploratory data [15] Tughu N, Ladiwala A, Breneman CM, Cramer S. Identification of analysis. New York: Springer-Verlag; 1986. chemically selective displacers using parallel batch screening exper- [5] Budsaba K, Smith CE, Riviere JE. Compass plots: a combination of iments and quantitative structure efficacy relationship models. Anal star plot and analysis of means to visualize significant interactions in Chem 2003;75:5806e16. complex toxicology studies. Toxicol Mech Methods 2000;10: [16] Wever HE. Petroleum and source rock characterization based on C7 313e32. star plot results: examples from Egypt. Am Assoc Pet Geol Bull [6] Elliott RW. Kiviat-graphs as a means for displaying performance 2000;84:1041e54. data for on-line retrieval systems. J Am Soc Inf Sci 1976;27: [17] Vion-Dury J, Favre R, Sciaky M, Kriat M, Confort-Gouny S, 178e82. Harle JR, et al. Graphic-aided study of metabolic modifications of [7] Anderson E. A semigraphical method for the analysis of complex plasma in cancer using proton magnetic resonance spectroscopy. problems. Technometrics 1960;2:387e91. NMR Biomed 1993;6:58e65. [8] Kiviat PJ, Kolence KW. Software unit profiles and Kiviat figures. [18] Pola P, Cruccu G, Dolce G. Star-like display of EEG spatial values. ACM SIGMETRICS Perf Evaluat Rev 1973;2(3):13e20. Electroencephalogr Clin Neurophysiol 1980;50:527e9. [9] Morris MF. Kiviat graphsdconventions and ‘‘figures of merit’’. [19] Yamaguchi N, Matsubara S, Momonoi F, Morikawa K, Takeyama M, ACM SIGMETRICS Perf Evaluat Rev 1974;3(3):2e8. Maeda Y. An attempt of radar chart expression of a self-rating scale [10] Gillooly C. A business view. InformationWeek [Manhasset] for sleep disturbance. Psychiatry Clin Neurosci 1998;52:165e7. 1998;679:147e50. [20] Kosaka A, Osaku M, Watabiki Y, Hojo M, Kasezawa A, Hamao F. [11] Pinzger M, Gall H, Fischer M, Lanza M. Visualizing multiple evolu- QOL of breast-conserved patients analyzed by radar chart. Biother- tion metrics. In: Proceedings SoftVis ’05dACM symposium on apy 1995;9:1023e30. software . St. Louis, MO: Association for Computing [21] Minemawari Y, Kato T. The radar chart method and its analysis as Machinery; 2005. a comprehensive geriatric assessment system for elderly disabled [12] Chambers JM, Cleveland WS, Kleiner B, Tukey PA. Graphical patients. Jap J Geriatr 1999;36:206e12. methods for . Belmont, CA: Duxbury Press; 1983. [22] Jette DU, Jette A. Health status assessment in the occupational health [13] Agematsu Y, Ikarashi I, Furuyama Y, Kobayashi N, Barada K, setting. Orthop Clin North Am 1996;27:891e902. Tauchi K. A new visual method for evaluating multiple data (diag- [23] El Ansari W, Phillips CJ. Interprofessional collaboration: a stake- nostic radar chart) in general toxicological study. J Toxicol Sci holder approach to evaluation of voluntary participation in commu- 1993;18:133e42. nity partnerships. J Interprof Care 2001;15:351e68.