Organizing and Displaying Data

The principal goal of this chapter is to help you understand the most common approaches to organizing and displaying data and statistical results so that you are prepared to efficiently and effectively convey these to an audience. This chapter will prepare you to:

Employ techniques for displaying data and statistical results in tables and graphs.

Choose the best format to display data and statistical results.

Accurately interpret data and statistical results presented in a graph or table.

Bar chart

Boxplot

Frequency distribution

Nurses in practice, leadership, and research must present data or statistical results to accomplish a variety of purposes—you may need to make a case for changes in practice, for the use of resources, or perhaps you need to convince a patient that their behavior is leading to poor health outcomes. Visual representations of data and results are an essential component of dissemination, whether you are preparing a written report for your institution, giving an oral presentation with visual aids, or writing a manuscript for publication. Following data collection and analysis, what you have are a bunch of numbers and characters; it is important to be able to communicate the most salient aspects of the data to others. Similarly, as you are reading reports, you need to determine if the investigator made a defensible choice in the data analysis and presentation of data, and be able to understand a table or graph in the report.

If the number of data values is small, it may be easy to interpret the data set using language only; that is, we simply explain in writing or in an oral report what our findings are. However, if the data set is large or complex, it can be very challenging to figure out what the data have to say. Imagine that you have two data sets: one with measurements on infant mortality from 20 countries, and one with measurements from 200 countries. Which one will be easier to understand? Of course, it is the one with infant mortality measurements from 20 countries!

So, how can the nurse represent data and results concisely, accurately, and clearly when the data set is large? One important approach is to use tables, graphs, and charts. The old adage “a picture is worth a thousand words” is also true for data points and statistics. A graph or table depicting the data often tells the story in a more compelling fashion than words alone. However, data presentations can be clean and clear or muddy and misleading, depending upon the quality of our choices for display or the decisions of the investigator. In deciding what techniques to use for data display, we must ask ourselves two important questions. The first question is, “When should we use a graph or table?” Graphs and tables are likely to be useful when there is a large amount of, or complex, information to report. The second question is, “What is the best type of graph or table to display our data?” Sometimes, a simple bar chart may be adequate to present data. In other cases, displays that are more complex are needed to communicate precisely with the audience. Data displays should fit with the variable type and its level of measurement, and account for the audience characteristics.

Do you recall the different levels of measurement and types of variables we discussed in Chapter 3? Whether a variable is measured at the nominal, ordinal, interval, or ratio level will, in part, determine what data display methods you will choose. Suppose that we collect data on the gender of subjects. Gender is measured at the nominal level of measurement. A simple bar chart or pie chart may be a good way to convey this information because there are likely to be a limited number of response categories. Now consider data on income of nurses. Income is measured at the ratio level of measurement, and the data values are not limited to preset categories. A simple bar chart or pie chart will not be an appropriate display of these data because each income level reported by an individual subject will appear on the chart. Note that a bar or pie chart may be used if income measurements are categorized into a limited number of groups (e.g., 0μ$25,000, $25,001μ$50,000, and ≥ $50,001).

As a nurse in an advanced role, you are expected to accurately interpret and decide on the best approach for displays of data. The goal of this chapter is to provide guidance on what factors to consider when choosing how to best present your data and statistical results.

CASE STUDY

Rothwell, C. (2018). Progress review webinar: Hearing and other sensory or communication disorders and vision presentation. Washington, DC: U.S. Department of Health and Human Services, Office of Disease Prevention and Health Promotion. Retrieved from https://www.cdc.gov/nchs/data/hpdata2020/hp2020_ENT_VSL_and_V_Progress_Review-Presentation_R.pdf

A thoughtful analysis of data displayed in a simple and clear chart can quickly communicate the results of your work. For instance, in Healthy People 2020 the objectives of increased newborn hearing screening, audiologic evaluation, and intervention services are identified (Healthy People 2020, 2019). Nurses in hospitals and clinics contribute to achieving these objectives by developing and implementing screening and referral protocols. The line chart that follows shows the progress toward the 2020 goals in hearing screening, evaluation, and intervention from 2003 through 2015.

Consider your current efforts in improving patient health outcomes and how you might use a chart to show your results; for example, monitoring depression screening and referrals, investigating patterns of medication adherence, and tracking achievement of cholesterol reduction goals.

A line graph shows the progress toward the 2020 goals in hearing screening, audiologic evaluation, and intervention services of infants, from 2003 through 2015.

Healthy People 2020. (2019). Hearing and other sensory or communication disorders. Washington, DC: U.S. Department of Health and Human Services, Office of Disease Prevention and Health Promotion. Retrieved from https://www.healthypeople.gov/2020/topics-objectives/topic/hearing-and-other-sensory-or-communication-disorders/objectives

One of the most common ways of presenting data is a frequency distribution, which shows the possible values of a variable and the corresponding frequency of those values. The simplest frequency distribution has two columns—one for data values and the other for corresponding frequencies. Table 5-1 is an example frequency distribution displaying the number of calls per night received at an emergency department (ED) in a single month.

Table 5-1 Frequency Distribution of Number of Calls at an Emergency Department

Number of Calls	Frequency
0	2
1	5
2	7
3	16
4	1

The first column shows the number of calls received per night at an ED, ranging from zero to four calls. The second column shows how frequently an ED received the corresponding number of calls in 1 month. There were 2 nights when the ED received no calls and 16 nights when the ED received three calls.

A frequency distribution table can be extended by adding additional columns, such as cumulative frequency and cumulative percentage. Cumulative frequency is the sum of frequency of the current category with that of previous categories, and cumulative percentage is the ratio of cumulative frequency of the category of interest to the total number of subjects. Table 5-2 shows the extended frequency distribution table of Table 5-1.

Table 5-2 Extended Frequency Distribution of Number of Calls at an Emergency Department

Number of Calls	Frequency	Cumulative Frequency	Cumulative Percentage
0	2	2	0.06
1	5	7	0.23
2	7	14	0.45
3	16	30	0.97
4	1	31	1.00

A frequency distribution can be either ungrouped or grouped. Our previous example was an ungrouped frequency distribution. If the data are measured at the categorical level, either nominal or ordinal, an ungrouped frequency distribution is the usual choice, as there will be a limited number of category responses. Table 5-3 shows a frequency distribution table for gender.

Table 5-3 Frequency Distribution Table for Gender

Gender	Frequency
Male	20
Female	70
Trans	5
Total	95

If the variables are measured at the interval or ratio level of measurement, the choice of ungrouped or grouped frequency distribution depends on the range of the data values. If the range of data values is small, such as found in Tables 5-1 and 5-2, an ungrouped frequency distribution table still may be appropriate. If not, a grouped frequency distribution may be a more efficient way of displaying the data.

Suppose that you want to create a frequency distribution of mortality rates of 200 countries. The data will range over many different values, and an ungrouped frequency distribution table would be quite large to display the data. In this case, a grouped frequency distribution table will show the data more concisely as the distinct intervals of data values will be grouped to simplify the information about a variable. Table 5-4 shows a grouped frequency distribution table of mortality rates of 200 countries.

Table 5-4 Grouped Frequency Distribution Table of Mortality Rates of 200 countries

Mortality Rates (Death per 1,000 Live Births)	Frequency
0μ10	9
11μ20	27
21μ30	42
31μ40	111
41 and above	11

Although a grouped frequency distribution provides good summative information on a large or complex data set, we lose information on individual data values. For example, how would you answer the question, “Which of these countries have an infant mortality rate of 5 or 6 deaths per 1,000 live births?” It is impossible to answer that question by looking at the grouped frequency distribution table.

To create a frequency distribution in Microsoft Excel, you will open Frequency.xlsx and click on “Recommended PivotTables” under the Insert tab, as shown in Figure 5-1. In the Choose Data source box, you will enter A1:B21 as the range, as shown in Figure 5-2. Clicking “OK” will move you to the Recommended PivotTables box, as shown in Figure 5-3. Leaving “Count of Subject by Frequency” selected as default and clicking “OK” will produce the output (Figure 5-4).

Locating the Recommended PivotTables in Excel.

An Excel screenshot shows the selection of the Recommended PivotTables command of the Tables group under the Insert menu.

Courtesy of Microsoft Excel © Microsoft 2020.

Defining a data range in Excel.

An Excel screenshot shows the Choose Data source box dialog box which appears on selection of the Recommended PivotTables command.

Courtesy of Microsoft Excel © Microsoft 2020.

Selecting the Recommended PivotTables in Excel.

An Excel screenshot displays the PivotTables formats to be selected.

Courtesy of Microsoft Excel © Microsoft 2020.

Example output for frequency distribution in Excel.

A screenshot from an excel worksheet displays the output of the count of subject by frequency table.

Courtesy of Microsoft Excel © Microsoft 2020.

To create a frequency distribution in IBM SPSS Statistics software (“SPSS”), you will open Frequency.sav and go to Analyze > Descriptive Statistics > Frequencies, as shown in Figure 5-5. In the Frequencies box, you will then move a variable of interest—in this case, Frequency—into “Variable(s),” as shown in Figure 5-6. Clicking “OK” will then produce the output, as shown in Figure 5-7.

Selecting Frequencies in SPSS.

A screenshot of an I B M, S P S S, Statistics Editor shows the selection of Frequencies from the Descriptive Statistics drop-down list under the Analyze menu.

Reprint Courtesy of International Business Machines Corporation, © International Business Machines Corporation. “IBM SPSS Statistics software (“SPSS”)”. IBM^®, the IBM logo, ibm.com, and SPSS are trademarks or registered trademarks of International Business Machines Corporation.

Defining a variable frequency in SPSS.

A screenshot of an I B M, S P S S, Statistics Editor shows the Frequencies dialog box.

Reprint Courtesy of International Business Machines Corporation, © International Business Machines Corporation. “IBM SPSS Statistics software (“SPSS”)”. IBM^®, the IBM logo, ibm.com, and SPSS are trademarks or registered trademarks of International Business Machines Corporation.

Example output of frequency distribution table in SPSS.

A screenshot of an I B M, S P S S, Statistics Editor shows the output of frequency distribution table.

Reprint Courtesy of International Business Machines Corporation, © International Business Machines Corporation. “IBM SPSS Statistics software (“SPSS”)”. IBM^®, the IBM logo, ibm.com, and SPSS are trademarks or registered trademarks of International Business Machines Corporation.

Determining what type of frequency distribution table is best depends again on how the data are being measured and the range of values. If the range of data values is small, choosing an ungrouped frequency distribution table and retaining as much of the information as possible is probably best. If the data range is large or very complex, a grouped frequency distribution table will convey information in a more concise format, even though some details of the information will be lost.

Nurses using data and statistical results for evidence-based practice, quality/process improvement, and research may choose to convey information about data in graphs and charts that would be difficult or cumbersome to examine in text format. In fact, graphs and charts are the best method of describing data when the data set is large. Graphs and charts are visually impactful and, when well designed, can be easily understood. There are many different types of graphs and charts available, and we will now discuss how to choose the right graphs or chart.

A useful chart or graph should show the data or statistics in a meaningful, clear, and efficient manner. Poor choices may lead to ambiguous or misleading interpretations of the data. Suppose that we needed to present the collected measurements of the weights of 100 patients. Because each patient will have different measurements in weight, choosing a pie chart as a description method will not display the data clearly, as shown in Figure 5-8. When there are many data values to display, seeing each data value as a separate category does not help us understand the data. In the case of patient weights, the use of a histogram is a much better way to understand the data (Figure 5-9).

Mistakenly used pie chart.

A pie chart shows 51 closely placed sectors, with the data not displayed vividly and the shades used for each sector not able to be differentiated clearly. The pie chart is drawn for 51 weight measurements, with data ranging between 80 and 154.

Histogram using the same data set as Figure 5-8.

A histogram shows the collected measurements of the weights of 100 patients.

There are many types of graphs and charts in Excel and SPSS, and we will explain how to create each with examples of variables. We also discuss the appropriate levels of measurement for each graph and chart. Each data set used here is also included for your practice in the online resources for this text, accessed using the code found in the front of this text.

Discrete or Categorical Data

Bar charts and pie charts are two useful ways to represent discrete data (i.e., data that are categorical, with a fixed number of categories measured at the nominal or ordinal level). They are commonly used charts as they are easy to create, use, and interpret.

Bar Chart

The bar chart is the most appropriate choice for variables measured at the nominal and ordinal level of measurement and can be used to display one or more variables. If a bar chart is used with the ordinal level of measurement, ordering the rank helps the reader in interpreting the chart (e.g., if discussing pressure ulcers, listing the measurements in order—stage I, stage II, stage III, stage VI). A typical bar chart has the response categories on the horizontal axis and the corresponding frequencies of each category on the vertical axis; this chart helps you discern much about the data, such as the most/least common category, the difference of one bar relative to the other, and changes in frequency over time.

To create a simple bar chart for a variable in Excel, you will open Location.xlsx. Please note that the data should be organized in a frequency table to create a bar chart. Click on the arrow by the Insert Column or Bar Chart button under the Insert tab and then select either “2-D Column” (vertical) or “2-D Bar” (horizontal) from the list, as shown in Figure 5-10. Clicking on the chart type, “2-D Column” in this case, will produce the output (Figure 5-11).

Selecting a bar chart in Excel.

An Excel screenshot displays the types of bar graphs to be selected. The data in the worksheet are as follows. The column headings are row labels with a drop-down list, and count of subjects. Row 1: 1, 45. Row 2: 2, 55. Row 3: Grand Total, 100.

Courtesy of Microsoft Excel © Microsoft 2020.

Example bar chart for Location in Excel.

A bar graph shows total subjects in rural and urban locations. The horizontal axis lists rural and urban, and vertical axis labeled, count of subject, ranges from 0 to 60, in increments of 10. The data from graph is as follows. Rural: 45; Urban: 55.

Courtesy of Microsoft Excel © Microsoft 2020.

To create a simple bar chart for a variable in SPSS, you will open Location.sav, go to Graph > Legacy Dialogues, and select the type of graph/chart desired—in this case, Bar, as shown in Figure 5-12. In the Bar Charts box, you will leave “Simple” and “Summaries for groups of cases” selected as the default, and click “Define,” as shown in Figure 5-13. In the Define Simple Bar box, you will then move the variable of interest—in this example, Location—into “Category Axis,” as shown in Figure 5-14. Click “OK” to produce the output (Figure 5-15).

Selecting a bar chart in SPSS.

A screenshot in I B M, S P S S, Statistics Editor shows the selection of the Graphs menu from the menu bar, from which the Legacy Dialogs option is chosen, which in turn lists different types of graphs. The bar option is selected.

Reprint Courtesy of International Business Machines Corporation, © International Business Machines Corporation. “IBM SPSS Statistics software (“SPSS”)”. IBM^®, the IBM logo, ibm.com, and SPSS are trademarks or registered trademarks of International Business Machines Corporation.

Selecting a simple bar chart in SPSS.

A screenshot in I B M, S P S S, Statistics Editor shows the selection of the bar option from the Graphs menu.

Reprint Courtesy of International Business Machines Corporation, © International Business Machines Corporation. “IBM SPSS Statistics software (“SPSS”)”. IBM^®, the IBM logo, ibm.com, and SPSS are trademarks or registered trademarks of International Business Machines Corporation.

Defining a variable in a bar chart in SPSS.

A screenshot in S P S S Editor defines a variable in a simple bar chart.

Reprint Courtesy of International Business Machines Corporation, © International Business Machines Corporation. “IBM SPSS Statistics software (“SPSS”)”. IBM^®, the IBM logo, ibm.com, and SPSS are trademarks or registered trademarks of International Business Machines Corporation.

Example bar chart for nurses’ job location in SPSS.

A bar graph in S P S S shows the job count in rural and urban locations. The horizontal axis lists rural and urban, and vertical axis labeled, count, ranges from 0 to 60, in increments of 10. The data from graph is as follows. Rural: 45; Urban: 55.

Reprint Courtesy of International Business Machines Corporation, © International Business Machines Corporation. “IBM SPSS Statistics software (“SPSS”)”. IBM^®, the IBM logo, ibm.com, and SPSS are trademarks or registered trademarks of International Business Machines Corporation.

Note that a bar chart can also be created horizontally (i.e., the response categories on the vertical axis and the frequencies of each category on the horizontal axis), as in Excel. To create a horizontal bar chart, double-click on the chart and then click the “Transpose Chart Coordinate System” button in the Chart Editor box, as shown in Figure 5-16. An example output is shown in Figure 5-17. Additional examples where a bar chart can be handy to present the data include ethnicity, insurance category (Medicare/Medicaid), marital status, patient acuity, and gender.

Defining a horizontal bar chart in SPSS.

Reprint Courtesy of International Business Machines Corporation, © International Business Machines Corporation. “IBM SPSS Statistics software (“SPSS”)”. IBM^®, the IBM logo, ibm.com, and SPSS are trademarks or registered trademarks of International Business Machines Corporation.

Example horizontal bar chart in SPSS.

A horizontal bar chart of in an S P S S Editor shows the job count in rural and urban locations. Data from the graph are as follows: Rural: 45; Urban: 55.

Reprint Courtesy of International Business Machines Corporation, © International Business Machines Corporation. “IBM SPSS Statistics software (“SPSS”)”. IBM^®, the IBM logo, ibm.com, and SPSS are trademarks or registered trademarks of International Business Machines Corporation.

Pie Chart

A pie chart is a circular chart where pieces within the chart represent a corresponding proportion of each category, and it is an appropriate choice for nominal and ordinal level of measurement. A pie chart is simple to create, use, and understand, much like a bar chart when it is created for a single variable. Pie charts are useful for visualizing the most commonly occurring class compared with the whole and the relative size of different classes. However, it may be difficult to compare data when the percentages are pretty similar across categories or when comparing across different pie charts.

To create a simple pie chart for a variable in Excel, you will open Ethnicity.xlsx and note that the data should be in the frequency table to create a pie chart. Click on the arrow by the Insert Pie or Doughnut Chart button under the Insert tab and then select a “2-D Pie” from the list, as shown in Figure 5-18. Clicking on the chart type will produce the output (Figure 5-19).

Selecting a pie chart in Excel.

An Excel screenshot shows the selection of the required pie chart.

Courtesy of Microsoft Excel © Microsoft 2020.

Example pie chart for Ethnicity in Excel.

An Excel screenshot shows a pie chart display of the count of people in different ethnic categories. Approximate data from the chart in percent are as follows. African American: 24; Asian: 10; Caucasian: 17; Hispanic: 17; Native American: 15; Other: 17.

Courtesy of Microsoft Excel © Microsoft 2020.

To create a simple pie chart for a variable in SPSS, you will open Ethnicity.sav and go to Graph > Legacy Dialogues > Pie, as shown in Figure 5-20. In the Pie Charts box, you will leave “Summaries for Groups of Cases” checked as default and click “Define,” as shown in Figure 5-21. In the Define Pie box, you will then move a variable of interest, Ethnicity, into “Define Slices by,” as shown in Figure 5-22. Clicking “OK” will then produce the output. An example output is shown in Figure 5-23.

Selecting a pie chart in SPSS.

A screenshot in I B M, S P S S, Statistics Editor shows the selection of the Graphs menu from the menu bar, from which the Legacy Dialogs option is chosen, which in turn lists different types of graphs. The pie option is selected.

Reprint Courtesy of International Business Machines Corporation, © International Business Machines Corporation. “IBM SPSS Statistics software (“SPSS”)”. IBM^®, the IBM logo, ibm.com, and SPSS are trademarks or registered trademarks of International Business Machines Corporation.

Selecting a simple pie chart in SPSS.

A screenshot in I B M, S P S S, Statistics Editor shows the selection of the pie option from the Graphs menu.

Reprint Courtesy of International Business Machines Corporation, © International Business Machines Corporation. “IBM SPSS Statistics software (“SPSS”)”. IBM^®, the IBM logo, ibm.com, and SPSS are trademarks or registered trademarks of International Business Machines Corporation.

Defining a variable in a pie chart in SPSS.

A screenshot in S P S S Editor defines a variable in a pie chart.

Reprint Courtesy of International Business Machines Corporation, © International Business Machines Corporation. “IBM SPSS Statistics software (“SPSS”)”. IBM^®, the IBM logo, ibm.com, and SPSS are trademarks or registered trademarks of International Business Machines Corporation.

Example pie chart for nurses’ ethnicity in SPSS.

An S P S S screenshot shows a pie chart display of the count of nurses in different ethnic categories. Approximate data from the chart in percent are as follows. African American: 24; Asian: 10; Caucasian: 17; Hispanic: 17; Native American: 15; Other: 17.

Reprint Courtesy of International Business Machines Corporation, © International Business Machines Corporation. “IBM SPSS Statistics software (“SPSS”)”. IBM^®, the IBM logo, ibm.com, and SPSS are trademarks or registered trademarks of International Business Machines Corporation.

Continuous Data

When the variable is continuous (i.e., interval and ratio), both bar charts and pie charts become inefficient in displaying the collected data. It is important to show how the data are distributed with continuous data, because the data values can range differently, unlike with categorical data. Better choices for continuous variables are histograms, stem and leaf plots, and boxplots.

Histogram

A histogram is similar to a bar chart in structure, which explains why histograms are often mistaken for bar charts. However, you can think of it as a graphical way of presenting information from a frequency distribution. It organizes a group of data points into several intervals, and the bar in a histogram represents the frequency in corresponding intervals, not in predefined limited numbers of categories as in a bar chart. With a histogram, you can understand the general shape of the data distribution (i.e., what are the data trends) and the most commonly occurring data values.

To create a histogram in Excel, you will open Age.xlsx. With any cell in column A selected, click on the arrow by the Insert Statistic Chart button under the Insert tab, and then select “Histogram” from the list, as shown in Figure 5-24. Clicking on the chart type will produce the output (Figure 5-25). Note that you can change “Chart Title” by double-clicking and typing in it.

Selecting a histogram in Excel.

An Excel screenshot displays the types of histographs to be selected. The data in the worksheet has Age as column heading with a list of numerical data.

Courtesy of Microsoft Excel © Microsoft 2020.

Example histogram for Age in Excel.

A histogram shows the frequency of age range.

Courtesy of Microsoft Excel © Microsoft 2020.

To create a histogram in SPSS, you will open Age.sav and go to Graph > Legacy Dialogues > Histogram, as shown in Figure 5-26. In the Histogram box, you will then move a variable of interest, Age, into “Variable,” as shown in Figure 5-27. Clicking “OK” will then produce the output. An example output is shown in Figure 5-28. Note that a histogram can be obtained in other places, such as “Explore.”

Selecting a histogram chart in SPSS.

A screenshot in I B M, S P S S, Statistics Editor shows the selection of the histogram option from the Graphs menu. The data in the worksheet lists different ages.

Reprint Courtesy of International Business Machines Corporation, © International Business Machines Corporation. “IBM SPSS Statistics software (“SPSS”)”. IBM^®, the IBM logo, ibm.com, and SPSS are trademarks or registered trademarks of International Business Machines Corporation.

Defining a variable in a histogram in SPSS.

A screenshot in I B M, S P S S, Statistics Editor shows the selection of the histogram option from the Graphs menu. The data in the worksheet lists different ages.

Reprint Courtesy of International Business Machines Corporation, © International Business Machines Corporation. “IBM SPSS Statistics software (“SPSS”)”. IBM^®, the IBM logo, ibm.com, and SPSS are trademarks or registered trademarks of International Business Machines Corporation.

Example histogram for nurses’ age in SPSS.

A histogram in S P S S shows the frequency of age range.

Reprint Courtesy of International Business Machines Corporation, © International Business Machines Corporation. “IBM SPSS Statistics software (“SPSS”)”. IBM^®, the IBM logo, ibm.com, and SPSS are trademarks or registered trademarks of International Business Machines Corporation.

Stem and Leaf Plot

Stem and leaf plots are also used for showing the distribution of continuous data. These are similar to histograms, but they have greater flexibility and display more information. In addition to the overall shape of distribution, a stem and leaf plot shows information regarding individual data values. Stem and leaf plots are not available in Excel, so we go straight to SPSS.

To create a stem and leaf plot in SPSS, you will open Satisfaction.sav and go to Analyze > Descriptive Statistics > Explore, as shown in Figure 5-29. In the Explore box, you will move a variable of interest, Job Satisfaction, into “Dependent List,” as shown in Figure 5-30. Note that you can move the categorical variable into “Factor List” if you want to create separate stem and leaf plots for a categorical variable; this will create separate plots for different categories of the variable. Stem and leaf plot is checked as the default in the “Plots” button, as shown in Figure 5-31, so clicking “OK” will then produce the output. Figure 5-32 is an example of a stem and leaf plot.

Selecting a stem and leaf plot in SPSS.

A screenshot in I B M, S P S S, Statistics Editor depicts the stem and leaf plot selection.

Reprint Courtesy of International Business Machines Corporation, © International Business Machines Corporation. “IBM SPSS Statistics software (“SPSS”)”. IBM^®, the IBM logo, ibm.com, and SPSS are trademarks or registered trademarks of International Business Machines Corporation.

Defining a variable in a stem and leaf plot in SPSS.

A screenshot in S P S S Editor defines a variable in a stem and leaf plot. The numerical data in the worksheet lists Satisfaction values.

Reprint Courtesy of International Business Machines Corporation, © International Business Machines Corporation. “IBM SPSS Statistics software (“SPSS”)”. IBM^®, the IBM logo, ibm.com, and SPSS are trademarks or registered trademarks of International Business Machines Corporation.

Defining a stem and leaf plot in SPSS.

A screenshot in I B M, S P S S, Statistics Editor defines the stem and leaf plot selection. The numerical data in the worksheet lists Satisfaction values.

Reprint Courtesy of International Business Machines Corporation, © International Business Machines Corporation. “IBM SPSS Statistics software (“SPSS”)”. IBM^®, the IBM logo, ibm.com, and SPSS are trademarks or registered trademarks of International Business Machines Corporation.

Example stem and leaf plot for nurses’ job satisfaction in SPSS.

A table displays the job satisfaction stem-and-leaf plots.

Reprint Courtesy of International Business Machines Corporation, © International Business Machines Corporation. “IBM SPSS Statistics software (“SPSS”)”. IBM^®, the IBM logo, ibm.com, and SPSS are trademarks or registered trademarks of International Business Machines Corporation.

As you can see in Figure 5-32, the variable, Satisfaction, looks similar on both sides from the center, and the actual data values are shown. In this way, stem and leaf plots not only show the distribution, but also information about individual data values.

Boxplot

A boxplot can be used to display more information than any other chart discussed so far in this chapter. It is a good choice for variables measured on the continuous scale and allows for comparisons across groups. A boxplot does not show individual data values as a stem and leaf plot does, but it does display other information, such as the overall distribution, the center of the distribution, the quartile, and possible outliers.

To create a boxplot in Excel, you will open Heartrate.xlsx. With any cell in column A or B selected, click on the arrow by Insert Statistic Chart button under the Insert tab and then select a “Box and Whisker” from the list, as shown in Figure 5-33. Clicking on the chart type will produce the output (Figure 5-34). Note that you can change “Chart Title” by double-clicking and typing on it.

Selecting a boxplot in Excel.

An Excel screenshot shows the selection of the Box and Whisker option in Excel, which is under the Charts group under the Insert menu.

Courtesy of Microsoft Excel © Microsoft 2020.

Example boxplot for Heartrate in Excel.

An Excel screenshot shows the box plot for heart rate.

Courtesy of Microsoft Excel © Microsoft 2020.

To create a boxplot in SPSS, you will open Heartrate.sav and go to Graph > Legacy Dialogues > Boxplot, as shown in Figure 5-35. In the Boxplot box, you will leave “Simple” and “Summaries for Groups of Cases” selected as the default, and click “Define,” as shown in Figure 5-36. In the Define Simple Boxplot box, you will move a variable of interest, Heart Rate, into “Variable” and Gender into “Category Axis,” as shown in Figure 5-37. Clicking “OK” will then produce the output. An example output is shown in Figure 5-38.

Selecting boxplot in SPSS.

A screenshot in I B M, S P S S, Statistics Editor shows the selection of the boxplot.

Reprint Courtesy of International Business Machines Corporation, © International Business Machines Corporation. “IBM SPSS Statistics software (“SPSS”)”. IBM^®, the IBM logo, ibm.com, and SPSS are trademarks or registered trademarks of International Business Machines Corporation.

Selecting a simple boxplot in SPSS.

A screenshot in I B M, S P S S, Statistics Editor shows the selection boxplot from the Graphs menu. The numerical data in the worksheet lists two columns, Heart rate with values between 30 and 117 at random and Gender with values 1 and 2 at random.

Reprint Courtesy of International Business Machines Corporation, © International Business Machines Corporation. “IBM SPSS Statistics software (“SPSS”)”. IBM^®, the IBM logo, ibm.com, and SPSS are trademarks or registered trademarks of International Business Machines Corporation.

Defining variables in a boxplot in SPSS.

A screenshot in S P S S Editor defines a variable in a simple boxplot chart.

Reprint Courtesy of International Business Machines Corporation, © International Business Machines Corporation. “IBM SPSS Statistics software (“SPSS”)”. IBM^®, the IBM logo, ibm.com, and SPSS are trademarks or registered trademarks of International Business Machines Corporation.

Example boxplot for 174 patients’ heart rate in SPSS.

A screenshot in S P S S Editor shows a boxplot of patients’ heart rate.

Reprint Courtesy of International Business Machines Corporation, © International Business Machines Corporation. “IBM SPSS Statistics software (“SPSS”)”. IBM^®, the IBM logo, ibm.com, and SPSS are trademarks or registered trademarks of International Business Machines Corporation.

A boxplot can be drawn either vertically, as shown in Figure 5-38, or horizontally. It is one of the important charts used to describe the data in exploratory data analysis. Interpretation of a boxplot is as follows:

The box in the plot contains the middle 50% of the data set. The middle line in the box represents the 50th percentile, the exact middle number of the entire data set, whereas the upper edge represents the 75th percentile and the lower edge represents the 25th percentile.

If the middle line is not exactly in the middle of the box, it is an indication that the data is not equally distributed in both sides from the center.

The ends of the vertical lines, which are called “whiskers,” represent the minimum and maximum data values. The lower whisker equals 1.5 times the interquartile range (IQR) below the first quartile (25th percentile), and the upper whisker equals 1.5 times the IQR above the third quartile (75th percentile).

Any data value outside of the whiskers is considered to be a possible outlier, which is defined as an unusual data value in the current data set.

We need to give you an explanation on percentile before we finish this discussion on boxplots. Percentile is a measure of location and tells us how many data values fall below a certain percentage of observations. For example, if you were in the 75th percentile on an exam, then you did better than 75% of the other people taking that exam.

There are other graphs and charts that may be useful for displaying data. They include line charts and scatterplots. Line charts are used to examine trends of variables over time, and scatterplots are used to examine the relationship between variables.

Line Chart

Similar to the bar chart and pie chart, the line chart is a good choice for displaying the frequency of categories. It is created by connecting dots representing the data values of each category, as shown in Figure 5-39. In this example, the horizontal axis represents “Month,” a categorical variable, and the vertical axis represents “Systolic Blood Pressure,” a continuous variable.

Example line chart for systolic blood pressure (SBP) in SPSS.

A line chart shows the systolic blood pressure in S P S S.

Reprint Courtesy of International Business Machines Corporation, © International Business Machines Corporation. “IBM SPSS Statistics software (“SPSS”)”. IBM^®, the IBM logo, ibm.com, and SPSS are trademarks or registered trademarks of International Business Machines Corporation.

A line chart is useful when trying to find and compare changes over time or when trying to define meaningful patterns of variables. To create a line chart in Excel, you will open SBP.xlsx. With any cell in column A selected, click on the arrow by the Insert Line or Area Chart button under the Insert tab and then select “2-D Line” from the list, as shown in Figure 5-40. Clicking on the chart type will produce the output (Figure 5-41). Note that you can change “Chart Title” by double-clicking and typing on it.

Selecting a Line chart in Excel.

An Excel screenshot displays the types of line graphs to be selected; 2 D line graph is selected. The data in the worksheet shows the column heading as S B P, with the numerical data between 100 and 108 at random.

Example line chart for SBP in Excel.

A line chart shows the S B P values in Excel.

To create a line chart in SPSS, you will open SBP.sav and go to Graph > Legacy Dialogs > Line, as shown in Figure 5-42. In the Line Charts box, you will leave “Simple” and “Summaries for Groups of Cases” selected as the default, and click “Define,” as shown in Figure 5-43. In the Define Simple Line box, you will then move a variable of interest, Systolic Blood Pressure, into “Variable” after clicking a radio button for “Other Statistics” (e.g., mean) and Month into “Category Axis,” as shown in Figure 5-44. Clicking “OK” will then produce the output, as shown in Figure 5-39.

Selecting a line chart window in SPSS.

A screenshot in I B M, S P S S, Statistics Editor shows the line graph option selected.

Reprint Courtesy of International Business Machines Corporation, © International Business Machines Corporation. “IBM SPSS Statistics software (“SPSS”)”. IBM^®, the IBM logo, ibm.com, and SPSS are trademarks or registered trademarks of International Business Machines Corporation.

Selecting a simple line chart window in SPSS.

A screenshot in I B M, S P S S, Statistics Editor shows the selection of the line option from the Graphs menu.

Reprint Courtesy of International Business Machines Corporation, © International Business Machines Corporation. “IBM SPSS Statistics software (“SPSS”)”. IBM^®, the IBM logo, ibm.com, and SPSS are trademarks or registered trademarks of International Business Machines Corporation.

Defining a variable in a line chart in SPSS.

A screenshot in S P S S Editor defines a variable in a simple line chart.

Reprint Courtesy of International Business Machines Corporation, © International Business Machines Corporation. “IBM SPSS Statistics software (“SPSS”)”. IBM^®, the IBM logo, ibm.com, and SPSS are trademarks or registered trademarks of International Business Machines Corporation.

Scatterplot

Scatterplots are used when an investigator wishes to examine the relationship between two continuous variables—for example, age and systolic blood pressure. Relationships can be in either positive or negative directions. A positive relationship means that both variables move in the same direction, such as increased smoking and the increased probability of getting lung cancer, whereas a negative relationship means that the variables move in opposite directions, such as lower self-esteem and increased depression. Figure 5-45 shows an example of a scatterplot.

Example scatterplot for a relationship between height and weight in SPSS.

A scatterplot in S P S S shows the relationship between height and weight.

Reprint Courtesy of International Business Machines Corporation, © International Business Machines Corporation. “IBM SPSS Statistics software (“SPSS”)”. IBM^®, the IBM logo, ibm.com, and SPSS are trademarks or registered trademarks of International Business Machines Corporation.

To create a scatterplot in Excel, you will open WeightHeight.xlsx. With any cell in column A selected, click on the arrow by Insert Scatter (X, Y) or Bubble Chart button under the Insert tab, and then select “Scatter” from the list, as shown in Figure 5-46. Clicking on the chart type will produce the output (Figure 5-47). Note that you can change “Chart Title” by double-clicking and typing on it, and also adjust scales on both the x-axis and y-axis by double-clicking and changing the range.

Selecting Scatterplot in Excel.

An Excel screenshot displays the types of scatterplots to be selected. The data in the worksheet shows the column headings as Weight and Height, with ten rows of numerical data.

Example scatterplot for weight and height in Excel.

A scatterplot in Excel shows the relationship between height and weight.

To create a scatterplot in SPSS, you will open WeightHeight.sav and go to Graph > Legacy Dialogs > Scatter/Dot, as shown in Figure 5-48. In the Scatter/Dot box, you will leave “Simple Scatter” selected as the default, and then click “Define,” as shown in Figure 5-49. In the Simple Scatterplot box, you will move an independent variable, Height, into “X Axis” and a dependent variable, Weight, into “Y Axis,” as shown in Figure 5-50. Clicking “OK” will then produce the output, as shown in Figure 5-45.

Selecting a scatterplot in SPSS.

A screenshot in S P S S shows the selection of Graphs menu from the menu bar, from which Legacy Dialogs option is chosen, under which Scatter / Dot is selected. The data in the sheet show column headings as Weight and Height with numerical values.

Reprint Courtesy of International Business Machines Corporation, © International Business Machines Corporation. “IBM SPSS Statistics software (“SPSS”)”. IBM^®, the IBM logo, ibm.com, and SPSS are trademarks or registered trademarks of International Business Machines Corporation.

Selecting a simple scatterplot in SPSS.

A screenshot in I B M, S P S S, Statistics Editor shows the selection of the Scatter / Dot option from the Graphs menu.

Reprint Courtesy of International Business Machines Corporation, © International Business Machines Corporation. “IBM SPSS Statistics software (“SPSS”)”. IBM^®, the IBM logo, ibm.com, and SPSS are trademarks or registered trademarks of International Business Machines Corporation.

Defining variables in a scatterplot in SPSS.

A screenshot in S P S S Editor defines a variable in a simple Scatterplot.

Reprint Courtesy of International Business Machines Corporation, © International Business Machines Corporation. “IBM SPSS Statistics software (“SPSS”)”. IBM^®, the IBM logo, ibm.com, and SPSS are trademarks or registered trademarks of International Business Machines Corporation.

We have discussed many different methods of displaying data and statistical results, and most of these tables and charts are easy to create, use, and understand. However, if any of these charts are not carefully selected and designed, they can present false or misleading information. You should consider four elements when selecting the best type of graph or table for your data.

First, you need to ask yourself whether you have chosen the most appropriate type of graph for the data. This means considering the level of measurement and the size and complexity of the data set. For example, you would not want to generate a histogram to display the data values on an ethnicity variable, or a bar chart for the sodium content level of 100 patients. Second, you should make sure that you have provided enough information on each component of the graph. The title for the graphic should be descriptive, and the variable(s) should be clearly identified. Third, make sure that the independent variable is placed on the horizontal axis, while the dependent variable is placed on the vertical axis. If it is reversed, the graph may illustrate a different picture than what you want to show. Finally, you also have to ensure that the graph has been drawn on the proper scale, as it may show a different scenario than what is actually occurring if it is incorrectly drawn. Figure 5-51 is a perfect example of how an inappropriately scaled graph can be misleading. Although the data do not show a strong relationship between the number of years worked at the current job and job satisfaction (as shown in the graph on the left), the graph on the right shows a very strong positive relationship because of the inappropriately defined scale.

Example of data distortion.

Two scatterplots depict the relationship between years at work and job satisfaction, with the second scatterplot showing an inappropriate scaled graph.

Reprint Courtesy of International Business Machines Corporation, © International Business Machines Corporation. “IBM SPSS Statistics software (“SPSS”)”. IBM^®, the IBM logo, ibm.com, and SPSS are trademarks or registered trademarks of International Business Machines Corporation.

Graphs and charts are a useful and efficient way of displaying data, especially when the amount of data to present is large. When the table, graph, or plot is created carefully and appropriately, your data will be more clearly understood and more meaningful. The careful nurse researcher, clinician, and leader should keep in mind what each chart is good for and choose a chart accordingly.

Bar charts and pie charts are good for displaying the frequency or percentage of given categories. Line charts are also good when the variable for the horizontal axis is categorical.

Histograms, stem and leaf plots, and boxplots are good for displaying the distribution of continuously measured variables. A histogram is similar to a bar chart in structure, but it is used to show the distribution of data values. Stem and leaf plots show the same information as a histogram, but they show the actual data. A boxplot is probably the chart with the most information, as it shows information on potential outliers, the center values, and the 25th and 75th percentiles.

However, all of these charts can be easily manipulated to produce false information if they are not carefully designed. Therefore, we should think carefully about which type of graphs or charts will best fit the data, and be certain to include enough information on each component of the graph so that readers can accurately interpret the display.

What are the purposes of constructing a graph or table to display information about a variable?

Levels of measurement are an important factor in determining which chart to use. Explain why this is the case.

Refer to the graph that follows for questions 3 to 5.

Does this chart seem to be appropriate for this data? Why or why not? Explain your answer.

Is the title appropriately worded?

Is there any component of this chart that you think is not complete or is confusing? Explain.

A histogram shows the frequency of exercising twice a week.

True or false: A histogram is useful when an investigator is trying to display information about a categorical variable.

True or false: Bar charts and pie charts can convey similar types of information.

True or false: There is no chart that allows an investigator to identify possible outliers.

Which of the following is a good example of data that can be appropriately displayed with a line chart?

Ethnicity

Systolic blood pressure

Income

Age

Which of these charts allows an investigator to examine a possible relationship between two continuous variables?

Histogram

Bar chart

Scatterplot

Line chart

Healthy People 2020

. (2019). Hearing and other sensory or communication disorders. Washington, DC: U.S. Department of Health and Human Services, Office of Disease Prevention and Health Promotion. Retrieved from https://www.healthypeople.gov/2020/topics-objectives/topic/hearing-and-other-sensory-or-communication-disorders/objectives

Rothwell, C. (2018). Progress review webinar: Hearing and other sensory or communication disorders and vision presentation. Washington, DC: U.S. Department of Health and Human Services, Office of Disease Prevention and Health Promotion. Retrieved from https://www.cdc.gov/nchs/data/hpdata2020/hp2020_ENT_VSL_and_V_Progress_Review-Presentation_R.pdf

section name header

Learning Objectives ⬇

Key Terms ⬆ ⬇

Introduction ⬆ ⬇

Grouping Data ⬆ ⬇

Graphs and Charts ⬆ ⬇

Other Graphs and Charts ⬆ ⬇

Presenting the Data in the Best Format ⬆ ⬇

Summary ⬆ ⬇

Critical Thinking Questions ⬆ ⬇

Self-Quiz ⬆ ⬇

Reference ⬆