Author Monographs

Text Complexity

By Dr. Timothy Shanahan
University of Illinois at Chicago
Member, National Reading Panel; President, International Reading Association, 2006
Chair, National Literacy Panel and National Early Literacy Panel

All texts are not equal. Some texts are harder to read and comprehend. Just as there are individual differences in children, there are individual differences in the texts that we ask children to read. Some children read better than others, and there are a variety of reasons for these differences in reading abilities. The same is true of texts. Some texts are easier or harder, and there are several reasons for these differences. The purpose of this essay is to describe what factors cause texts to differ and to explore the relationship of text difficulty and children’s learning.

What makes text complex?

Since the 1920s, there has been interest in measuring text difficulty or readability (Lively & Pressey, 1923). The basic purpose of these measures has been to try to place texts along a continuum or scale, from easiest to hardest. The most straightforward way to do this would be to have a group of people read the various texts and complete comprehension tests based on each one. The texts that most people understood would be rated as easy, and the ones that fewer people comprehended or that had fewer questions answered correctly would be rated as more difficult. Readability estimates, however, are not quite this direct.

It would be too costly and inconvenient to perform the above-explained process for every text, of course, so a second fundamental premise of readability measurement is that the measure must specifically summarize only text features or characteristics, and not how easily the text has been read in the past. On the basis of this enumeration of text features, the formula has successfully predicted reader comprehension for texts.

Readability measures have evolved since they first appeared. Now, most readability formulas only measure two factors: word complexity and sentence complexity (Klare, 1984). Thus, measuring the readability or complexity of a text involves an evaluation of the words: counting the average number of letters or syllables, checking the frequency of the words (common words are usually easier than rare words), or considering how abstract or concrete the words may be. Similarly, average sentence or clause lengths, as well as other measures of sentence or grammatical complexity, are included. These word and sentence measures are then combined into mathematical formulas that are used to predict actual reading comprehension. If a formula does well with some set of texts, then it is assumed that it will do equally well with other texts that have not been actually tested with readers.

Readability measures often try to express student performance in terms of grade levels. (Measures that do not use grades usually provide some kind of conversion chart to show how to translate the readability score to a grade level.) The idea is that the readability formula is revealing the reading level at which readers would be able to understand the text. Thus, a fifth-grade text would be readable or comprehensible to the average fifth-grade reader.

Early on, these formulas improved our ability to predict text difficulty, but not by very much (they were able to anticipate about 35 percent of the variation in actual reading comprehension performance). Over the years, readability measures improved, and most of them are now able to explain about 50 to 60 percent of the comprehension variance. One recent approach, the implementation of Lexiles, is able to explain about 90 percent of reading comprehension differences (Smith, 2012).

Why aren’t these measures able to do better than that? Why don’t they predict 100 percent of the variation in reading performance? There are two basic reasons for this. The first has to do with the nature of the measures themselves. Remember, only word and sentence difficulty are being considered. There are other text features that may impact complexity, but these are not accounted for directly in these formulas. These measures do well because other unmeasured text features tend to be correlated with the word and sentence challenges which are measured in the formulas. Thus, texts that have simple words and sentences will usually be about simpler concepts and will use simpler or more straightforward organizations. It is possible, however, that some of these other features will make a text either easier or harder for readers to understand than predicted. Accuracy is important, but so is efficiency (adding other variables that are harder to measure only improves accuracy a small bit, and these other features are much harder and more time-consuming to measure).

What are some of the other text complexity factors, beyond sentence complexity and vocabulary sophistication, that are likely to influence reading comprehension? One such factor is the complexity of the ideas themselves. Some topics (rocket science, brain surgery, the infield fly rule) are more complicated, sophisticated, or subtle than others. Also, ideas connect with each other across a text through a feature called coherence. Readers have to interpret pronouns, synonyms, and other connectors to make sense of a text. The greater the distance across ideas, the more varied the connections, and the more competition for a link (if the pronoun “he” is mentioned, how many male characters could be the topic of discussion?), the harder the text will be. Text structure or organization matters, too. Ideas can be arranged across text in many ways (e.g., time sequence, causation, lists, comparison). How straightforward the structure is will help to determine how complicated the text will be to readers. Some texts may be more-or-less complete, including sufficient background information and context to help readers to interpret the ideas; other, more difficult texts may require that readers bring a lot of this kind of information to the text. Finally, texts may use various literary devices (e.g., irony, repetition, metaphor) or data presentation tools (e.g., tables and charts), adding to the interpretive burden.

Another reason for the imperfect measurement of readability formulas has to do with what we are trying to predict: reading comprehension. Readers differ in what they bring to a text in terms of background knowledge, motivation, abilities to cope with ambiguity, and so on. Even if two students have the same “reading level,” they may perform differently with a particular text, which would reduce the accuracy of the text-level prediction. Reading comprehension will always be the product of two factors: the text and the reader. Readability only measures the variations in the text; reader variations will present variability that the readability measure cannot capture specifically.

Relationship of Text Complexity and Learning

Readability formulas examine text complexity features to predict reading comprehension. What connection does text complexity have to learning to read? Scholars have long claimed that the match of text difficulty and a reader’s ability to read will be an important determinant of learning (Betts, 1946).

The original idea of this reader/text match concept is that it is important to place students in texts that are not too easy (if the students can read a text easily without help from a teacher or someone else, then there would be little opportunity to learn from that text) or not too hard (if the students struggle too much to make sense of a text, there would be a lot to learn, but they might be overwhelmed or frustrated, or the teacher might find it difficult to provide sufficient support to promote learning).

From this basic conception came the idea that there are independent, instructional, and frustration levels of performance. Independent level refers to texts that students can read successfully without help; frustration level refers to those texts that are so hard for the students to comprehend that it would be difficult for them to learn; and instructional level would be the optimum level that would provide an opportunity for learning, but without being too hard or too frustrating.

The most straightforward approach to the problem would be to test children in terms of their ability to read texts at particular readability levels and then to randomly assign those children to reading instruction that employed texts written at different levels. […] reveal the student/text match that leads to the greatest amount of learning.

Unfortunately, there have been few such studies, and instructional practice has usually relied upon assertion rather than empirical evidence. The most influential claims have indicated that students needed to be taught from texts they could read with 95 to 98 percent oral reading accuracy and with a reading comprehension of 75 to 89 percent (Betts, 1946). More recent claims for optimum text complexity levels are based on minor adjustments to Betts’ original assertions, even though we now know that Betts made his numbers up without evaluating them against student learning (Shanahan, 1983).

The few studies that do exist (Morgan, Wilcox, & Eldredge, 2000) suggest that students do better (that is, learn more) with texts that are somewhat harder than Betts claimed, and there is clear evidence that school texts have been getting easier and easier over a period of about seventy years for grades 3 to 12 (Chall, Conrad, & Harris, 1977; Hayes, Wolfer, & Wolfe, 1996). Past efforts to place students in texts have striven to protect students from confronting too much difficulty, and yet, such protections also place an important limitation on opportunities to learn. “Just as it’s impossible to build muscle without weight or resistance, it’s impossible to build robust […]
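
As a rough illustration of how the word and sentence measures described in this essay can be combined into a single formula, the sketch below computes the Flesch Reading Ease score, one widely used readability formula. The essay does not single out this formula; it is chosen here only as a familiar example. The function names are hypothetical, and the vowel-group syllable counter is an assumption made for illustration: it is a crude heuristic, not a dictionary-based count.

```python
import re
from typing import Tuple


def count_syllables(word: str) -> int:
    """Rough syllable estimate (an assumption): count runs of vowels."""
    groups = re.findall(r"[aeiouy]+", word.lower())
    return max(1, len(groups))


def text_features(text: str) -> Tuple[float, float]:
    """Return (average words per sentence, average syllables per word)."""
    # Sentence complexity: split on runs of ., ! or ? and drop empty pieces.
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    # Word complexity: treat runs of letters (and apostrophes) as words.
    words = re.findall(r"[A-Za-z']+", text)
    if not sentences or not words:
        raise ValueError("text must contain at least one sentence and one word")
    syllables = sum(count_syllables(w) for w in words)
    return len(words) / len(sentences), syllables / len(words)


def flesch_reading_ease(text: str) -> float:
    """Combine the two text features with the Flesch Reading Ease weights."""
    avg_sentence_length, avg_syllables_per_word = text_features(text)
    return 206.835 - 1.015 * avg_sentence_length - 84.6 * avg_syllables_per_word


if __name__ == "__main__":
    easy = "The cat sat. The dog ran."
    hard = "Readability formulas estimate comprehension difficulty."
    print(round(flesch_reading_ease(easy), 2))
    print(round(flesch_reading_ease(hard), 2))
```

Higher scores indicate easier text, so the short, simple sentences score well above the single sentence built from long words. A score of this kind summarizes only text features; as the essay argues, it says nothing about the background knowledge or motivation that a particular reader brings.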