Research and Development in the Computer and Information Sciences
Total Page:16
File Type:pdf, Size:1020Kb
DOCUMENT RESUME ED 039 891 LI 001 944 AUTHOR Stevens, Mary Elizabeth TITLE Research and Development in the Computer and Information Sciences. Volume 1 r Information Acquisition, Sensing, and Input: A Selective Literature Review. INSTITUTION National Bureau of Standards (DOC), Washington, D.C. Center for Computer Sciences and Technology. REPORT NO NBS-Monogr-113-Vol-1 PUB DATE Mar 70 NOTE 169p. AVAILABLE FROM Superintendent of Documents, U.S. Government Printing Office, Washington, D.C. 20402 (C 13.44:113/Vol.1, $1.50) EDRS PRICE EDRS Price MF-$0.75 HC Not Available from EDRS. DESCRIPTORS Automation, *Computer Science, Information Networks, *Information Processing, *Information Science, *Literature Reviews, Research, *Telecommunication ABSTRACT The series, of which this is the initial report, is intended to give a selective overview of research and development efforts and requirements in the computer and informationsciences. The operations of information acquisition, sensing, andinput to information processing systems ate considered in generalized terms. Specific topics include but are not limited to:(1) source data automation and remote sensing techniques,(2) communication systems and data transmission links, (3) audio and graphic inputs, (4) preprocessing operations upon input items,(5) character recognition, (6) speech recognition and (7) various other aspects of automatic pattern recognition. Supplemental notes and abibliography of over 640 cited references are included. (Author/NH) UNITED STATES DEPARTMENT OF COMMERCE Maurice H. Stans, Secretary NATIONAL BUREAU OF STANDARDS Lewis M. Branscomb, Director CO tr1 cD Research and Development in the um' Computer and Information Sciences 1. Information Acquisition, Sensing, and Input: A Selective Literature Review Mary Elizabeth Stevens Center for Computer Sciences and Technology National Bureau of Standards Washington, D.C. 20234 "PERMISSION TO REPRODUCE THIS COPY- RIGHTED MATERIAL BY MICROFICHE ONLY HAS BEEN GRANTED BY U.S. DEPARTMENT OF HEALTH.EDUCATION & WELFARE TO ERIC AND ORGANIZATIO OP TING OFFICE OF EDUCATION UNDER AGREEMENTS WITH E U.S. OFFICE THIS DOCUMENT HAS BEEN REPRODUCED OF EDUCATION. FURTHER REPRODUCTION EXACTLY AS RECEIVED FROM THE PERSONOR OUTSIDE THE ERIC SYSTEM REQUIRES PER- ORGAN:ZATION ORIGINATING IT. POINTSOF MISSION OF THE COPYRIGHT OWNER" VIEW OR OPINIONS STATED DO NOT NECES- SARILY REPRESENT OFFICIAL OFFICE OFEDU- CATION POSITION OR POLICY. National Bureau of Standards Monograph 113, Vol. 1 Nat. Bur. Stand. (U.S.), Monogr. 113-1, 170 pages (Mar. 1970) CODEN: NBSMA Issued March 1970 For sale by the Syperintendent of Documents, U.S. Government Printing Office Washington, D.C. 20402 (Order by SD Catalog No. C 13.44:113/Vol. 1), Price $1.50 o .4!luAki .94 Ira Foreword The Center for Computer Sciences and Technology of the National Bureau of Standards has responsibility under the authority of Public Law 89-306 (the Brooks Bill) for automatic data processing standards development, for consultation and technical assistance to Federal agencies, and for supporting research in matters relating to the use of computers in the Federal Government. This selective literature review is one of a series intended to improve interchange of information among those engaged in research and development in the fields of the computer andinformation sciences. Considered in this volume are the specific areas of information acquisition, sensing, and input, including the problems of character and pattern recognition. Names and descriptions of specific proprietary devices and equipment have been included for the convenience of the reader, but completeness in this respect is recognized to be impossible. Certain important developments have remained proprietary or have not been reported in the open literature; thus major contributors to key developments in the field may have been omitted. The omission of any method or device does not necessarily imply that it is considered unsuitable or unsatisfactory, nor does inclusion of descriptive material on commercially available instruments, products, programs, or processes constitute endorsement. LEWIS M. BRANSCOMB, Director HI Contents Page Foreword III 1. Introduction 1 1.1. General introductory considerations 1 1.2. Basic background considerations 2 1.3. A generalized information processing system 3 1.3.1. Automatic production control 3 1.3.2. Character recognition systems 3 1.3.3. An experimental ISSR system in the U.S.S.R 5 1.3.4. Implications of the generalized scheme 7 1.4. Specific background considerations 8 1.4.1. Plan of attack 8 1.4.2. Some general caveats 9 1.5. Scope and coverage of this report 10 2. Information acquisition, source data automation, and remote- data collection 10' 2.1. Basic operations and requirements 10 2.2. Source data automation 12 2.3. Measurements automation and feedback control 13 2.4. Remote sensing and data collection techniques.. 14 3. Communication systems and data transmission links............. 16 3.1. Communication links, services, and systems 16 3.1.1. Voice and digital data transmission 16 3.1.2. Graphic and facsimile transmission 17 3.2. Computer systems and communication networks 18 3.3. Store and forward techniques and delayed response.... 19 3.4. Technical problem areas 19 4. Audio and graphic inputs 21 4.1. Audio signal inputs 21 4.2. Graphic inputs 22 4.2.1. Alphanumeric inputs 22 4.2.2. Special symbol and diagramatic inputs. 23 4.2.3. Graphic inputs and on-line drafting 24 4.2.4. Pictorial and three-dimensional data input. 26 5. Preprocessing operations and pattern recognition 27. 5.1. Preprocessing operations: Image and information enhancement 27 5.2. Property filtering and feature extraction 29 5.3. Optical character recognition 31 5.3.1. Multifont character reading 31 5.3.2. Performance requirements and quality control 33 5.4. Recognition of handprinted characters, handwriting, geometric shapes, and line draw- ings 33 5.4.1. Constrained handprinting 34 5.4.2. Handwritten characters and handwriting. 35 5.4.3. Shapes and drawings 38 5.5. Speech and speaker recognition 38 5.6. Other pattern identification and recognition areas 42 5.7. Some theoretical approaches to pattern recognition and categorization 43 6. Conclusions 50 Appendix A. Background notes on research and development requirements in information acquisition, sensing, and input 53 Appendix B. Bibliography 151 /U/V List of Figures Page Figure1. A generalized information processing system... 4 Figure2. A generalized character recognition system 6 Figure3. Areas of continuing concern, information acquisition 11 Figure4. Areas of concern; information sensing, input, and preprocessing 21 Figure5. Model typescript page at expanded scale 25 Figure6. Two illustrative display formulas 25 Figure7. Image enhancement for noise reduction 28 Figure8. FOSCRIPT decoding criteria 34 Figure9. Patterns for different speakers 42 Figure 10. Fingerprint pattern showing bifurcation_.. an_d_ridge ending 44 Figure 11. Material read by Soviet print reader 56 Figure 12. Automatic preparation of smooth sheet 64 Figure 13. Data processing shared by two computers 80 Figure 14. Dickinson hand-print embellishments 113 Figure 15. FOSDIC alpha-numeric marking document 114 Figure 16. Handwritten numerals, vector crossing technique 116 Figure 17. OpScan document 119 Figure 18. PAPA machine assembly 144 . VI. Information Acquisition, Sensing, and Input: A Selective Literature Review Mary Elizabeth Stevens Center for Computer Sciences and Technology National Bureau of Standards, Washington, D.C. 20234 This report, the first of a projected series on research and development efforts and requirements in the computer and information sciences, is concerned with a selective literature review involving the operations of information acquisition, sensing, and input to information processing systems considered in generalized terms. Specific topics include but are not limited to:source data automation and remote sensing techniques, communication systems and data transmission links, audio and graphic inputs, preprocessing operations upon input items such as image enhancement and property filtering, character recognition, speech recognition, and various other aspects of automatic pattern recognition. Supple- mental notes and a bibliography of over 640 cited references are included. Key words: Audio inputs; automatic pattern recognition; character recognition; communication systems; data transmission; graphic inputs; image enhancement; remote sensing; source data automation; speech recognition. 1. Introduction This report is the first of a series intendedto the Budget Circular A-71 and Public Law 89-306 providea selectiveoverview of research and (the Brooks Bill of 1965), it is not unreasonable to development efforts and requirements in thesome- try to provide a framework for program review and what overlapping fields of the computer and infor- to pinpoint, so far as is practical, specific needs mation sciences and technologies. The projected and difficulties to be noted in selective reviews of series of reports will attempt to outline the probable the literature and other appraisals of the state of range of R & D activities in the computer and in- the art. Such clues may often indicate the desira- formation sciences and technologies through selec- bility of further research and development efforts tive reviews of the literature and to developa whether Federally conducted, Federally sponsored, reasonable consensus with respect to the opinions or independently initiated. of workers