A Abjad, 430, 891 Abugida, 430, 891 Academic Systems, 429, 448

Index A Anchor points, 717 Abjad, 430, 891 Angular radial transform (ART) descriptors, Abugida, 430, 891 526, 530, 532 Academic systems, 429, 448 Animated, 625, 630, 633, 636, 637 Accidental forgery, 932 Anisotropic Gaussian, 269–271 Accuracy, 44, 50, 51, 54, 1023, 1024, ANN. See Artificial neural networks (ANN) 1027, 1033 Annotation, 630, 639, 966–968, 976, 983–1002 Acid-free paper, 15 Application domains, 182, 198 Acquisition, 11–60, 984–986, 989–991, Approximate NN search, 170 993, 996 APTI, 450 Action plan, 919, 921, 926, 929, 930 Arabic AdaBoost, 862, 865, 866 alphabet, 430, 434–436, 449 Adaptive local connectivity map, 476 extension, 435 Address block(s), 715, 717, 720, 721, 723, 724 letters, 432, 435, 436 Address block location, 720, 723–724, 727 OCR, 450 Address database, 715, 720, 729, 730, 744 writing styles, 437–438 Address interpretation, 723 writing system, 432 Address recognition systems, 709, 710, Arabic and Persian signatures, 930 720–723, 732, 733, 744 Arabic and Syriac are cursive, 428 Adjacency Arabic recognition approaches, 446 grammars, 528, 537 Aramaic letters, 433 matrices, 541, 544 Arc detection, 505, 511, 512 Affine covariant, 629 Architectural drawings, 493, 495, 511–513 AHARONI, 440 Area under the curve (AUC), 931 Algebraic invariants, 617 Area Voronoi diagram, 146, 147, 156, 157, Allograph, 303, 304 161, 163 Alphabet(s), 7–9, 303, 304, 306, 307, 312, Arial, 323 891, 892, 896, 897, 902, 904, 906, Arrowheads, 493, 512, 513 908, 913 Artificial intelligence, 332 Alphabetic class, 429 Artificial neural networks (ANN), 617, 632, Alphabetic fields, 719 636, 821, 835 Ambiguity, 680 Aruspix, 753, 768, 771 Analysis of Invoices, 184 Aryan language, 304 Analysis System to Interpret Areas in Ascenders and descenders, 325, 442–444, 810, Single-sided Letters (ANASTASIL), 812, 814 181, 182, 207, 208 ASCII characters, 817 Analytical word recognition, 725 Asian scripts, 460–462, 471, 475, 483 D. Doermann, K. Tombre (eds.), Handbook of Document Image 1037 Processing and Recognition, DOI 10.1007/978-0-85729-859-1, © Springer-Verlag London 2014 1038 Index Aspect, 279, 284 Bidirectional long short-term memory Assamese, 301, 304 (BLSTM), 822, 823 Assessment Bi-gram probability, 410, 411, 415 function, 1029–1032 Bi-lingual script, 314, 316 methods, 1031 Billboards, 629, 640 Associative graphs, 527, 536–538 Binarization, 44, 49, 54, 189, 262, 275, 335, Assyrian script, 439 336, 338–339, 355, 466, 475–477, Attributed grammars, 695 716, 814, 984, 986, 989, 990, Attributed relational graph (ARG), 904 992–994, 1018–1022, 1024, 1033 Attributive symbols, 750 algorithms, 716 Automatic document processing, 614 of handprinted text, 367 Automatic letter sorting machines, 709 skew correction, and noise, 755 Automatic number plate recognition Binary (ANPR), 846 classifier, 738 images, 716 mask, 633 B Bipartite graph, 540 Background analysis method, 144–146, 148, Black-and-white document, 144, 147 149, 158, 164 Bleed-through, 49, 95, 97, 99 Backtracking, 148 Blind attacker, 932 Bag-of-features, 871, 873, 874 Blob noise, 621 Bag-of-words, 618, 620, 627, 629 Block(s), 752 Bangla, 301, 305, 306, 308, 309, 315, 316, Block adjacency graphs, 479 321, 326, 461, 465 Block overlay phenomena, 778 Bank check(s), 363 Blur, 46, 48, 53, 54, 57, 59, 573, 844, 853, Bank check recognition, 67 856, 857 Banners, 640 Blurring, 623, 850, 854, 856–857 Bar codes, 708, 709, 723 Boldface, 322, 325 Bar lines, 751, 761, 767, 769 BongNet, 902, 903, 910 Bar units, 752 Book Baselines, 260, 263, 265, 266, 275–276, binding, 52, 57–59 287, 684, 686–688, 691, 692, 696, scanners, 846 810–815 Boosting methods, 540, 542, 543 Baselines extraction, 442, 686, 687, 691, 692 Borda count, 607 Base-region points, 311 Border removal, 102, 103 Baum-Welch algorithm, 350 Born digital documents, 687, 690 Bayesian combination rules, 739 Bottom-up strategies, 137, 144, 148, 149, 155, Bayesian network, 902, 903 157, 565, 566, 571, 722, 723, 928, BBN, 335, 354, 355 940, 941 Behavioural characteristics, 918 Bounding box, 367, 377, 382 Belga Logos, 632, 640 projections, 761, 764 Bengali Boxing systems, 723 alphabet, 304 Brain stroke risk factors, 941 script, 303, 304 Branch-and-bound, 628, 632 Bezier curve, 893 Brightness, 35, 37, 40, 43, 44 Bibliographic Broadcasting industry, 623 citation, 209 Brodatz textures, 314 metadata, 209 Brute-force attacker, 932 Biblio system, 196, 213 B-splines, 125, 126 Bidimensional Burmese, 314, 318 grammars, 513 Business Letter, 181, 182, 185, 186, 192, 197, patterns, 514 199, 204–210, 216 Index 1039 C Chemical drawing recognition, 973–974 C4.5 algorithm, 616 Chinese, 296, 305, 307–310, 312–316, Calligraphic 318–321, 325 characters, 852 Chinese Academy of Sciences (CASIA), 465 interfaces, 951 Chinese and Japanese signatures, 930 Calliope, 20 Chinese handwriting recognition competition, Camera(s), 44–47, 52, 59 463, 465 Camera-based OCR, 844–850, 860, 861, Chinese, Japanese, and Korean (CJK) 872, 874 ideographic character, 462 Camera-captured, 546 scripts, 460–463, 465–474 Cancellable biometrics, 931 Chrominance, 628 Candidate(s), 274, 280, 281, 284 CIDK. See Class independent domain Candidate staff-line segments, 757 knowledge (CIDK) Canonical structure, 462 City name recognition, 720, 722, Caption texts, 489, 848, 854, 855, 862, 725–727 866, 872 Class dependent domain knowledge Carbon paper, 28, 31 (CDDK), 197, 204 Car license plates, 846 Classes of forgeries, 932 Case-based reasoning (CBR), 197, 211 Classification, 463, 470–473, 478, 479, CBIR. See Content-based image retrieval 482–484, 535, 538–543, 545, (CBIR) 760–764, 766–769 CCH. See Co-occurrence histograms (CCH) Classification units, 478, 479 CDDK. See Class dependent domain Classifiers, 279, 281, 282, 284–286, knowledge (CDDK) 446–449, 451 Census forms, 363 Classifiers combination, 383, 689, 692 Chain Codes, 120–123, 369, 377, 384, 525, Class independent domain knowledge (CIDK), 527, 534 197, 207 Character Client-entropy measure, 930 forms, 363 Cluttered background, 623 recognition, 331–356, 359–385, 427–453, CNN. See Convolutional neural network 459–484, 524, 525, 534, 707, 718, (CNN) 725, 729, 737, 739, 1013, 1025, Coding scheme, 809, 817, 826, 827 1026, 1033 Cognitive psychology, 195 segmentation, 466, 467, 469, 807, 811, Color, 18–20, 28–30, 33, 34, 36–38, 40, 44, 824, 830, 834 46, 47, 50, 598–600, 603, 605–610, Character error rate (CER), 1026 612–615, 617, 623, 629, 631, 635, Character recognition rate (CRR), 1026 848, 849, 851, 853, 862–864, 868 Character Shape Coding, 807, 809–811, clustering, 147, 164, 165, 626, 864, 817–818, 824, 826–828 867, 868 Charged couple devices (CCDs), 39 descriptors, 628 Chart analysis, 998 naming, 604, 611, 638 CHAYIM, 440 printing, 30, 34–36 Check spaces, 625–628, 631 forms, 731, 734–736 Color clustering, dither, 164 models, 735 Combination rules, 934 processing, 705–745 Command parameter, 921, 930 readers, 710 Commercial Check 21, 744 language, 301 Check recognition applications, 708–711, OCRs, 452 714–716, 718, 731, 735, 741, 745 systems, 429, 448, 450 Check recognition system, 707, 708, 712, 731, Committee of experts, 716, 719 733, 743, 744 Comparative study, 759, 762 1040 Index Comparison, 1012–1016, 1018, 1022, 1024, Correlation filter, 545 1029, 1031, 1032 Cost value, 1028 Competitions, 759, 1013, 1022, 1028, Courtesy amount recognition, 710, 711, 719, 1032–1034 731, 733–741 datasets, 448 Creole languages, 295 Complementary metal-oxide silicon (CMOS), Critter, 323 39, 40 Cross-correlation, 100, 102, 114, 118 Complete graphic object recognition, Cross-related attributes, 194, 195 963–964, 976 Cryptographic key generation, 931 Complex background, 844, 851, 861 Cryptography, 303 Complexity of a signature, 929 Cube search, 906 Compression, 46, 48–50 Cultural habits, 930 Compression artifacts, 623 Currency sign, 731, 736, 738 Computational complexity, 714, 727 Cursive script, 890, 892, 896, 902, 906–908 Computer language, 295 Curvature, 317, 534, 535 Concavity features, 343 Curvature Scale Space (CSS), 601 Conditional random field (CRF), 870 Curvilinear, 260, 265, 266, 268 Condorcet method, 607, 609 Cyrillic, 305, 314, 316, 318, 319, 321 Confidence scores, 715, 716, 718 Confidence values, 719, 731, 734, 739–741 Conjunct consonants, 477 D Connected components Damaged characters, 74, 95, 97, 99, 101, 127 analysis, 85, 99, 101, 102, 496, 498 Data acquisition devices, 922 labeling, 74–76, 100, 102 Data-driven approaches, 214 Connected components based method, Data reduction, 893, 895 148, 149 techniques, 928 Connectionist Temporal Classification (CTC), Dataset(s), 448–451, 453, 596, 602, 610, 612, 821, 822 613, 620, 631, 632, 636, 639–641, Consistency checking, 579 983–1002 Consistency model, 924 Data variability, 583 Consonant, 462–464, 477, 478 DAVOS, 182, 209 Constraints, 762, 764, 765 DCT, 863, 868 Content 3-D document shape, 123, 126 ownership, 623 Decision stream, 779, 781 making, 719–720, 733, 736, 741 Content-based image retrieval (CBIR), strategy, 739 544, 545 Decorative characters, 872–874 Content-based video retrieval, 848, 849 Defects, 46, 48, 51, 53, 57 Contest, 1013, 1018, 1032–1033 Deformations, 988 Context/contextual, 680, 681, 683, 685, 691, Degradation, 68, 260, 282, 984, 986, 988, 991, 693–695, 862, 864, 873, 919, 994, 998 924, 931 Degraded document, 812, 817, 824 information, 627, 638, 690, 696, 752, Delaunay triangulation, 146, 149, 761–765, 770 155–158, 213 knowledge, 568, 571, 582 Delayed strokes, 890, 891, 894, 896, Contours 901, 905 extraction, 816 Delta features, 895 tracking, 756–760, 767 Denoising, 953, 954, 975 Contrast, 40, 43, 44, 47–50 Descenders,

A Abjad, 430, 891 Abugida, 430, 891 Academic Systems, 429, 448

Yiddish Diction in Singing

Similarities and Dissimilarities of English and Arabic Alphabets in Phonetic and Phonology: a Comparative Study

The Ogham-Runes and El-Mushajjar

Schwa Deletion: Investigating Improved Approach for Text-To-IPA System for Shiri Guru Granth Sahib

Assessment of Options for Handling Full Unicode Character Encodings in MARC21 a Study for the Library of Congress

Arabic Alphabet - Wikipedia, the Free Encyclopedia Arabic Alphabet from Wikipedia, the Free Encyclopedia

A Comparative Study of Shan and Standard Thai Morphology

Proposal for Ethiopic Script Root Zone LGR

Finite-State Script Normalization and Processing Utilities: the Nisaba Brahmic Library

The Gentics of Civilization: an Empirical Classification of Civilizations Based on Writing Systems

ISO/IEC JTC1/SC2/WG2 N 2029 Date: 1999-05-29

1 Working Draft for Supporting Blueberry Revision of XML