Proposal to Encode the Kaithi Script in Plane 1 of ISO/IEC 10646

Anshuman Pandey
University of Michigan
Ann Arbor, Michigan, U.S.A.
[email protected]
May 21, 2007

Contents

Proposal Summary Form
1 Introduction
2 Characters Proposed
3 Technical Features
4 Background
5 Orthography
6 Regional Variants and Typeface Styles
7 Relationship to Other Scripts
8 References

List of Figures

1 Geo-political extent of the Kaithi script in South Asia
2 Relationship of Kaithi to selected Nagari-based scripts
3 A comparison of the three regional forms of Kaithi
4 A list of Kaithi conjuncts used in the Maithili style of Kaithi
5 Currency, weights, and measures signs used in Kaithi
6 Specimen of hand-written Bhojpuri style of Kaithi
7 Specimen of hand-written Maithili style of Kaithi
8 Specimen of hand-written Magahi style of Kaithi
9 Excerpt from a specimen of Maithili written in the Magahi style of Kaithi
10 Excerpt from a specimen of Awadhi written in Kaithi
11 Excerpt from a specimen of Bengali written in Kaithi
12 A specimen of Magahi printed in Kaithi type
13 A specimen of Maithili printed in Kaithi type
14 A specimen of Bhojpuri printed in Kaithi type
15 Table of the Kaithi script
16 Inventory of Kaithi letters
17 Comparison of numerals of Kaithi and other scripts
18 Folios 1b and 2a from the Mahāgaṇapatistotra in Devanagari and Kaithi
19 Folios 1a and 4a from the Mahāgaṇapatistotra in Devanagari and Kaithi
20 Excerpt from a plaint from the district court of Patna, Bihar
21 Excerpt from a plaint from the district court of Bhagalpur, Bihar
22 Excerpt from a statement from the district court of Ranchi, Bihar
23 Rent receipt from the former Principality of Seraikella
24 Title and first pages of the Book of Genesis in Kaithi type
25 Title and first pages of the New Testament in Kaithi type
26 Entries for the ‘Bihari’ languages in The Book of a Thousand Tongues
27 A folio from the “Ekaḍalā” manuscript of Miragāvatī
28 A folio from the Tale of Sudama
29 A letter to the Supreme Civil Court of Appeals in Calcutta
30 Comparison of Kaithi, Gujarati, and Devanagari types from the Linguistic Survey of India
31 Comparison of hand-written Kaithi and Gujarati letters
32 Comparison of Kaithi and Devanagari
33 A comparison of the Kaithi script with the Devanagari and Mahajani
34 Comparison of Kaithi drawn with the headstroke and Devanagari
35 Comparison of writing techniques in Kaithi and Devanagari
36 Comparison of scripts descended from proto-Bengali
37 Comparison of Kaithi with other scripts used for writing Hindi
38 Comparison of Kaithi with other Indic scripts
39 Comparison of Kaithi with other Indic scripts
40 A family tree of north Indian scripts showing Kaithi as a branch of Nagari
41 The position of the Kaithi script with regard to others

List of Tables

1 Glyph chart for Kaithi
2 Character Names and Properties
3 Comparison of metal and digitized Kaithi fonts
4 A comparison of consonants of Kaithi, Gujarati, Devanagari, and Syloti Nagri
5 A comparison of vowels of Kaithi, Gujarati, Devanagari, and Syloti Nagri
6 A comparison of digits of Kaithi, Gujarati, Devanagari, and Syloti Nagri

ISO/IEC JTC 1/SC 2/WG 2
PROPOSAL SUMMARY FORM TO ACCOMPANY SUBMISSIONS FOR ADDITIONS TO THE REPERTOIRE OF ISO/IEC 10646¹

Please fill all the sections A, B and C below. Please read Principles and Procedures Document (P & P) from http://www.dkuug.dk/JTC1/SC2/WG2/docs/principles.html for guidelines and details before filling this form. Please ensure you are using the latest Form from http://www.dkuug.dk/JTC1/SC2/WG2/docs/summaryform.html. See also http://www.dkuug.dk/JTC1/SC2/WG2/docs/roadmaps.html for latest Roadmaps.

A. Administrative

1. Title: Proposal to Encode the Kaithi Script in Plane 1 of ISO/IEC 10646
2. Requester’s name: University of California, Berkeley Script Encoding Initiative (Universal Scripts Project); author: Anshuman Pandey ([email protected])
3. Requester type (Member Body/Liaison/Individual contribution): Liaison contribution
4. Submission date: May 21, 2007
5. Requester’s reference (if applicable): N/A
6. Choose one of the following:
   (a) This is a complete proposal: Yes
   (b) or, More information will be provided later: No

B. Technical - General

1. Choose one of the following:
   (a) This proposal is for a new script (set of characters): Yes
       i. Proposed name of script: Kaithi
   (b) The proposal is for addition of character(s) to an existing block: No
       i. Name of the existing block: N/A
2. Number of characters in proposal: 73
3. Proposed category: C - Major extinct
4. Is a repertoire including character names provided?: Yes
   (a) If Yes, are the names in accordance with the “character naming guidelines” in Annex L of P&P document?: Yes
   (b) Are the character shapes attached in a legible form suitable for review?: Yes
5. Who will provide the appropriate computerized font (ordered preference: True Type, or PostScript format) for publishing the standard?: Anshuman Pandey; True Type format
   (a) If available now, identify source(s) for the font and indicate the tools used: The font contains normalized forms of letters found in hand-written and printed Kaithi documents. It was drawn with Metafont and converted to True Type with FontForge.
6. References:
   (a) Are references (to other character sets, dictionaries, descriptive texts etc.) provided?: Yes
   (b) Are published examples of use (such as samples from newspapers, magazines, or other sources) of proposed characters attached?: Yes
7. Special encoding issues:
   (a) Does the proposal address other aspects of character data processing (if applicable) such as input, presentation, sorting, searching, indexing, transliteration etc. (if yes please enclose information)? Yes; see proposal for additional details.
8. Additional Information: Submitters are invited to provide any additional information about Properties of the proposed Character(s) or Script that will assist in correct understanding of and correct linguistic processing of the proposed character(s) or script. Examples of such properties are: Casing information, Numeric information, Currency information, Display behaviour information such as line breaks, widths etc., Combining behaviour, Spacing behaviour, Directional behaviour, Default Collation behaviour, relevance in Mark Up contexts, Compatibility equivalence and other Unicode normalization related information. See the Unicode standard at http://www.unicode.org for such information on other scripts. Also see http://www.unicode.org/Public/UNIDATA/UCD.html and associated Unicode Technical Reports for information needed for consideration by the Unicode Technical Committee for inclusion in the Unicode Standard. Character properties and numeric information are included.

¹ Form number: N3102-F (Original 1994-10-14; Revised 1995-01, 1995-04, 1996-04, 1996-08, 1999-03, 2001-05, 2001-09, 2003-11, 2005-01, 2005-09, 2005-10, 2007-03)

C. Technical - Justification

1. Has this proposal for addition of character(s) been submitted before?: No
2. Has contact been made to members of the user community (for example: National Body, user groups of the script or characters, other experts, etc.)? No
   (a) If Yes, with whom?: N/A
       i. If Yes, available relevant documents: N/A
3. Information on the user community for the proposed characters (for example: size, demographics, information technology use, or publishing use) is included? Yes
   (a) Reference: Awadhi, Bhojpuri, Magahi, and Maithili speakers; as well as linguists, historians, legal scholars working with sources from colonial South Asia.
4. The context of use for the proposed characters (type of use; common or rare): Common
   (a) Reference: Court records from colonial India, pedagogical materials from north India, commercial and accounting records; religious and literary texts; bibles printed in north India during the 19th and early 20th century. Other contexts discussed at length in the text of the proposal.
5. Are the proposed characters in current use by the user community?: Yes, by scholars working in fields enumerated above. It is difficult to verify whether the script is presently in active use in India.
   (a) If Yes, where? Reference: In India, the United States, and other localities.
6. After giving due considerations to the principles in the P&P document must the proposed characters be entirely in the BMP?: No
   (a) If Yes, is a rationale provided?: N/A
       i. If Yes, reference: N/A
7. Should the proposed characters be kept together in a contiguous range (rather than being scattered)? Yes
8. Can any of the proposed characters be considered a presentation form of an existing character or character sequence? No
   (a) If Yes, is a rationale for its inclusion provided?: N/A
       i. If Yes, reference:
