Bull AIX 5L National Language Support Guide and Reference

Bull AIX 5L National Language Support Guide and Reference AIX ORDER REFERENCE 86 A2 68EM 02 Bull AIX 5L National Language Support Guide and Reference AIX Software October 2005 BULL CEDOC 357 AVENUE PATTON B.P.20845 49008 ANGERS CEDEX 01 FRANCE ORDER REFERENCE 86 A2 68EM 02 The following copyright notice protects this book under the Copyright laws of the United States of America and other countries which prohibit such actions as, but not limited to, copying, distributing, modifying, and making derivative works. Copyright Bull S.A. 1992, 2005 Printed in France Suggestions and criticisms concerning the form, content, and presentation of this book are invited. A form is provided at the end of this book for this purpose. To order additional copies of this book or other Bull Technical Publications, you are invited to use the Ordering Form also provided at the end of this book. Trademarks and Acknowledgements We acknowledge the right of proprietors of trademarks mentioned in this book. AIXR is a registered trademark of International Business Machines Corporation, and is being used under licence. UNIX is a registered trademark in the United States of America and other countries licensed exclusively through the Open Group. Linux is a registered trademark of Linus Torvalds. The information in this document is subject to change without notice. Bull will not be liable for errors contained herein, or for incidental or consequential damages in connection with the use of this material. Contents About This Book . vii Highlighting . vii Case-Sensitivity in AIX . vii ISO 9000 . vii Related Publications . vii Chapter 1. National Language Support Overview . .1 Separation of Messages from Programs . .1 Conversion between Code Sets . .1 Input Method Support . .2 Converters Overview . .2 Using the Message Facility . .2 Setting National Language Support for Devices . .3 Changing the Language Environment . .4 Changing the Default Keyboard Map . .5 ICU4C Libraries . .5 Chapter 2. Locales . .7 Understanding Locale . .7 Understanding Locale Categories . .8 Understanding Locale Environment Variables . .9 Understanding the Character Set Description (charmap) Source File . .11 Understanding the Locale Definition Source File . .11 Multibyte Subroutines . .12 Wide Character Subroutines . .12 Bidirectionality and Character Shaping . .12 Code Set Independence . .12 File Name Matching . .13 Radix Character Handling . .13 Programming Model . .14 Chapter 3. Subroutines for National Language Support . .15 Locale Subroutines . .15 Time Formatting Subroutines . .20 Monetary Formatting Subroutines . .21 Multibyte and Wide Character Subroutines . .23 Internationalized Regular Expression Subroutines . .45 Related Information . .48 Chapter 4. Code Sets for National Language Support . .49 Single-Byte and Multibyte Code Sets . .50 Unique Code-Point Range . .50 Data Representation . .51 Character Properties . .52 ASCII Characters . .53 Code Set Strategy . .55 Code Set Structure . .55 ISO Code Sets . .57 IBM PC Code Sets . .70 UCS-2 and UTF-8 . .81 Related Information . .83 Chapter 5. Converters Overview for Programming . .85 © Copyright IBM Corp. 2002, 2005 iii Standard Converters . .85 Using the iconv Command . .86 Understanding libiconv . .86 Using Converters . .89 List of Converters . .91 Writing Converters Using the iconv Interface . .113 Related Information . 123 Chapter 6. Input Methods . 125 Input Method Introduction . 125 Input Method Names . 126 Input Method Areas . 126 Input Method Command . 127 Programming Input Methods . 127 Working with Keyboard Mapping . 129 Using Callbacks . 130 Bidirectional Input Method . 133 Cyrillic Input Method (CIM) . 134 Greek Input Method (GIM) . 135 Japanese Input Method (JIM) . 137 Korean Input Method (KIM) . 143 Latvian Input Method (LVIM) . 144 Lithuanian Input Method (LTIM) . 144 Thai Input Method (THIM) . 144 Vietnamese Input Method . 145 Simplified Chinese Input Method (ZIM-UCS) . 145 Single-Byte Input Method . 146 Traditional Chinese Input Method (TIM) . 148 Universal Input Method . 149 Reserved Keysyms . 150 Related Information . 151 Chapter 7. Message Facility . 153 Creating a Message Source File . 153 Creating a Message Catalog . 157 Displaying Messages outside of an Application Program . 159 Displaying Messages with an Application Program . 159 Example of Retrieving a Message from a Catalog . 161 Writing Messages . 161 Chapter 8. Culture-Specific Data Handling . 165 Culture-Specific Tables . 165 Culture-Specific Algorithms . 165 Example of Loading a Culture-Specific Module for Arabic Text for an Application . 165 Layout (Bidirectional Text and Character Shaping) Overview . 167 Appendix A. Supported languages and locales . 171 Appendix B. National Language Support (NLS) Reference . 183 National Language Support Checklist . 183 List of National Language Support Subroutines . 188 Appendix C. Character Maps . 193 ISO Code Sets . 193 IBM Code Sets . 214 iv National Language Support Guide and Reference Appendix D. NLS Sample Program . 233 Message Source File for foo . 233 Creation of Message Header File for foo . 233 Single Source, Single Path Code-set Independent Version . 233 Single Source, Dual-Path Version Optimized for Single-Byte Code Sets . 235 Appendix E. Use of the libcur Package . 239 Index . 243 Contents v vi National Language Support Guide and Reference About This Book This book provides application programmers with complete information about enabling applications for ® national language support for the AIX operating system. It also provides system administrators with complete information about enabling networked environments for national language support for the AIX operating system. Programmers and system administrators can use this book to gain knowledge.

Bull AIX 5L National Language Support Guide and Reference

AIX Globalization

Tru64unix Technicalreferenceforusingchinese Features

Supporting the Chinese Language in Oracle Text

Ultim T H a E E T Tools for Exploring Asian Languages

What Is an IME (Input Method Editor) and How Do I Use It ?

Traditional Chinese Solaris User's Guide

Chinese Information Processing

Character Data Representation Architecture (CDRA)

Tru64 UNIX Technical Reference for Using Chinese Features

Introduction to I18n

Chinese Character Components, Radicals and Phonetics

Simplified Chinese Solaris User's Guide