Journal of East Asian Libraries
Volume 2006 Number 138 Article 17
2-1-2006
No. 138 Journal of East Asian Libraries
Journal of East Asian Libraries
Follow this and additional works at: https://scholarsarchive.byu.edu/jeal
BYU ScholarsArchive Citation Libraries, Journal of East Asian (2006) "No. 138 Journal of East Asian Libraries," Journal of East Asian Libraries: Vol. 2006 : No. 138 , Article 17. Available at: https://scholarsarchive.byu.edu/jeal/vol2006/iss138/17
This Full Issue is brought to you for free and open access by the Journals at BYU ScholarsArchive. It has been accepted for inclusion in Journal of East Asian Libraries by an authorized editor of BYU ScholarsArchive. For more information, please contact [email protected], [email protected]. TABLE OF CONTENTS
Number 138 February 2006
From the President i
Articles
Mi Chu Wiens World Digital Library and E-Resources in the Asian Division, Library of Congress 1
Akira Miyazawa Ideograph Variant Forms and Usage Control in NACSIS-CAT 5
Sarah Jeong Hanbok, Korean Traditional Dress: A Selected Annotated Bibliography 9
Patrick Lo and Owen Tam To Extend Functionalities of WebPAC by Developing the Library Online Catalogue into a Library Resources Portal—the Lingnan University Experience 17
Wen-ling Liu The 2005 Tokyo International Book Fair 45
Reports
OCLC Dialog Forum for Chinese Digital Content 48
Report of the Meeting of the North American Coordinating Council on Japanese Library Resources, September 16-17, 2005 50
CEAL Statistical Report 2004-2005 58
Committee Activities 86
New Appointments 107
Retirements 109
Vacancy Announcements 110
Indexes 113
Journal of East Asian Libraries, No. 138, February 2006
January 30, 2006
FROM THE PRESIDENT
Dear Members and Friends of CEAL:
It has been an honor to have served as your CEAL President for the years 2003-2006 alongside such talented and dedicated members as yourselves and those of the Executive Board. This issue of the Journal of East Asian Libraries (JEAL) is the last issue for which I will be writing the introductory message, as I will be proudly turning over the CEAL presidential duties to our new President, Philip Melzer, at the 2006 Plenary Session in San Francisco. Please join me in welcoming our new President.
Since last summer, one of the major activities of CEAL has been planning the 2006 IFLA Seoul Pre- conference Meeting scheduled for Friday, August 18, 2006. The Special Committee for 2006 IFLA Seoul has been working extremely hard planning this event. Your participation in this as well as the 72nd IFLA General Conference and Council is highly encouraged. I would like to thank Joy Kim and Philip Melzer, Co-Chairs of the Special Committee, and members of the Special Committee for their hard work. In addition, I would like to thank Shi Deng, Chair of the Subcommittee on Review and Comment on Resource Description and Access (RDA) Draft, as well as members and advisors of the subcommittee, for their dedication to reviewing and commenting on the RDA draft.
There is much anticipation and excitement surrounding the upcoming San Francisco meeting as it will be marked with distinction. The pre-conference SCCTP Advanced Serials Cataloging workshop, sponsored by the Committee on Technical Processing, will take place at UC Berkeley. In addition to the Plenary Session and committee meetings, there are various round tables, consortiums, library tours, and receptions. Each of the CEAL committees will have special programs to share with us all.
This issue of our journal includes five articles: World Digital Library and E-Resources in the Asian Division, Library of Congress; Ideograph Variant Forms and Usage Control in NACSIS-CAT; Hanbok, Korean Traditional Dress: A Selected Annotated Bibliography; To Extend Functionalities of WebPAC by Developing the Library Online Catalogue into a Library Resources Portal—the Lingnan University Experience; and The 2005 Tokyo International Book Fair. Also in this issue are the CEAL Statistical Report 2004-2005, OCLC Dialog Forum for Chinese Digital Content Report, and NCC September 16-17, 2005 meeting report, plus committee activities, new appointments, and retirements.
As the 2006 CEAL Meeting approaches, I want to thank Nominating Committee members for their hard work in identifying and encouraging new faces to serve on the CEAL Executive Board. I also would like to thank outgoing Executive Board members for their support, dedication, and contributions during the last few years. I will always remember and appreciate their work to help our organization succeed. I am sure that 2006, the Year of the Dog, will be another exciting, challenging, and rewarding year for us all!
Sincerely,
Abraham J. Yu, President Council on East Asian Libraries (CEAL)
i Journal of East Asian Libraries, No. 138, February 2006
WORLD DIGITAL LIBRARY AND E-RESOURCES IN THE ASIAN DIVISION, LIBRARY OF CONGRESS
Mi Chu Wiens Asian Division, Library of Congress
On November 22, 2005, in an article in the Washington Post, the Librarian of Congress, Dr. James H. Billington, announced a great initiative, proposing the cooperative building of a World Digital Library, which would offer “the promise of bringing people closer together by celebrating the depth and uniqueness of different cultures in a single global understanding.”1 The Library has had fifteen years of experience with digital activity and in the past five years has launched various digitization projects with other countries. Within the framework of the Library, the Asian Division has also made great strides in its efforts, illuminated by three recent successful cooperative agreements.
On October 1, 2004, the Library and Academia Sinica (Taipei, Taiwan) signed an agreement for the digitization of China-related maps. The digital experts from Academia Sinica Computing Center paid two working visits in 2004 and 2005, created digital images of 21,000 maps and 840 aerial photographs, and took them back to Taiwan for post-digitization processing. This digital file will be incorporated into the Chinese Civilizations in Time and Space (CCTS) database and supported by a historical geographic information system with spatial-temporal applications.
Prior to the Library-Academia Sinica collaboration, the Asian Division invited Prof. Li Xiaocong of Beijing University to review and provide descriptions for three hundred pre-1900 Chinese maps, most of which were brought to the Library by Dr. Arthur W. Hummel, the first Chief of the Asian Division (then named the Department of Chinese Literature). During his four-month stay in 2002, Prof. Li examined in detail Chinese manuscripts and woodblock printed maps, including those in scrolls, on fans, and rubbings of maps on stone, dating from the 12th century. Professor Li’s work resulted in the publication of a bilingual edition (Chinese and English), with illustrations, entitled Descriptive Catalogue of the Traditional Chinese Maps in the Library of Congress (Beijing, Wen wu Publishing Co., 2004).
Another achievement was the international cooperation between the Library and National Central Library (Taipei, Taiwan) in the digitization of the Library’s most valuable Chinese rare books. The origins of the Chinese collection go back to 1869, when the Library received ten works in 905 volumes from the Tongzhi Emperor of China (reigned 1856-1875), as part of an exchange authorized by the Congress two years earlier. During the decade after these books arrived in Washington, the Library acquired another 237 titles from Caleb Cushing (1800-1879), the first U.S. minister to China, who purchased Chinese books in 1844 when he negotiated the first U.S. treaty with China. The collection quickly grew in the early twentieth century with the acquisition of some 6,000 volumes in 1901-1902 from William W. Rockhill (1854-1914), the American diplomat and China specialist, and a gift of 198 works in 1,965 volumes from the Chinese government at the conclusion of the Louisiana Purchase Exposition in 1904. The Library also purchased collections from Dr. Feng Jinggui in 1913, Rockhill again in 1915, and Berthold Laufer (1874-1934) in 1928. Returning after thirteen years as a missionary teacher in China, Arthur Hummel (1884-1975), as the first Chief of the Asian Division, made major efforts during his tenure of twenty-six years (1928-1954) to acquire rare Chinese books and presided over the growth of Chinese collection to its world-class status.
The agreement between the Division and the National Central Library (Taipei, Taiwan) on the digitization of the Division’s Chinese rare books was signed in May 2005. Since then a team of technical specialists from Taiwan has been working on site to digitize selected titles, which are being reviewed by two Chinese rare book experts, Dr. Poon Ming-sun and Mr. Fan Bangjin, for authentication and annotation regarding the condition of these rare books. Funded by Chiang Ching-kuo Foundation and by the Library’s Conservation
1 Journal of East Asian Libraries, No. 138, February 2006
Division, the goal of this collaborative project is for the two libraries to share each other’s digitized databases of Chinese rare books and make them easily and freely accessible to researchers worldwide.
Most recently, in December 2005, the Asian Division acquired exclusive use of the Geographic Information Systems (GIS) of Chinese Civilization in Time and Space (CCTS) and Taiwan History and Culture in Time and Space (THCTS), which will give researchers a tremendously useful online tool in locating geographic areas of China and their related local histories and links to other databases. The Division has requested permission to gain access to some twenty-two other databases created by Academia Sinica, mostly in the humanities and social sciences.
The Asian Division has also participated in digital activity within the Library. In June 2004 the Naxi Manuscript Collection website,2 the first Area Studies LC-only Global Gateway collection, was released. Dr. Mi Chu Wiens of the Asian Division started the project in 1998 with a grant from the Chiang Ching-kuo Foundation for three years (1998-2001.) The collection is made up of documents written by Naxi shamanistic priests in what is known as the only living pictographic language in the world These manuscripts, which detail the unique cosmology of the Naxi people of the Yunnan Province, China, illustrate a wide range of Naxi myths and legends, such as the creation of the world, sacrifice to the serpent king and other principle gods, accounts of Naxi warriors and other people of high social standing ascending to the realm of deities, and love-suicide stories. An annotated catalog of the Library’s entire Naxi collection, a 39½-foot scroll used in funerary ceremonies, and translations by Joseph Rock of two of the manuscripts are also found on the Naxi Manuscript Collection website.
In close collaboration with the International Research Center of Japanese Studies (Nichibunken), the Asian Division digitized the Library’s Nara Ehon collection of four titles (Shizuka, Homyo Doji, Shigure and Soga Monogatari) and made available over 2,000 digital images, including 173 color illustrations, on the Library’s OPAC. Nara Ehon, a type of colorfully illustrated manuscript book of stories and tales, were produced during the Muromachi period (1333-1573) up to the middle of the Edo period (1615-1868.) The Nara Ehon books are considered to be the earliest popular illustrated books in Japan.
The Asian Division is also currently digitizing The Tale of Genji, a set of sixty-volumes. This is the second collection in the Asian Division to be digitized in collaboration with Nichibunken. The Library’s holding is a rare and complete set of the 1654 edition including not only the main text of 54 volumes of Genji Monogatari [Tale of Genji], but also Meyasu (a commentary on keywords and phrases in the main text) in three volumes, Keizu (genealogy) in one volume, Yamaji no Tsuyu (a sequel to the main text) in one volume and Hikiuta (index) in one volume.
The Japanese Ukiyo-e collection in the Prints and Photographs Division also benefited from the collaboration with Nichibunken. Under the agreement signed by the Library and Nichibunken in February 2005, a multi-institutional team of Japanese art historians consisting of Professor Shugo Asano (Chiba City Museum of Art), Professor Monta Hayakawa (Nichibunken), Ms. Shuko Koyama (Edo-Tokyo Museum), and Professor Juliann Wolfgram (California Technological Institute) led by Professor Atsushi Aiba (Nichibunken) visited the Library to study 2,331 Japanese prints mainly from the Edo-period. The team identified the prints by artist, date, title, series title for sets of prints, format, size, and other descriptive details of all these Japanese prints during their three-week stay. The Library has already finished the scanning of the Ukiyo-e prints collection with the financial support from Nichibunken. The product of this large-scale collaboration between Nichibunken and its multi-institutional scholarly team and the Library will be made fully available to the international research community through the Library of Congress and Nichibunken websites in 2006.
June 2005 in cooperation with the Japan Map Center of the Geographical Survey Institute in Japan, the Geography and Map Division digitized and made available a collection of large-scale Japanese maps (1816- 1819) by Inoh Tadataka via the Library’s webpage http://www.loc.gov/rr/geogmap/. This collection is the largest and most complete holdings of Inoh maps (207 out of 214) in the world, including Japan.
2 Journal of East Asian Libraries, No. 138, February 2006
The rapid transformation of the digital environment and the ever-increasing number of electronic resources available have given great impetus to the Division and its staff to strive for identifying and acquiring e- resources on Asia as part of their standard responsibilities for collection development.
At present forty-five of the Library’s 219 subscription databases deal with Asian studies. These databases, such as Aardvark (Asian resources for librarians), Asian Development Bank Economics & Statistics (issued by the Economics and Research Department Asian Development Bank), Asian Law Bibliographic Database (from the University of Melbourne), Bibliography of Asian Studies (from Association for Asian Studies), Country Studies (produced by Federal Research Division, the Library of Congress), and Treaties and International Agreements (by Oceana Publications), are all accessible in the Library’s reading rooms. The Library also has a number of electronic databases in various divisions and departments provided free-of-charge.
In recent years Asian Division has made great strides in acquiring databases on Asia from outside of the Library. In the China area we have acquired Ren min ri bao dian zi ban 人民日报电子版 = The People’s Daily Electronic Version, 1946-2004, China Data Online, China National Knowledge Infrastructure (CNKI)’s CAJ and CCND databases, and most recently we gained access to a 100,000-title monograph database from Superstar Digital Library. We have also acquired Si ku quan shu (Wen yuan ge edition) and Encyclopedia of Taiwan.
The Library has also acquired two important subscription databases for the Japanese collection: Directory of Japanese Scientific Periodicals (from the National Diet Library, covering 13,875 serial titles on science and technology published in Japan) and Kodansha Encyclopedia of Japan (based on the 1993 publication entitled Japan: an Illustrated Encyclopedia, with 11,000 entries). In addition, the Library currently subscribes to full-text databases of five major Japanese newspapers including Mainichi, Sankei, Chunichi, Tokyo and Yomiuri newspapers through a U.S. based commercial aggregator/vendor, Factive.com.
In the Korea area recent acquisitions were Chosun Daily Newspaper Archive (covering articles from the Chosun Ilbo, the most widely read newspaper in South Korea), Korean Studies Database (by KRPIA, covering history, literature and traditional medicines), Korean Studies Information Service System (KISS, a database of full-text articles from 6000 journals published by 1200 Korean academic institutions), and Law n B- Korean Law Database (Law and business, available in Law Library reading room only).
A recent addition is Library of Congress Asian Collections: an Illustrated Guide,3 the electronic version of a 2000 publication. Other databases include Korean Bibliography,4 which contains approximately 4,800 records of books about Korea in English up to 1995 held by the Library of Congress, and Korean Serials,5 which includes 6,325 periodical titles, including 177 titles from North Korea. Within the framework of the Library’s Portals to the World, Asian Division provides Asian Portals6 with links to electronic resources from forty countries and areas of Asia.
Other divisions of the Library, such as the Federal Research Division, Geography & Map Division, Prints & Photographs Division, and Manuscript Division also provide access to databases that contain information related to China, Japan, Korea, and other Asian countries. Noteworthy are Country Studies,7 Library of Congress Geography and Maps: an Illustrated Guide,8 Global Legal Information Network (GLIN),9 the Guide to Law Online,10 an annotated guide prepared by the Law Library of Congress Public Services Division, and The Floating World of Ukiyo-e: Shadow, Dreams and Substance, an online exhibition catalog.11 There are also various online finding aids in Manuscript Division12 for personal papers of eminent personages, such as Caleb Cushing, Henry R. Luce, and Owen Lattimore, that contain Asia-related documents and records.
(With contributions from Lily Kecskes and Eiichi Ito)
1 Washington Post, Tuesday, November 22, 2005: A29
2 http://international.loc.gov/intldl/naxihtml/naxihome.html
3 Library of Congress Asian Collections: an Illustrated Guide: http://www.loc.gov/rr/asian/guide/
3 Journal of East Asian Libraries, No. 138, February 2006
4 Korean Bibliography: http://lcweb2.loc.gov/misc/korhtml/korbibhome.html
5 Korean Serials: http://www.loc.gov/rr/asian/koreanserials
6 Asian Portals: http://www.loc.gov/rr/asian/area AD.html
7 Country Studies: http://memory.loc.gov/frd/cs/
8 Library of Congress Geography and Maps: an Illustrated Guide: http://www.loc.gov/rr/geogmap/guide/
9 Global Legal Information Network (GLIN): http://www.glin.gov
10 Guide to Law Online: http://www.loc.gov/law/public/law-guide.html
11 The Floating World of Ukiyo-e: Shadow, Dreams and Substance: http://www.loc.gov/law/public/law-guide.html
12 Manuscript Division: http://www.loc.gov/rr/mss/f-aids/mssfa.html#c
4 Journal of East Asian Libraries, No. 138, February 2006
IDEOGRAPH VARIANT FORMS AND USAGE CONTROL IN NACSIS-CAT
Akira Miyazawa National Institute of Informatics, Japan
Preface
In the 2005 CEAL meetings in Chicago, I heard from several people that UNICODE, EACC and IME problems were a current issue in the OCLC and RLIN new cataloging systems. Sometimes I was asked a question about how our NACSIS-CAT cataloging system handles these problems. Of course, NACSIS-CAT has nothing to do with EACC, though it has been using UNICODE more than five years. Still, I thought it might be helpful for American CJK catalogers to know about character set handling in NACSIS-CAT. I would be glad if you find something useful in this article.
1. Introduction: NACSIS-CAT
Japanese catalogers in CEAL may not need introduction to NACSIS-CAT. However, for other area librarians, a brief introduction to NACSIS-CAT may be helpful. NACSIS-CAT is a shared cataloging system, similar to the OCLC and RLIN cataloging systems, operated by National Institute of Informatics (NII) Japan. It is part of a government supported program to provide a national information infrastructure for research activities. The system was developed by the Center for Bibliographic Information at the University of Tokyo and started its operation in 1984. National Center for Science Information Systems (NACSIS) was established in 1986 to operate this system, along with other scholarly database services. NACSIS was reorganized to NII, enhancing research function of broad fields of informatics in 2000.
In April 2005, over 1300 libraries (mostly Japanese university libraries including some European and Asian libraries) are contributing to the union catalog, which comprises more than seven million monograph titles, two hundred and eighty thousand serial titles, about eighty million holdings records, 1.3 million author name authority records and twenty-four thousand uniform title authority records. Weekly increase is about ten thousand titles and a hundred thousand holdings records. In addition to the union catalog, more than thirty three million MARC records of various countries, including Japan, USA, UK, Germany, Korea, and China, are loaded to be referenced for cataloging. Recent developments have enabled RLIN and OCLC databases to be referred online for cataloging.
First generation NACSIS-CAT system used JIS (Japanese Industrial Standard) character code with private extension for European languages. Japanese, most Latin script written languages, and Russian could be handled with this character code. In 1996, a new NACSIS-CAT server client based system was introduced, and this new system was enhanced to use Unicode in year 2000. Now, it can handle almost all the languages.
NACSIS-Webcat Plus (http://webcatplus-international.nii.ac.jp/en/) is an open access web interface to the whole union catalog database produced by NACSIS-CAT. NACSIS-Webcat Plus and NACSIS-CAT both utilize CJK unified index, which enables cross variant form search (i.e., to retrieve or with ).
2. Unification, Source Code Separation and Z-variant
For a discussion of Unicode usage, some basic concepts and terms should be introduced. The first one is “unification”. This is a basic concept of Unicode [1]. By the glossary of Unicode, it is “the process of identifying characters that are in common among writing systems”. For example Õ in Turkish is unified with Õ in German and named LATIN CAPITAL LETTER O WITH DIAERESIS. Its code point is U+00D6. Hanzi, kanji or hanja in Unicode is called CJK unified ideograph. That means the repertory was made from Chinese, Japanese and Korean standard character sets with “unification”.
Unification of CJK unified ideographs was not a very simple job when the repertory was made in 1991 [2],
Journal of East Asian Libraries, No. 138, February 2006 explained in the Unicode Standard [5]. In short, it is to unify very similar shape variants like U+8fB6 with one dot and two dots.
In spite of the unification rules, characters U+6236, U+6237 and U+6238 are not unified in the repertory. This is due to the rule called source separation rule, or round trip rule. It is to ensure that round trip conversion between UNICODE and a source national standard code (such as JIS) does not cause loss of information. In this case, Taiwan national code distinguished these three “characters,” while Japanese, Korean and Chinese in that time made no distinction.
Variant like this case is sometimes called “Z-variant.” By the Unicode website, Z-variant is “Two CJK unified ideographs with identical semantics and unifiable shapes, for example, U+8AAA and U+8AAC.” [6].
Z-variant is caused by not only source separation rule but also by the need to keep compatibility with other sources. Korean pronunciation variants of a hanja necessitate 268 compatibility ideographs which are completely same form. Other reasons of Z-variant are described in the Unicode Standard [5].
There is another type of variant in the glossary called “Y-variant”. It is “Two CJK unified ideographs with identical semantics and non-unifiable shapes, for example, U+732B and U+8C93.” [6]. Characters of , and are also Y-variant examples.
Unihan database is a kind of dictionary attached in the Unicode Standard. It is also available through web page [7]. This database lists some other variant relations such as simplified variant or traditional variant. These variants can be classified as Y-variants.
One problem here is that there is no normative source of Z-variants or Y-variant. Unihan database is not a normative part of the standard. Y-variant is unquestionably language dependent. U+53F6 is simplified character of U+8449 in modern Chinese, but not in Japanese. U+82B8 is simplified character of U+85DD in modern Japanese, but not in Chinese.
Z-variants are less language dependent. Appendix S of ISO 10646 lists examples of Z-variants by source separation rules, but it is not an exhaustive list. In fact, the plan of providing complete source separation list was abandoned by the standard developer due to difficulties foreseen. Judgment about Z-variant or not is subject to personal view. There are certain number of people who claim U+9AD9 is not Z-variant of U+9AD8, but Y-variant. Z-variants interpretation of Unihan database is too broad, in my opinion. For example, U+85DD is listed as a Z-variant of U+82B8, though it might be just by a mistake. U+4E0A and U+4E04 is also a Z-variant.
The concept of Z-variant and Y-variant is acceptable for most of people. But actual assignment of variant relation to individual character is much more debatable.
3. Character set usage control
NACSIS-CAT system has a concept and a mechanism of character set usage control from the first generation system. It is to limit usage of certain characters in the character set and unify them with other characters in the set. This is introduced mainly to avoid cumbersome distinction between full and half width Latin alphabets.
Japanese encoding systems have two set of Latin alphabets due to parallel usage of double byte and single byte character sets. Double byte Latin alphabets (often called zenkaku or full-width) are distinguished from single byte alphabets. But, such distinction has meaning only in computers. There is no way to tell if an alphabet written on a title page is full width or not.
In NACSIS-CAT system, full-width and normal alphabets are regarded as same characters. To ensure this, the system normalizes input data. Full width alphabets input by users are converted to normal alphabets before stored in the database. This conversion is applied for all the input to the system, including search terms. You will not see full-width alphabets in the output from the system.
Character set usage control is intended to limit the character set, so that the complexity of cataloging k i d d Sit l l li ti b th t it t f th h t t
Journal of East Asian Libraries, No. 138, February 2006
4. CJK unified ideograph usage of NACSIS-CAT
When NACSIS-CAT started to support UNICODE, its character set usage control was expanded. In brief, Z-variants are unified. That is, for example, U+8AAA and U+8AAC are regarded as the same character. If a user inputs U+8AAA in a data entry, it is automatically converted to U+8AAC by the script level normalization process.
But, our Z-variant list is different from Unihan database. As mentioned before, Z-variants of Unihan database seems to be too broad. We selected our Z-variant list based on the Appendix S of ISO/IEC 10646.
You may ask if this causes loss of information. Yes, it may cause slight loss of information. But, catalog description can not copy everything. In Western languages, upper and lower cases are not preserved. Italic, bold or handwritten styles are not preserved. Information loss by Z-variant normalization is at the same level as those cases.
In fact, there are few complaints about this, and it definitely reduce burden of catalogers.
5. CJK unified index
Script level normalization should not be confused with a unified index to provide , and type access. Script level normalization is performed for all input data to the system. A unified index also normalizes certain characters, but it is only for the hidden index fields, which are invisible to normal users.
When a catalog record is written to the database, the system automatically extracts index words from the record. Then the words go through the unified index normalization process. This is to unify Y-variants. But our Y-variant list is, again, different from Unihan database. It is broader than Unihan database. As mentioned before, Y-variant is language dependent. We merge Chinese, Japanese and Korean Y-variants to provide one large list.
Here is an example of CJK unified index Y-variant group.