<<

54 JOURNAL OF ELECTRONIC SCIENCE AND TECHNOLOGY, VOL. 12, NO. 1, MARCH 2014

Design of a Cloud Service for Learning Chinese Pronunciation

Hsueh-Ting , Wei-Shan Tsai, and Shao- Lee

AbstractWith the fast growing of cloud computing Chinese articles in textbooks usually requires the help of infrastructure, learning from cloud services has become additional phonetic symbols. There are two Chinese more and more convenient for people worldwide. In phonetic systems: and Zhuyin. Pinyin is currently order to integrate the cloud computing technology and the official which was published by the different -learning platforms including variant mobile People' Republic of (PRC) in 1958[6]. Before 1958, apps, Windows and web-based applications, we develop Zhuyin was the standardized phonetic system in entire our Chinese learning system “analytic Chinese helper” China for Chinese (Mandarin) pronunciation. The Zhuyin with a service-oriented architecture (SOA). Based on the system incorporates 37 symbols (Fig. 1) which new architecture we designed and developed a cloud transcribe precise sounds of among service for the e-learning of Chinese on the different Chinese . At present, Zhuyin is still the Internet as a convenient resource for foreign students, major phonetic system in ’s education system for especially in the of Chinese texts. teaching and learning Chinese. At the same time, Pinyin is There are two Chinese phonetic systems: Pinyin and also used in traffic signs and maps in Taiwan. Zhuyin. Pinyin is the official Moreover, the learning of Chinese characters is also characters, and Zhuyin incorporates additional difficult for foreigners because there are two different Bopomofo symbols which transcribe precise sounds of systems of Chinese characters: Traditional Chinese Chinese characters. The proposed analytic Chinese and Simplified Chinese. Simplified Chinese characters helper provides real-time annotations with Pinyin or resulted from the simplification of Traditional Chinese Zhuyin symbols, and thereby the annotated articles can characters by the PRC government in 1956. However, be used as e-learning objects in learning Chinese. Traditional Chinese characters are still being used in

Index TermsChinese-learning, cloud service, Kong and Taiwan as well as in ancient Chinese books. e-learning, Pinyin, Zhuyin. Table 1 lists five different Chinese representations. The first row is the sentence in Simplified Chinese and the second row is in Traditional Chinese. Both of the sentences are difficult for foreign students to pronounce. Therefore, it 1. Introduction is profitable to provide additional phonetic annotation along Recently, the study of has become with the sentences. There are three common types of more and more popular worldwide. For most foreigners, the Chinese phonetic annotation (rows 3~5). The first type uses most convenient way of learning Chinese is the e-learning Pinyin symbols with marks. Tones are an important through a variety of Internet services, including websites part of Chinese, which are the variation of pitch within a and online chat rooms[1]−[4]. One of the major challenges in . There are basically four tones in . Sometimes, people use the numbers 1~4 for tones learning Chinese is the pronunciation of Chinese characters (row 4). The last row is the equivalent phonetic because these characters are not phonetic [5]. For representation using the Zhuyin system. Excluding the native Chinese students in elementary schools, reading the Bopomofo symbols, the three tone symbols “ˊ, ˇ, ˋ” are used in the Zhuyin system. In summary, the Manuscript received June 20, 2013; revised August 30, 2013. pronunciation of Chinese characters is not easy for .-. Chu is with the Department of Computer Science and Information Engineering, Asia University, 41354 (Corresponding author foreigners. For this reason, we develop the cloud service to e-mail: [email protected]) help the learning of Chinese for foreign students through .-S. Tsai is with the Department of Computer Science and the Internet[7]. Information Engineering, Asia University, Taichung 41354 ([email protected]). S.-Y. Lee is with the Department of Foreign and Literature, Asia University, Taichung 41354 (e-mail: [email protected]). Color versions of one or more of the figures in this paper are available online at http://www.intl-jest.com. Digital Object Identifier: 10.3969/.issn.1674-862X.2014.01.011 Fig. 1. Bopomofo symbols. CHU et al.: Design of a Cloud Service for Learning Chinese Pronunciation 55 Table 1: Different Chinese representations for the sentence “Asia University welcomes you.” Chinese words/annotations Transcriptions Simplified Chinese words 亚洲大学欢迎你 Traditional Chinese words 亞洲大學歡迎你 Web app Pinyin with tone marks yàzhōu dà xué huān yíng nĭ Mobile app Win app Pinyin with tone numbers ya4zhou1 da4 xue2 huan1 ying2 ni3 Zhuyin with Bopomofo* ㄧㄚˋ ㄓㄡ ㄉㄚˋ ㄒㄩㄝˊ ㄏㄨㄢ ㄧㄥˊ ㄋㄧˇ *In the Zhuyin system, the first tone is without a tone symbol.

Education Cloud 2. Two Chinese Systems

China has become the second largest economic entity in the world. People, around the world, learn Chinese for the Fig. 4. Concept of education cloud for the access of learning business or culture reasons. Hundreds of contents from different terminal devices. institutes (Fig. 2) have been opened in dozens of countries to study Chinese[8]. Of course, these institutes use 3. Design of Cloud Service for Simplified Chinese with the Pinyin phonetic system. Learning Chinese Pronunciation However, people will find that there is another Chinese system if they visit or Taiwan. It is the original In order to provide the integration of heterogeneous Chinese system which uses traditional (non-simplified) application types today including web-based and Window- characters and additional Bopomofo symbols in the based applications and variant mobile apps as shown in Fig. 4, we aim to use web services to integrate remote accesses learning systems such as the Chinese Language Education [9], [10] Center at Asia University (Fig. 3). Besides, there are ten in service oriented architecture (SOA) . It is also the Chinese centers in different universities around Taiwan. common approach of many software as a service (SaaS) (http://english.moe.gov.tw/ct.asp?xItem=9693&CtNode=41 cloud services. We aim to consider the Chinese e-learning 7). Most of the Chinese centers in Taiwan introduce the as a set of services built by cloud computing techniques. Zhuyin phonetic system to students. As a result, it is useful The cloud services were implemented on Microsoft .Net web services which are based on the simple object access for a foreign student to know both of the Chinese systems if protocol (SOAP). The Web services are deployed at an /she learns Chinese in Taiwan. internet information services (IIS) server. The web-based interface of the analytic Chinese helper (ACH) is developed by PHP (hypertext preprocessor) programs on Apache server (Fig. 5). The ACH service provides both of Pinyin and Zhuyin phonetic annotations for either simplified or traditional Chinese texts. The PHP programs use the SoapClient class to communicate with the SOA services through XML-based messages (Fig. 6 and Fig. 7).

Fig. 2. Website of Confucius Institute () which is supported by China government. http://www.chinese.cn/

Fig. 5. Website of proposed cloud service: analytic Chinese Helper (ACH) http://aiplab.net/ach/. Fig. 3. Website of the Chinese Language Education Center at Asia Fig. 6. XML file of ACH web services for service calling from University. http://clec.asia.edu.tw heterogeneous applications. 56 JOURNAL OF ELECTRONIC SCIENCE AND TECHNOLOGY, VOL. 12, NO. 1, MARCH 2014

$client = new SoapClient("ws/achws."); 4.1 ST-Mixed $params = array('Org' => $mytxt); The Chinese service is driven by an in-memory Chinese switch ($tab) dictionary. The dictionary inherits most of the entries of { [12] case 0: Chinese words from CC-CEDICT . However, the CC- $displaytext=$mytxt; CEDICT is unable to support the conversion between break; Traditional Chinese and Simplified Chinese vocabularies. case 1: We have built the mappings of Traditional Chinese and $wsresult= $client->__soapCall('AchDefinition',array('parameters'=> Simplified Chinese from the Chinese rules in the $params)); MediaWiki project[13]. Each Chinese word has one or two $displaytext=$wsresult->AchDefinitionResult; entries in the traditional and simplified mixed dictionary break; case 2: (ST-mixed dictionary). If the characters of the Chinese … word are different in traditional and simplified systems, } there are two entries. Each entry of the dictionary has the Fig. 7. PHP codes to call the ACH web services. predefined attributes such as the Pinyin string, the Zhuyin string, and the filename of sound file. The proposed cloud service, analytic Chinese helper, provides automatic annotation of speech sounds of Chinese 4.2 Segmentation Engine text to help the beginner to learn Chinese regardless of The input Chinese text from the web interface goes to using Pinyin or Zhuyin (Fig. 5). The user can paste Chinese the background SOAP service, which consists of the text unto the website, and then he can easily switch the segmentation and annotation functions. The segmentation interface between tabs. There are four tabs for different engine is implemented by a longest prefix match (LPM) annotations. The first annotation provides explanations of [14] algorithm . The segmented words then are converted into Chinese words. A balloon tip will be displayed for the Word IDs of the ST-mixed dictionary. meaning of a Chinese word if the user moves the mouse near the word. The other three tabs are for pronunciation of Chinese text. It will utter the word if the user moves the mouse near it in any of these tabs. We plan to have the analytic Chinese helper application programming interface (API) for variant mobile apps in the near future[11].

4. Architecture of the Analytic Chinese Helper Service In this section, we will give an overview of how the cloud system works as illustrated in Fig. 8. The web (a) interface is implemented in PHP, and the in-memory dictionary is implemented in # with .Net framework for efficiency. There are four major components in the system: 1) the segmentation engine; 2) the annotation engine; 3) a dictionary with mixed traditional and simplified Chinese words; and 4) a big phonetic repository of sound files. Of course, there are PHP programs for the web operations. ()

(c) Fig. 9. Annotation of Chinese text with Pinyin and Zhuyin by the analytic chinese helper service : (a) annotation with explanation in English, (b) annotation with Pinyin symbols, and (c) annotation Fig. 8. System architecture of the proposed cloud service. with Zhuyin symbols . CHU et al.: Design of a Cloud Service for Learning Chinese Pronunciation 57 Table 2: Test Chinese articles from news websites designed as an assistant tool for the students wherever they

URL Word counts learn Chinese. In particular, the service is designed for a) http://news.cnyes.com/Content/20130904/kh9rbzxcz1lgi_2.shtml 1937 those students who have learned Chinese from the Chinese b) http://finance.sina.com.cn/china/20130905/152316675862.shtml 1267 learning centers in Taiwan. Students in Taiwan will meet c) http://www.chinareviewnews.com/doc/1027/1/8/9/102718930.html 2159 the demand of using both Pinyin and Zhuyin phonetic systems. We believe the ACH cloud service is promising to the foreign students in Taiwan for better Chinese study. ms Service Responce Time 60 Acknowledgment 50 The authors are grateful to the generous supports from the 40 Department of Computer Science and Information Engineering, Annotation Asia University. Furthermore, the authors would like to express 30 Pinyin their gratitude to Prof. C. . Ka (National Taiwan University) and Bopomofo 20 Prof. C. C. (National Tsing Hua University) for their fruitful comments and discussions on the cloud system. Finally, the 10 authors would like to express their gratitude to Prof. Jeffrey Tsai 0 (the President of Asia University) for his support on the project. A B C Fig. 10. Test results of service response time. References 4.3 Annotation Engine [1] H.-Y. Chen and .-Y. , “Web-based synchronized According to the types of annotation (Pinyin, Zhuyin or multimedia lecture system design for teaching/learning English explanations), the program will retrieve different Chinese as second language,” Computers & Education, vol. − attributes from the in-memory . After 50, no. 3, pp. 693 702, 2008. [2] . Juan, “Instructional design for situational Chinese mobile that, the attributes are packaged into HTML content with learning,” in Proc. of 2012 2nd Int. Conf. on Consumer W3C (World Wide Web Consortium) ruby markup[15]. The Electronics, Communications and Networks, , 2012, W3C ruby markup is designed to add ruby texts associated pp. 353−356. with the base text as shown in Fig. 9. At the same time, the [3] . , F. Lv, J. , H. Wang, W. Luo, . Kam, et al., annotation engine inserts an URL link to a sound file in the “Let's play chinese characters: mobile learning approaches website for each segmented word. As a result, users can via culturally inspired group games,” presented at the read the Chinese text with phonetic symbols with the help SIGCHI Conf. on Human Factors in Computing Systems, of sound files. Atlanta, 2010. [4] X. Liu, . . Liu, Z. , M. , and Y. Huang, “A 5. Performance Evaluation of Chinese e-learning network platform based on Web2.0,” in Proc. of 2009 Int. Conf. on Information Management, ACH Services Innovation Management and Industrial Engineering, Xi'an, We randomly selected three articles from the Internet to 2009, pp. 522−525. test the ACH services (Table 2). All the SOA services [5] J. Norman, Chinese. New York: Cambridge University Press, (including annotation, pinyin, and Bopomofo rubies) 1988. succeeded in providing correct results for all of the three [6] S. Jian, “A pinyin keyboard for inputting Chinese articles. Fig. 10 shows the results of the response time that characters,” Computer, vol. 18, no. 1, pp. 60−63, 1985. the SOA services responded to the user operations through [7] S. , “What will cloud computing provide for Chinese the ACH website. All responses were finished between 30~ m-learning?” in Proc. of 2011 Int. Conf. on e-Education, 60 milliseconds. As a result, the ACH service can provide Entertainment and e-Management, Bali, 2011, pp. 171−174. real-time services for multiple learners simultaneously. [8] . Starr, “Chinese language education in Europe: the Confucius Institutes,” European Journal of Education, vol. 44, no. 1, pp. 65−82, Mar. 2009. 6. Conclusions [9] M. M. Alabbadi, “Cloud computing for education and learning: education and learning as a service (ELaaS),” in More and more cloud services provide convenient Proc. of 2011 14th Int. Conf. on Interactive Collaborative learning resource of Chinese language for foreign students Learning, Piestany, 2011, pp. 589−594. [7] through the Internet . Most of the Chinese learning [10] D. . Chandra and D. B. Malaya, “Role of cloud computing services are from China and thus only offer Pinyin to the in education,” in Proc. of 2012 Int. Conf. on Computing, learning. The proposed analytic Chinese helper (ACH) is a Electronics and Electrical Technologies, Tamil Nadu, 2012, cloud service installed with both Pinyin and Zhuyin. It is pp. 832−836. 58 JOURNAL OF ELECTRONIC SCIENCE AND TECHNOLOGY, VOL. 12, NO. 1, MARCH 2014 [11] . Tam and . L. F. Cheung, “An extendible and ubiquitious Hsueh-Ting Chu was born in Taiwan in E-learning software for foreigners to learn Chinese on 1969. He received his M.S. and Ph.D. iOS-based devices,” in Proc. of 2012 IEEE 12th Int. Conf. degrees from National Tsing Hwa on Advanced Learning Technologies, Rome, 2012, pp. University in 1997 and in 2002, respectively, 46−48. both in computer science. He then joined [12] CC-CEDICT. A Downloadable Chinese-English Dictionary the Department of Computer Science and with Pinyin Pronunciation for the Chinese Characters. Information Engineering, Asia University, [Online]. Available: http://www.mdbg.net/chindict/ Taichung. His research interests include bioinformatics, cloud computing, social [13] MediaWiki. A free web-based wiki software application. networking, and e-learning. [Online]. Available: http://www.mediawiki.org/

[14] H. Lim and . Lee, “Survey and proposal on binary search Wei-Shan Tsai, photograph and biography not available at the algorithms for longest prefix match,” IEEE Communications time of publication. − Surveys and Tutorials, vol. 14, no. 3, pp. 681 697, 2012. [15] W3C. Ruby Annotation. [Online]. Available: Shao-Yu Lee was born in Taiwan in 1968. http://www.w3.org/TR/ruby/ She has been working in English teaching field since 1991. In 2001, she joined the Department of Foreign Languages and Literatures Asia University, Taichung. Currently, she is a member of the International Language Center, Da-Yeh University, . Her research interest includes children English pedagogy.