<<

Linguistic Description of Complex Phenomena with the rLDCP R Package

Jose M. Alonso Patricia Conde-Clemente Gracian Trivino Centro de Investigacion´ en Universidad de Oviedo, Phedes Lab, Tecnolox´ıas da Informacion´ (CiTIUS) Asturias, Spain Asturias, Spain University of Santiago de Compostela, Santiago de Compostela, Spain [email protected]

Abstract Phenomenon

Monitoring and analysis of complex phenom- Input Data Elements

ena attract the attention of both academy and ● Communicative Goal Setting Application Parameters ● User Model Data industry. Dealing with data produced by ● Knowledge Source Acquisition Preprocessing Input Data ● Discourse History complex phenomena requires the use of ad- Perception Modeling vance computational intelligence techniques. GLMP Interpretation Data Relevance Estimation Namely, linguistic description of complex Structure based on Corpus phenomena constitutes a mature research line. Report Template Report Content Determination It is supported by the Computational Theory Generation Linguistic Realization of Perceptions grounded on the Fuzzy Sets Output Linguistic Theory. Its aim is the development of com- Customized Linguistic Messages Report putational systems with the ability to generate vague descriptions of the world in a similar way how humans do. This is a human-centric and multi-disciplinary research work. More- Figure 1: The LDCP Architecture for NLG/D2T. over, its success is a matter of careful design; thus, developers play a key role. The rLDCP describing big data (Conde-Clemente et al., 2017b); R package was designed to facilitate the de- velopment of new applications. This demo in- advising how to save energy at home (Conde- troduces the use of rLDCP, for both beginners Clemente et al., 2016); describing physical activ- and advance developers, in practical use cases. ity (Sanchez-Valdes et al., 2016); describing drivers’ behavior in driving simulations (Eciolaza et al., 2013); or describing double stars in astronomy (Ar- 1 Introduction guelles and Trivino, 2013). Trivino and Sugeno (2013) defined a framework Figure 1 depicts the LDCP architecture for Nat- for Linguistic Description of Complex Phenomena ural Generation in Data-to-text applica- (LDCP). It is based on the Computational Theory of tions (NLG/D2T). It is inspired on the well-known Perceptions (CTP) introduced by Zadeh (2001) as a NLG pipeline proposed by Reiter and Dale (2000). new tool for paving the way from computing with The development of new applications with LDCP numbers to computing with (Zadeh, 1999). comprises the following steps: CTP is rooted in the computational intelligence tech- nique best suited to deal with approximate reason- Careful analysis of the phenomenon under con- • ing and vague concepts, i.e., the Fuzzy Sets The- sideration, regarding: communicative goal, au- ory (Zadeh, 1965; Trillas and Eciolaza, 2015). dience background, and the set of natural lan- LDCP has already been successfully applied in guage expressions (corpus) most commonly several multi-disciplinary projects. For example: used in the context of the application domain.

243

Proceedings of The 10th International Natural Language Generation conference, pages 243–244, Santiago de Compostela, Spain, September 4-7 2017. c 2017 Association for Computational Design of a computational structure (the so- Science and Innovation [grant number FPI-MICINN • called Granular Linguistic Model of the Phe- BES-2012-057427]. nomenon, GLMP) which organizes all related perceptions in a similar way how humans usu- ally organize their experience by means of nat- References ural language. Luis Arguelles and Gracian Trivino. 2013. I-struve: Automatic linguistic descriptions of visual double Design of a Report Template easy to customize stars. Engineering Applications of Artificial Intelli- • in accordance with the audience requirements. gence, 26(9):2083–2092. Patricia Conde-Clemente, Jose M. Alonso, and Gracian Implementation of a computational system able Trivino. 2013. Interpretable fuzzy system allow- • to collect and process raw data, interpret them ing to be framed in a profile photo through linguis- according to the previously defined GLMP, and tic expressions. In Proceedings of 8th Conference of producing the Report with the most relevant in- the European for Fuzzy Logic and Technology (EUSFLAT) formation to convey to end-users. , pages 463–468, Milano, Italy. Patricia Conde-Clemente, Jose M. Alonso, and Gracian Conde-Clemente et al. (2017a) have developed an Trivino. 2016. Towards automatic generation of lin- guistic advice for saving energy at home. Soft Com- R package called rLDCP1 which constitutes a first puting, pages 1–15. implementation in R of the steps enumerated above. Patricia Conde-Clemente, Jose M. Alonso, and Gracian Thus, it facilitates the use of the LDCP architecture Trivino. 2017a. rldcp: R package for text generation in new applications. from data. In IEEE International Conference on Fuzzy Systems (FUZZ-IEEE), pages 1–6, Naples, Italy. 2 Structure of the Demo Patricia Conde-Clemente, Gracian Trivino, and Jose M. Alonso. 2017b. Generating automatic linguistic This demo describes how to use rLDCP from descriptions with big data. Information Sciences, scratch. Firstly, we explain how to download and 380:12–30. install rLDCP. Secondly, we detail how to run step Luka Eciolaza, Martin Pereira-Farina,˜ and Gracian Triv- by step the toy example ComfortableRoom from the ino. 2013. Automatic linguistic reporting in driv- point of view of beginners and advance developers. ing simulation environments. Applied Soft Computing, The goal is describing the comfort in a room with re- 13(9):3956–3967. spect to temperature and light intensity data values Ehud Reiter and Robert Dale. 2000. Building natural previously stored in a “.csv” file. language generation systems, volume 33. MIT Press. Then, we show how to use rLDCP for building a Daniel Sanchez-Valdes, Alberto Alvarez-Alvarez, and Gracian Trivino. 2016. Dynamic linguistic descrip- real application: The inProfilePhoto mobile app. We tions of time series applied to self-track the physical implement with rLDCP the application described activity. Fuzzy Sets and Systems, 285:162–181. in (Conde-Clemente et al., 2013) where an NLG sys- Enric Trillas and Luka Eciolaza. 2015. Fuzzy Logic: tem guided a person with visual disabilities to take An Introductory Course for Engineering Students. his/her own profile photos. Springer. Gracian Trivino and Michio Sugeno. 2013. Towards Acknowledgments linguistic descriptions of phenomena. International Journal of Approximate Reasoning, 54(1):22–34. This work is supported by the Spanish Ministry Lotfi A. Zadeh. 1965. Fuzzy sets. Information and Con- of Economy and Competitiveness [grant numbers trol, 8(3):338–353. TIN2014-56633-C3-3-R, TIN2014-56633-C3-1-R]; Lotfi A. Zadeh. 1999. From computing with numbers to the “Conseller´ıa de Cultura, Educacion´ e Or- computing with words. from manipulation of measure- denacion´ Universitaria” (accreditation 2016-2019, ments to manipulation of perceptions. IEEE Transac- ED431G/08) and the European Regional Develop- tions on Circuits and Systems, 46(1):105–119. ment Fund (ERDF); and the Spanish Ministry of Lotfi A. Zadeh. 2001. A new direction in AI: Toward a computational theory of perceptions. Artificial Intelli- 1rLDCP is an R package for text generation from data. It is gent Magazine, 22(1):73–84. freely available at [http://www.phedes.com/rLDCP]

244