<<

TENSE GENERATION IN AN INTELLIGENT TUTOR FOR FOREIGN TEACHING: SOME ISSUES IN THE DESIGN OF THE EXPERT

Danilo FUM (*), Paolo Giangrandi(°), Carlo Tasso (o) (*) Dipartimento dell~ducazione, Universita' di Trieste, Italy (o) Laboratorio di Intelligenza Artificiale, Universita' di Udine, Italy via Zanon, 6 - 33100 UDINE, Italy e.mail: tasso%[email protected]

ABSTRACT Intelligent Tutoring Systems, in the specific domain of foreign language teaching The paper presents some of the results (Barchan, Woodmansee, and Yazdani, 1985; obtained within a research project aimed at Cunningham, Iberall, and Woolf, 1986; developing ET (English Tutor), an intelligent Schuster and Finin, 1986; Weischedel, Voge, tutoring system which supports Italian and James, 1978; Zoch, Sabah, and Alviset, students in learning the English . We 1986). concentrate on one of the most important modules of the system, the domain (i.e. verb) An Intelligent Tutoring System (ITS, for expert which is devoted to generate, in a cog- short) is a program capable of providing nitively transparent way, the right tense for students with tutorial guidance in a given the verb(s) appearing in the exercises (Lawler and Yazdani, 1987; Sleeman presented to the student. An example which and Brown, 1982; Wenger, 1987). A full- highlights the main capabilities of the verb fledged ITS: (a) has specific domain expert is provided. A prototype version of ET expertise; (b) is capable of modeling the has been fully implemented. student knowledge in order to discover the reason(s) of his mistakes, and (c) is able to make teaching more effective by applying different tutorial strategies. ITS technology 1. INTRODUCTION seems particularly promising in fields, like language teaching, where a solid core of facts In the course of its evolution, English has lost is actually surrounded by a more nebulous most of the complexities which still area in which subtle discriminations, personal characterize other Indo-European . points of view, and pragmatic factors are Modern English, for example, has no involved (Close, 1981). , it makes minimum use of the subjunctive and adopts 'natural' gender In this paper we present some of the results instead of the grammatical one. The obtained within a research project aimed at language, on the other hand, has become developing ET (English Tutor), an ITS which more precise in other ways: cases have thus helps Italian students to learn the English verb been replaced by prepositions and fixed word system. An overall description of ET, of its order while subtle meaning distinctions can be structure and of operation has been conveyed through a highly sophisticated use given elsewhere (Fum, Giangrandi, and of tense expressions. Learning correct verb Tasso, 1988). We concentrate here on one of usage is however extremely difficult for non the most important modules of the system, the native speakers and causes troubles to people domain (i.e. verb) expert which is devoted to who study English as a foreign language. In generate, in a cognitively transparent way, the order to overcome the difficulties which can right tense for the verb(s) appearing in the be found in this and several other grammatical exercises presented to the student. The paper areas, various attempts have been made to analyzes some issues that have been dealt utilize Artificial Intelligence techniques for with in developing the verb expert focusing developing very sophisticated systems, called

124 - on the knowledge and processing mecha- cians and people interested in computational nisms utilized. The paper is organized as accounts of language usage (see, for example: follows. Section two introduces our approach Ehrich, 1987; Fuenmayor, 1987; to the problem of tense generation in the Matthiessen, 1984). There is however no context of a tutor for second language on, and no complete theoretical teaching. Section three briefly illustrates the account of, the factors which contribute to ET general architecture and mode of tense generation. The different proposals operation. Section four constitutes the core of which exist in the literature greatly vary the paper and presents the design re- according to the different features that are quirements, knowledge bases and reasoning actually identified as being critical and their algorithms of the verb expert together with an level of explicitness, i.e. which features are example which highlights its main given directly to the tense selection process capabilities. The final section deals with the and which must be inferred through some relevance of the present proposal both in the form of reasoning framework of linguistic studies on verb generation and of intelligent tutoring systems Our interest in this topic focuses on for language teaching. developing a system for tense selection capable of covering most of the cases which can be found in practice and usable for 2. THE TENSE GENERATION teaching English as a foreign language. A PROBLEM basic requirement which we have followed in designing ET is its cognitive adequacy: not An important part of the meaning of a only the final result (i.e. the tense which is sentence is constituted by temporal generated), but also the knowledge and information. Every complete sentence must reasoning used in producing it should mirror contain a main verb and this verb, in all Indo- those utilized by a human expert in the field European languages, is temporally marked. (i.e. by a competent native speaker). The ITS The tense of the verb indicates the relation must thus be an 'articulated' or 'glass-box' between the interval or instant of time in expert. which the situation (i.e. state, event, activity etc.) described in the sentence takes place and the moment in which the sentence is uttered, 3. THE ET SYSTEM and may also indicate subtle temporal relations between the main situation and other ET is an intelligent tutoring system devoted to situations described or referenced in the same support Italian students in learning the usage sentence. Other information can be derived of . The system, organized from the mood and aspect of the verb, from around the classical architecture of an ITS the lexical category which the verb is a (Sleeman and Brown 1982), consists member of and, more generally, from several essentially of: kinds of temporal expressions that may - the Tutor, which is devoted to manage the appear in the sentence. Moreover, the choice teaching activity and the interaction with the of the tense is determined by other student, information, not directly related with temporal - the Student Modeler which is able to meaning, such as speaker's intention and evaluate the student's competence in the perspective, characteristics of specific domain, and discourse, etc. Very complex relations exist - the Domain (i.e. verb) Expert which is an among all these features which native articulated expert in the specific domain dealt speakers take into account in understanding a with by the system. sentence or in generating an appropriate tense for a given clause or sentence. In what follows, in order to better understand the discussion of the Domain Expert, a The problem of choosing the right verb tense sketchy account of the system mode of in order to convey the exact meaning a operation is given. sentence is intended to express has aroused the interest of linguists, philosophers, logi-

- 125 - At the beginning of each session, the Tutor While the sentences that are presented to the starts the interaction with the student by student are in natural language form, the verb presenting him an exercise on a given topic. expert receives in input a schematic The same exercise is given to the Domain description of the sentence. Expert which will provide both the correct Every sentence of the exercise is constituted solution and a trace of the reasoning by one or more clauses playing a particular employed for producing it. At this point, the role in it (major clauses and minor clauses at Student Modeler compares the answer of the various levels of subordination). Each clause student with that of the expert in order to is represented inside the system through a identify the errors, if any, present in the series of attribute-value pairs (called exercise former and to formulate some hypotheses descriptors) that highlight the information about their causes. On the basis of these hy- relevant for the tense selection process. This potheses, the Tutor selects the next exercise information includes, for example, the kind of which will test the student on the critical clause (main, coordinate, subordinate), aspects pointed out so far and will allow the whether the clause has a verb to be solved, Modeler to gather further information which the and form of the clause, the kind of could be useful for refining the hypotheses event described by the clause, the time previously drawn. Eventually, when some interval associated with the event described in misconceptions have been identified, the the clause, etc. Some of the exercise refined and validated hypotheses will be used descriptors must be manually coded and in order to explain the errors to the student inserted in the exercise data base whereas the and to suggest possible remediations. When a others (mainly concerning purely linguistic topic has been thoroughly analyzed, the Tutor features) can be automatically inferred by a will possibly switch to other topics. preprocessor devoted to parsing the exercise text. For instance, the schematic description of: 4. THE DOMAIN EXPERT ET > EXERCISE-1: The Domain Expert is devoted to generate the 7 (live) in this house for ten years. Now the fight answers for the exercises proposed to roof needs repairing.' the student. Usually, exercises are constituted by a few English sentences in which some of is the following (with the items automatically the verbs (open items) are given in inferred by the parser preceded by the symbol form and have to be conjugated into an @): appropriate tense. Sometimes, in order to avoid , additional information EXERCISE: ex 1 describing the correct interpretation (as far as text: 'I (live) in this house for ten years. Now the temporal point of view is concerned) of the roof needs repairing.' the sentence is given. Consequently, the @sentence_structure: el, c2 Domain Expert must be able: @clauses to resolve: cl i) to select the grammatical tense to employ for each open item of the exercise in order to CLAUSE: cl correctly describe the status of the world the sentence is intended to represent, and text: 'I (live) in this house for ten years' ii) to appropriately conjugate the verb @clause_kind: main according to the chosen tense. @clause_verb: live @ superordinate: nil Besides these basic functionalities, the @subordinate: nil tutoring environment in which the Domain @previous_coordinate: nil Expert operates imposes a further @clause_form: aff'mnative requirement, i.e. the expert must be able: @subject: I iii) to explain to the student how the solution @ subjecLcase: [singular first] has been found, which kind of knowledge @voice: active has been utilized, and why. @evenLtime: tl @time_expression: ['for ten years' t2]

- 126 - @category: state precedence, etc.) that occur between the aspect: persistent situation dealt with in the current clause and context: informal the situations described in other clauses• intentionality: nil 8. Adverbial Information, which is related to the meaning of possible temporal adverbials CLAUSE: c2 specified in the same clause. The Domain Expert operation is supported by TIME_RELATIONS: exl a knowledge base constituted by a partitioned set of production rules which express in a meet(t2, now) transparent and cognitively consistent way equal(tl, t2). what is necessary to do in order to generate a verb tense• Its activity is mostly concerned with the derivation of the tense features When solving an open item, the Domain strictly related to temporal reasoning. The Expert must infer from the exercise exercise descriptors include for this purpose descriptors all the remaining information only basic information related to the specific needed to make the final choice of the temporal adverbials or conjunctions which appropriate tense• This information is appear in the exercise. This information is constituted by several tense features, each one utilized to build a temporal model of the describing some facet of the situation that is situation described in the exercise. Initially, necessary to take into account• The choice of the temporal model is only partially known which tense features are to be considered in and is then augmented through the application the tense selection process represents a of a set of temporal relation rules• This rules fundamental step in the design of the verb constitute a set of axioms of a temporal logic - generation module. This problem has no similar to that utilized by Allen (1984)- which agreed upon solution, and it constitutes one of has been specifically developed for: (a) the most critical parts of any theory of tense representing the basic temporal knowledge generation (Ehrich, 1987; Fuenmayor, 1987; about the situations described in the exercise; Matthiessen, 1984). The main features (b) reasoning about these knowledge in order considered by the Domain Expert are listed to compute some of the tense features not below• Some of the features are already explicitly present in the schematic description included in the exercise descriptors (1 to 4), of the exercise. The first task of the expert whereas the others must be inferred by the module is therefore that of deriving possible system when solving the exercise (5 to 8): new relations which hold among situations 1. Category, which identifies the kind of described in the exercise. situation described by the clause (e.g., event, state, , activity, etc.). In the schematic description of exercise 1 we 2. Aspect, which concerns the different can see two time relations explicitly asserted: viewpoints that can be utilized for describing meet(d, now) and a situation. equal(tl, t2). 3. Intentionality, which states whether the The meaning of the fast clause is that the time situation describes a course of action that has interval t2 (corresponding to the temporal been premeditated or not. expression 'for ten years') precedes and is 4. Context, which concerns the type of contiguous to the time interval indicated by discourse in which the clause or sentence now (i.e. the speaking time)• The meaning of appears. the second clause is that the time interval tl 5. Duration, which refers to the time span (representing the state or event expressed by (long, short, instantaneous, etc.) occupied by the main verb) is equal to the time interval t2. a situation. 6. Perspective, which refers to the position From the explick time relation it is possible to along the temporal axis of the situation or to derive, by employing the following time its relation with the present time. relation rule: 7. Temporal Relations, which refer to the meet(tx, ty) & equal(tx, tz) => meet(tz, ty). temporal relations (simultaneity, contiguity, the inferred relation:

127 - meet(t1, now). 4 - equal(event_time, reference_time), The Domain Expert tries then to infer, for 5 - aspect -- persistent every exercise clause, the so-called THEN time, i.e., the moment of time which the apply the present tense. situation described in the sentence refers to (Matthiessen, 1984; Fuenmayor, 1987). In IF order to determine the reference time of every 1 - category = single_action OR clause, the expert utilizes a set of reference category = state, time identification rules whose condition part 2 - meet(evenLtime, now), takes into account the structural description of 3 - meet(reference_time, now), the sentence. 4 - equal(event_time, reference_time), 5 - duration <> short, An example of reference time identification 6 - aspect = persistent, rule is the following: 7 - context <> formal, 8 - verb accepts ing_form IF THEN 1 - clause_kind = main, apply the continuous tense. 2 - previous_coordinate = nil OR new_speaker = nil OR which provide two different (both correct) clause_form = interrogative, solutions for the open item. 3 - time_expression <> nil I'HEN Once the tense to be used has been identified, set the reference_time to the most specific the verb is conjugated utilizing an appropriate time expression set of conjugation rules. In our example the present perfect is obtained through the By applying this rule to the structural application, among others, of the following description of Exercise 1 it is possible to infer rules: that the reference time of the clause cl is the interval t2 that, being the only time expression IF present in the clause, is also the most specific tense = present perfect one. THEN the verb sequence is formed with: When all the reference times have been - of 'to have' determined, the Domain Expert looks only for - past of the verb. the clauses with open items in order to compute (through the temporal axioms) three IF particular temporal relations (Ehrich, 1987): 1 - tense = past participle, deictic (between reference time and speaking 2 - verb is regular time: RT-ST), intrinsic (between event time THEN and reference time: ET-RT) and ordering the verb sequence is formed with: (between event time and speaking time: ET- - 'ed-form' of the verb. ST). When these relations have been computed, all the needed tense features are known, and the final tense selection can be 5. CONCLUSIONS performed. Again, a set of selection rules takes care of this activity. In the paper we have presented some issues involved in the design of a verb generation In our example, the following selection rules module within a research project aimed at can be applied: developing an ITS capable of teaching the English verb system. A first prototype of ET IF has been fully implemented in MRS (LISP I - category = state OR augmented with logic and rule-programming category = iterated_action, capabilities and with specific mechanism for 2 - meet(event_time, now), representing meta-knowledge) on a SUN 3 3 - meet(reference_time, now), workstation.

- 128 - Fuenmayor, M. E. (1987) Tense Usage Our primary goal in this phase of the project and Recognition for Machine has been the cognitive adequacy of the verb . IBM Los Angeles Scientific expert. In order to develop it, we took a Centre Report 1987 - 2796, Los Angeles, pragmatic approach, starting with the CA. identification of the features traditionally considered by , constructing rules Fum, D., Giangrandi, P., and Tasso, C. of tense selection grounded on this features (1988) The ET Project: Artificial intelligence and, finally, refining features and rules in second language teaching.In: F. Lovis and according to the results obtained through their E.D. Tagg (Eds), Computers in Education. use. Amsterdam, The Netherlands: North- Holland, 511-516.a The work presented here relates both to the research carried out in the fields of linguistics Lawler, R.W. and Yazdani, M. (Eds.) (1987) and philosophy, concerning theories of verb Artificial Intelligence and Education. generation and the temporal meaning of Norwood, NJ: Ablex. verbs, respectively, and the field of intelligent tutoring systems. As far as the first topic is Matthiessen, C. (1984) Choosing Tense in concerned, we claim that teaching a foreign English. USC Research Report 84-143. language can constitute a good benchmark for University of Southern California. evaluating the soundness and completeness of such theories. In the field of foreign language Schuster, E. and Finin, T. (1986) VP2: The teaching, on the other hand, the only way to role of user modeling in correcting errors in build articulated, glass-box experts is to second language learning. In: A. G. Cohn provide them with language capabilities such and J.R. Thomas (Eds.) Artificial Intelligence as those devised and described by linguistic andlts Applications. New York, NY: Wiley. theories. Sleeman, D. H. and Brown, J. S. (eds.) (1982) Intelligent Tutoring Systems. London: Academic Press. Allen, J.F. (1984) Towards a General Theory Weischedel, R.M., Voge, W.M., and James, of Action and Time. Artificial Intelligence, M. (1978) An ,amificial Intelligence Approach 23, 123-154. to Language Instruction. Artificial Intelligence, 10, 225-240. Barchan, J., Woodmansee, B.J., and Yazdani, M. (1985) A Prolog-Based Tool for Wenger, E. (1987) Artificial Intelligence and French Analyzers. Instructional Tutoring Systems. Los Altos, CA: Morgan Science, 14. Kaufmann. Close, R.A. (1981) English as a Foreign Zoch, M., Sabah, G., and Alviset, C. (1986) Language. London: Allen & Unwin. From Structure to Process: Computer assisted teaching of various strategies of generating Cunningham, P., Iberall, T., and Woolf, B. pronoun construction in French. Proceed. of (1986) Caleb: An intelligent second language COLING-86, Bonn, FRG, 566-569. tutor. Proceed. IEEE Intern. Confer. on Systems, Man, and Cybernetics. Los Alamitos, CA: Computer Soc. IEEE, 1210- 1215. Ehrich, V. (1987) The Generation of Tense. In: G. Kempen (Ed.), Natural Language Generation. Dordrecht, The Netherlands: M. Nijhoff, 423-44.

129 -