Open NLP Based Refinement of Software Requirements

Open NLP Based Refinement of Software Requirements

International Journal of Computer Information Systems and Industrial Management Applications. ISSN 2150-7988 Volume 8 (2016) pp. 293-300 © MIR Labs, www.mirlabs.net/ijcisim/index.html Open NLP based Refinement of Software Requirements Murali Mohanan 1, Philip Samuel 2 1Department of Computer Science, SOE,Cochin University of Science and Technology, Kochi,India. [email protected] 2 Division of Information Technology, SOE,Cochin University of Science and Technology, Kochi,India. [email protected] Abstract: . Software requirements are usually written in natural object oriented model from the user requirement document. language (NL) or speech language which is asymmetric and irregular. This approach used natural language processing to analyze This paper presents a suitable method for transforming user software the written requirements and domain based ontology is used requirement specifications (SRS) and business designs written in to improve the class identification performance. The author natural language into useful object oriented models. Here a neoteric used a linguistic pattern to differentiate the class and attributes, approach is proposed to generate object oriented items from SRS. numeric pattern to analyze the relationship and parallel For NL processes like sentence detection, tokenization, parts of speech tagging and parsing of requirement specifications we structure pattern to found more classes and its attributes. But it incorporate an open natural language processing (OpenNLP) tool. It was not good enough in automatically identifying object provides very relevant parts of speech (POS) tags. This parts of oriented elements. speech tagging of the SRS is quite useful for further identification of In this paper we have addressed the problem related to the object oriented elements like classes, objects, attributes, software analysis and development phase. Here we use open relationships etc. After obtaining the required and relative natural language processing (OpenNLP) to process software information, Semantic Business Vocabulary and Rules (SBVR) are requirement specification(SRS).The OpenNLP [10],[11] is applied to identify and to extract the object oriented elements from used to produce parts of speech (POS) tag from the SRS the NL processed requirement specifications. which contains natural language statements. The POS tag captures the required details such as noun, verb, adverb, etc. of the natural language statements. Sentence splitting, Keywords:Requirement Elicitation, Software requirement tokenization and Pos tagging [12] are the phases of OpenNLP. specification, OpenNLP, SBVR, class model generation. These phases help to process the requirement specifications in NL which are easy to understand by both the user and the machine [13]. I. Introduction The recent trends of the software engineering largely depends on object oriented paradigm that widely use unified The major challenge in software design is the ability to models. Unified Modelling Language(UML) is commonly comprehend tedious, long-drawn-out user requirements as used for modelling the user software requirements, documents outlined by the clients. Software analysis if done precisely the software assets, development and redevelopment of saves a lot of time of the system analyst and the software software [1]. Our research work proposes a methodology design phase can be started right away. In the field of which is used to extract object oriented elements from NL information technology, there have been innumerable changes processed SRS. Object oriented analysis applies the object in the way this problem has been tackled. Though there are oriented paradigm to model software information systems by many traditional approaches which aim at recognising the defining classes, objects and relationships between them. functionalities of the information system, the modern The UML model is an important component for Object object-oriented approach based on Natural Language Oriented analysis and design.The existing tools such as ReBuilder [2], CM-Builder [3] , GOOAL [4], NL-OOML [5], Processing has garnered maximum popularity because of its UML-Generator [6] generates the UML class diagram strong role in object oriented modeling. automatically from the natural languages.The problem with The natural language processing is a research area in which these tools are they generate the object oriented models with many researchers proposed several methods for analyzing the lower accuracy due to the informal nature of NL and its natural language (NL) requirements [14],[15]. Nan Zhou and ambiguity [7],[8]. Xiaohua Zhou (2004) proposed a methodology to generate Dynamic Publishers, Inc., USA 294 Mohanan et al. This paper is focused on reducing the complexity in designing tools on it with just two lines of code. It is designed to be the object oriented models from the user requirements. User highly flexible and extensible. With a single option you can specifies their requirements in natural languages such as change which tools should be enabled and which should be English. Using OpenNLP, user requirement statements are disabled. Stanford CoreNLP integrates many of Stanford’s processed first. It tokenizes the input and then syntatically and NLP tools [27], including the part-of-speech (POS) semantically processes the input. Since the natural language tagger, the named entity recognizer (NER), the parser, the processing is such a difficult task, our research work is coreference resolution system, sentiment analysis, and the divided into two phases to provide to efficient work. The bootstrapped pattern learning tools. Its analyses provide the initial phase is OpenNLP tool process and second phase is foundational building blocks for higher-level and Semantic Business Vocabulary and Rules (SBVR) process domain-specific text understanding applications. Stanford [9]. CoreNLP Provides: II Related works 1)An integrated toolkit with a good range of grammatical analysis tools 2)Fast, reliable analysis of arbitrary texts The natural language processing is a research area, 3)The overall highest quality text analytics researchers proposed several methods for analyzing the NL 4)Support for a number of major (human) languages) requirements [14],[15]. Some researchers focused on class 5)Interfaces available for various major modern diagram extraction from the NL requirements. This section programming languages describes the survey of some methods that uses the NLP or domain ontology for NL requirement analyst and generates Stanford CoreNLP is very effective to quickly and painlessly the class diagram. get linguistic annotations for a text.It supports to hide Nan Zhou and Xiaohua Zhou (2004) proposed an automated variations across components behind a common API and to system to generate the class diagram from the user have a minimal conceptual footprint, so the system is easy to requirement document. This approach used a natural language learn.It also provides a lightweight framework, using plain processing to analyze the written requirements and domain Java objects (rather than something of heavier weight, such as based ontology is used to improve the class identification XML or UIMA’s [28] Common Analysis System (CAS) performance. The author used a linguistic pattern to objects). differentiate the class and attributes, numeric pattern to analyze the relationship and parallel structure pattern to found more classes and its attributes. The final output is filtered by III Proposed Methodology the domain ontology. Ambriola and Gervasi (2006) designed a framework to Aiming to give a suitable support for software developers as develop the models such as Entity Relationship diagram, Data well as software engineers we have proposed a neoteric Flow Diagram, even UML diagrams. The system take the user approach for natural language processing and object oriented requirement written in natural language as input and applied modeling. This work is focused on natural language the CICO parser for parsing and information extraction. Priyanka More and Rashmi Phalnikar (2012) proposed an processing and then to extract useful object oriented elements. approach to design the UML diagrams from the informal Software requirements are usally written in the natural natural language requirements. Requirement analysis to language or speech language which is asymmetric and provide Instant Diagrams (RAPID) is a novel tool used in this irregular. In our work the open natural language processing approach. An openNLP tool is applied for parsing and the (NLP) analyzes the user requirements and provides the parts RAPID performs the RACE stemming process to improve the of speech (Pos) tagging. The SBVR process implemented efficiency by reducing the redundancy. A Domain Ontology here is used to extract object oriented elements like classes, in RAPID tool improves the performance of concept objects, attributes, relationships etc from the NL processed identification. Object oriented items like classes are identified SRS. The SBVR process includes SBVR vocabulary by the class extraction engine. extraction and rule generation. This can be further refined to Deeptimahanti and Babar (2009) developed a UML Model form Unified Models which depict the major functionalities of Generator from Analysis of Requirements (UMGAR) a a software system. domain independent tool for designing the UML models with proper relationship. A simple requirement is generated from SBVR SBVR Vocabulary the complex user requirement by the syntactic reconstruction

View Full Text

Details

  • File Type
    pdf
  • Upload Time
    -
  • Content Languages
    English
  • Upload User
    Anonymous/Not logged-in
  • File Pages
    8 Page
  • File Size
    -

Download

Channel Download Status
Express Download Enable

Copyright

We respect the copyrights and intellectual property rights of all users. All uploaded documents are either original works of the uploader or authorized works of the rightful owners.

  • Not to be reproduced or distributed without explicit permission.
  • Not used for commercial purposes outside of approved use cases.
  • Not used to infringe on the rights of the original creators.
  • If you believe any content infringes your copyright, please contact us immediately.

Support

For help with questions, suggestions, or problems, please contact us