
DNAD, a Simple Tool for Automatic Differentiation of Fortran Codes Using Dual Numbers

Wenbin Yu a and Maxwell Blair b

a Utah State University, Logan, Utah 84322-4130
b Air Force Research Laboratory, Wright-Patterson AFB, Ohio 45433-7542

Abstract

DNAD (dual number automatic differentiation) is a simple, general-purpose tool to automatically differentiate Fortran codes written in modern Fortran (F90/95/2003) or legacy codes written in previous versions of the Fortran language. It implements the forward mode of automatic differentiation using the arithmetic of dual numbers and the operator overloading feature of F90/95/2003. Only minimal changes to the source code are needed to compute the first derivatives of Fortran programs. The advantages of DNAD in comparison to other existing similar computer codes are its programming simplicity, extensibility, and computational efficiency. Specifically, DNAD is more accurate and efficient than the popular complex-step approximation. Several examples are used to demonstrate its applications and advantages.

PROGRAM SUMMARY

Program Title: DNAD
Journal Reference:
Catalogue identifier:
Licensing provisions: none
Programming language: F90/95/2003
Computer: all computers with a modern Fortran compiler
Operating system: all platforms with a modern Fortran compiler
Keywords: Automatic differentiation, Fortran, Sensitivity analysis
Classification: 4.12, 6.2
Nature of problem: Derivatives of outputs with respect to inputs of a Fortran code are often needed in physics, chemistry, and engineering. The author of the analysis code may no longer be available, and the user may not have deep knowledge of the code. Thus a simple tool is necessary to automatically differentiate the code through only minimal changes to the source code. This can be achieved using dual number arithmetic and operator overloading.
Solution method: A new data type is defined with the first scalar component holding the function value and the second array component holding the first derivatives. All the basic operations and functions are overloaded with new definitions according to dual number arithmetic. To differentiate an existing code, all real numbers should be replaced with this new data type, and the input/output of the code should also be modified accordingly.
Restrictions: None imposed by the program.
Unusual features: None.
Running time: For each additional independent variable, DNAD takes less time than the running time of the original analysis code. However, the actual running time depends on the compiler, the computer, and the operations involved in the code to be differentiated.

Email address: [email protected] (Wenbin Yu).
URL: http://www.mae.usu.edu/faculty/wenbin/ (Wenbin Yu).

Preprint submitted to Computer Physics Communications, 5 February 2013

1 Introduction

In this age, most problems in physics, chemistry, and engineering are solved using computer codes. It is a frequent occurrence that we need to compute the derivatives of the outputs of the computer codes with respect to the input parameters. A very typical example is the gradient-based optimization commonly used in engineering design. Gradient-based methods use not only the value of the objective and constraint functions but also their derivatives. The success of these methods hinges on accurate and efficient evaluation of the derivatives, commonly called sensitivity analysis in the literature of engineering design. Sensitivity analysis is usually the most costly step in the optimization cycle, and the optimization process often fails due to inaccurate derivative calculations [1].

There are several methods proposed for evaluating derivatives [2] of outputs of a computer code with respect to its inputs. Traditionally, finite differences are used to evaluate the derivatives if the source codes are not accessible. Basically, one perturbs each input and evaluates the difference of the outputs. The “step-size dilemma” forces a choice between a step size small enough to minimize the truncation error and a step size large enough to avoid significant subtractive cancelation errors [3].

If the source codes are accessible, then one can use so-called automatic differentiation (AD) [2] or complex-step approximation [4] to provide a more accurate and robust evaluation of the derivatives. AD, also known as computational differentiation or algorithmic differentiation, is a well-established method based

on a systematic application of the chain rule of differentiation to each operation in the program flow [2], which can be carried out in a forward mode or a reverse mode. The forward mode can be easily implemented by a nonstandard interpretation of the program in which the real numbers are replaced by so-called dual numbers, the details of which are given in the next section. As far as programming is concerned, the forward mode is usually implemented using two strategies: Source Code Transformation (SCT) and Operator Overloading (OO) [5]. A complete list of SCT or OO tools for automatic differentiation of Fortran codes can be found at www.autodiff.org. Using SCT, more source code is generated automatically based on the original source for evaluating the derivatives. Although SCT can be implemented for all programming languages and it is easier to obtain an optimized compilation of the code, the original source code is greatly enlarged, which makes it difficult to debug the extended code. Furthermore, it is very difficult to develop an AD tool for automatic generation of the additional source code [6]. There are more than a dozen tools that use SCT for Fortran codes, with ADIFOR [7] being the best known. OO is a more elegant approach if the source code is written in a language supporting it. In OO, one defines a new structure containing dual numbers [5] and overloads elementary mathematical operations for this new data structure. The main change to the source codes is re-declaring real numbers with the new data structure. It is noted that OO is not only applicable to source codes written in a supporting language such as F90/95/2003 and C++ but also to codes written in a language compatible with a supporting language, for example, codes written in F77. One just needs to compile the operator-overloaded source codes using an F90/95/2003 compiler.
There are several automatic differentiation tools for Fortran codes using OO, including HSL_AD02 [8], AUTO_DERIV [9,10], and ADF95 [11]. There are certain limitations with these tools. For example, HSL_AD02 does not support the array features of F90/95/2003. AUTO_DERIV [9] was very slow, and even the most recent release [10] is still about ten times slower than analytic derivative calculation. Analytic derivative calculation means that a Fortran code containing the analytic expressions for the derivatives is used for computation. ADF95, although much faster than AUTO_DERIV for the cases tested in [11], is about six times slower than analytic derivative calculation. Another limitation of these OO AD tools is that the independent and dependent variables are specified inside the code, which implies that the end user needs to modify the code and recompile the differentiated version of the analysis code whenever the derivatives of different outputs are needed or the independent variables change. This requires the end user to know the meaning of variables inside the code and to know how to compile the differentiated codes. Furthermore, the fact that the developer of the differentiated version of the analysis code is oftentimes not the end user who is going to use the code discourages the use of such AD tools. For this very reason, it is better to develop AD tools that minimize and simplify the changes the end user must make to the source codes of the analysis codes.

The complex-step approximation method, publicized by Martins [1], was originally proposed as a better alternative to finite differences, as it avoids the step-size dependency and the subtractive cancelation errors inherent in finite difference techniques. An extremely small imaginary step can be assigned to get a second-order approximation of the derivatives. Later, the complex-step approximation was found to be similar to automatic differentiation [6], and it can be considered an approximate implementation of automatic differentiation using the existing complex data type in some computer languages. Because of its clear advantages over the finite difference method, its easy implementation, and the fact that the end use of the differentiated version of the code is independent of the source codes, the complex-step approximation method is popular in the aerospace design community. However, some issues of complex-step approximation can only be resolved using automatic differentiation [6].

SCOOT [12] is an experimental variant of OO that is specialized to support differentiation of vectors and derivatives of geometric variables in C++. With SCOOT, any number of derivatives is computed in parallel with machine accuracy. The method has been validated on the derivatives of geometric variables in a linear aerodynamic application.

We have developed a simple general-purpose module, namely dual number automatic differentiation (DNAD), using the arithmetic of dual numbers and the OO feature of F90/95/2003 for automatic differentiation of Fortran codes. In comparison to existing Fortran AD tools using OO, DNAD only requires the end user to specify the number of independent variables. It is simple and can be easily extended as needed. For the examples we have tested, it is much more efficient than other tools. In comparison to the complex-step approximation method popular in the aerospace design community, DNAD is more efficient and accurate, and it avoids the pitfalls associated with the complex-step approximation method as pointed out in [6].

2 Dual Number Arithmetic

Dual number arithmetic can be readily illustrated through an example. Consider the derivative of the function f(x1, x2) = x1 x2 + sin(x1). The variables x1 and x2 are independent variables. We wish to use dual number arithmetic to compute the value for the output y = f(x1, x2) and the associated derivatives ∂f/∂x1 and ∂f/∂x2. A dual number is formed by two real numbers such that

< x1, x1′ > = x1 + x1′ d1,    < x2, x2′ > = x2 + x2′ d2,    (1)

where x1′ and x2′ are real numbers and d1 and d2 are two symbols, which are analogous to the imaginary unit i in complex arithmetic. We let all powers of d1 and d2 higher than one and their products equal zero in dual numbers, as opposed to i² = −1 in complex numbers. Substituting these new numbers into the first term of the function, we have

(x1 + x1′ d1)(x2 + x2′ d2) = x1 x2 + x1 x2′ d2 + x2 x1′ d1 + x1′ x2′ d1 d2
                           = x1 x2 + x1 x2′ d2 + x2 x1′ d1.    (2)

Note that here x1′ x2′ d1 d2 is dropped in the final expression of the above equation, not because x1′ and x2′ are small but because d1 and d2 are nilpotent symbols, and thus d1 d2 = 0.

We can also evaluate the dual version of sin(x1) as follows:

sin(x1 + x1′ d1) = sin(x1) + cos(x1) x1′ d1 − (sin(x1)/2)(x1′ d1)² + ...
                 = sin(x1) + cos(x1) x1′ d1.    (3)

Again, all the terms after cos(x1) x1′ d1 are dropped, not because x1′ is small but because d1 is a nilpotent symbol, and thus d1² = d1³ = ... = 0.

The dual version of the function y = f(x1, x2) can be expressed as

< y, y′ > = x1 x2 + x1 x2′ d2 + x2 x1′ d1 + sin(x1) + cos(x1) x1′ d1
          = x1 x2 + sin(x1) + [x2 + cos(x1)] x1′ d1 + x1 x2′ d2    (4)
          = < f(x1, x2), (∂f/∂x1) x1′ d1 + (∂f/∂x2) x2′ d2 >.

Hence the derivative ∂f/∂x1 can be obtained by letting x1′ = 1, x2′ = 0, and ∂f/∂x2 can be obtained by letting x1′ = 0, x2′ = 1. If x1, x2 are independent variables, x1′, x2′ are commonly called seeds, and they can be arbitrary. However, we choose them to be 1 so that the outputs will be the derivatives themselves. This implies that we calculate derivatives with respect to each independent input separately; in other words, the mathematical process is repeated once for every independent variable. Nevertheless, it is possible to address multiple independent variables simultaneously: we just need to replace the second component of the dual number with an array whose size equals the number of independent variables to hold the corresponding derivatives, as will be shown in the next section.
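The seeding scheme above can be sketched numerically. The following Python snippet is an illustrative sketch only, not part of DNAD (the class Dual and the helper dsin are our own names): it carries an array of derivatives alongside each value, so both partials of f(x1, x2) = x1 x2 + sin(x1) come out of a single evaluation.

```python
import math

class Dual:
    """A value together with its derivatives w.r.t. each independent variable."""
    def __init__(self, x, xp):
        self.x = x          # function value
        self.xp = list(xp)  # derivative array (one entry per independent variable)

    def __add__(self, other):
        # <u,u'> + <v,v'> = <u+v, u'+v'>
        return Dual(self.x + other.x,
                    [a + b for a, b in zip(self.xp, other.xp)])

    def __mul__(self, other):
        # <u,u'> * <v,v'> = <uv, u'v + uv'>
        return Dual(self.x * other.x,
                    [a * other.x + self.x * b for a, b in zip(self.xp, other.xp)])

def dsin(u):
    # sin(<u,u'>) = <sin u, cos(u) u'>
    return Dual(math.sin(u.x), [math.cos(u.x) * a for a in u.xp])

# f(x1, x2) = x1*x2 + sin(x1) at (2, 3); both seeds are carried at once
x1 = Dual(2.0, [1.0, 0.0])
x2 = Dual(3.0, [0.0, 1.0])
y = x1 * x2 + dsin(x1)
print(y.x, y.xp)   # value 6.9093..., derivatives [x2 + cos(x1), x1] = [2.5839..., 2.0]
```

One forward pass yields all NDV_AD derivatives because every overloaded operation updates the whole derivative array, exactly the strategy DNAD uses with its xp_ad_ component.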

The dual number arithmetic for a general function is as follows:

g(< u, u′ >, < v, v′ >) = < g(u, v), g,u u′ + g,v v′ >    (5)

with g,u = ∂g/∂u and g,v = ∂g/∂v. With this general formula, we can easily derive the following:

< u, u′ > + < v, v′ > = < u + v, u′ + v′ >    (6)
< u, u′ > − < v, v′ > = < u − v, u′ − v′ >    (7)
< u, u′ > × < v, v′ > = < uv, u′v + uv′ >    (8)
< u, u′ > / < v, v′ > = < u/v, (u′v − uv′)/v² >.    (9)

Addition and subtraction are the same as in complex arithmetic. However, multiplication and division of complex numbers involve more calculations, as disclosed by the following formulas:

(u + u′i) × (v + v′i) = (uv − u′v′) + i(u′v + uv′)    (10)
(u + u′i) / (v + v′i) = (uv + u′v′)/(v² + v′²) + i(u′v − uv′)/(v² + v′²).    (11)

Clearly, complex multiplication, Eq. (10), has one more multiplication operation u′v′ and one more subtraction uv − u′v′ than multiplication using dual number arithmetic in Eq. (8). Complex division, Eq. (11), involves the additional multiplications u′v′ and v′² and the additional additions uv + u′v′ and v² + v′² compared to division using dual number arithmetic in Eq. (9). Multiplication and division of complex numbers are numerically the same as those of dual numbers only if the seeds u′ and v′ used in the complex arithmetic are extremely small numbers. For more sophisticated functions such as trigonometric functions, exponential functions, logarithmic functions, and others, the calculations of the complex-step approximation method are more involved than with dual numbers. For example, complex arithmetic defines the arcsin function as

arcsin(u + u′i) = −i log(iu − u′ + √(1 + u′² − u² − 2iuu′))    (12)

while dual number arithmetic defines it as

arcsin(< u, u′ >) = < arcsin(u), u′/√(1 − u²) >.    (13)

Certainly, the computation in complex arithmetic is much more involved than that in dual number arithmetic. In fact, for the arcsin function with extremely small imaginary parts, subtractive cancelation errors will occur, as we shall see later. This has been realized in [6], and some of the complex functions were replaced with definitions similar to those of dual number arithmetic. Dual number automatic differentiation (DNAD) has the following advantages compared to complex-step approximation.

• DNAD is more efficient, as the number of calculations with dual numbers involved in computing derivatives is never more than, and mostly less than, that for complex numbers.
• DNAD is more accurate, as complex-step approximation is only valid for extremely small imaginary parts. For some functions and operations, if it is impossible to redefine them as in dual number arithmetic, cancelation and truncation errors might occur.
• DNAD can deal with multiple independent variables at the same time, while complex-step approximation can only compute sensitivities with respect to one variable at a time.

A more serious disadvantage of complex-step approximation is that it is not as simple as replacing all real numbers with complex numbers; overloading (redefining) some intrinsic functions and operators, such as ABS and the conditional operators, is also needed, as the original definitions in complex arithmetic are not valid for computing derivatives. It is hard to know a priori which functions and operations need to be redefined, and hard to identify them, as compilers will not flag such functions and operations because they are perfectly legal complex operations. However, the compiler will flag every undefined operation for DNAD, as the dual number is a data type new to the compiler. The only disadvantage of using dual numbers is that we need to overload all the operations and functions for this new data type, while complex-step approximation only needs to overload some of them in languages, such as Fortran, that have complex arithmetic as an intrinsic data type. However, as long as a general-purpose module or class is written, the effort for differentiating an existing code will be similar. Of course, for computer languages that do not have the complex number as an intrinsic data type, this advantage of complex-step approximation does not exist. Moreover, the development effort of DNAD is not that significant, as shown in the next section.

3 Implementation of Dual Number Automatic Differentiation

To use dual number arithmetic for automatic differentiation, we first need to define a new data type DUAL_NUM as follows:

TYPE, PUBLIC:: DUAL_NUM
   REAL(DBL_AD):: x_ad_
   REAL(DBL_AD):: xp_ad_(NDV_AD)
END TYPE DUAL_NUM

where DBL_AD is a parameter indicating the precision used for the code, NDV_AD is an integer parameter indicating the number of independent variables, x_ad_ is the function value, and xp_ad_ holds the corresponding first

derivatives (the second component of the dual number data type). What one needs to do next is to overload the functions and operations needed for computation, such as relational operators, arithmetic operators, and functions. To define a function of a dual number, say F(< u, u′ >), we just need to assign the function evaluated at the first component of the input dual number, F(u), to the first component of the output dual number, and the derivative of the function multiplied by the second component of the input dual number, (∂F(u)/∂u) u′, to the second component of the output dual number. For example, one can overload the sine function as follows:

INTERFACE SIN
   MODULE PROCEDURE SIN_D
END INTERFACE

ELEMENTAL FUNCTION SIN_D(u) RESULT(res)
   TYPE (DUAL_NUM), INTENT(IN):: u
   TYPE (DUAL_NUM):: res
   REAL (DBL_AD):: tmp

   res%x_ad_ = SIN(u%x_ad_)
   tmp = COS(u%x_ad_)
   res%xp_ad_ = u%xp_ad_*tmp
END FUNCTION SIN_D

All the other functions can be defined similarly. Note that, if there are multiple independent variables, the additional effort for computing the derivatives is just a simple multiplication. The current version of DNAD has all the common relational operators, arithmetic operators, and functions defined. If it happens that some functions or operations are not defined, DNAD can be easily extended to include such definitions. For example, in the process of differentiating an F77 code for turbulence modeling, it was found that x**y with both x and y as real numbers could not be overloaded by the original release of DNAD [13,14], as raising a dual number to a dual-number power was not defined; but we can easily insert the following code segment into the DNAD module for its definition:

INTERFACE OPERATOR(**)
   MODULE PROCEDURE POW_D
END INTERFACE

ELEMENTAL FUNCTION POW_D(u,v) RESULT(res)
   TYPE (DUAL_NUM), INTENT(IN):: u
   TYPE (DUAL_NUM), INTENT(IN):: v
   REAL(DBL_AD):: uf, vf
   TYPE (DUAL_NUM):: res

   uf = u%x_ad_
   vf = v%x_ad_
   res%x_ad_ = uf**vf
   res%xp_ad_ = res%x_ad_*(vf/uf*u%xp_ad_+LOG(uf)*v%xp_ad_)
END FUNCTION POW_D

as we know that < u, u′ >^< v, v′ > = < u^v, u^v ((v/u) u′ + log(u) v′) >.
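As a quick sanity check of this rule, the following Python sketch (not part of DNAD; dual_pow is our own illustrative helper) applies it to x**x at x = 2, where both the base and the exponent carry the seed 1, and reproduces the analytic derivative x**x (log x + 1):

```python
import math

def dual_pow(u, up, v, vp):
    # dual power rule: <u,u'>**<v,v'> = <u**v, u**v*((v/u)*u' + log(u)*v')>
    val = u ** v
    return val, val * (v / u * up + math.log(u) * vp)

# d/dx of x**x at x = 2: seed both components with 1
val, der = dual_pow(2.0, 1.0, 2.0, 1.0)
print(val, der)   # 4.0 and 4*(log(2)+1) = 6.7726...
```

The same rule covers the real**dual and dual**real special cases by seeding the constant argument with 0.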

4 How to Use DNAD

To use DNAD to automatically differentiate an existing Fortran code, one needs to carry out the following steps.

• Specify the number of independent variables. For example, if there are two independent variables, we need to set INTEGER(2), PUBLIC, PARAMETER:: NDV_AD=2.
• Replace all the declarations of real numbers in the existing code with declarations of dual numbers. If a variable is also initialized along with the declaration, the initialization should be changed correspondingly. For example, REAL(8), PARAMETER:: ONE=1.0D0 should be changed to TYPE(DUAL_NUM), PARAMETER:: ONE=DUAL_NUM(1.0D0,(/0.0D0,0.0D0/)).
• Insert USE Dual_Num_Auto_Diff right after MODULE, FUNCTION, or SUBROUTINE statements.
• Change I/O commands correspondingly. If the code uses free-format READ and WRITE, no changes are needed. The developer of the differentiated code only needs to instruct the end user to insert 1 after the real number representing the independent variable and 0 after all other real numbers in the inputs. In the outputs, the number right after the function value indicates the derivative of this function with respect to the given independent design variable. The count of numbers inserted after each real number in the inputs, and of derivatives in the outputs, is NDV_AD.
• Recompile all the source codes and link with the DNAD module to generate the executable.

5 Examples

In this section, we are going to use a few examples to demonstrate the appli- cations and advantages of DNAD.

The first example is to differentiate the following simple Fortran code for computing the area of a circle.

PROGRAM CircleArea
   REAL(8):: PI=4.0D0*ATAN(1.0D0)
   REAL(8):: radius, area

   READ(*,*) radius
   area = PI*radius**2
   WRITE(*,*) "AREA=", area
END PROGRAM CircleArea

The differentiated version using DNAD is as follows:

PROGRAM CircleArea
   USE Dual_Num_Auto_Diff
   TYPE (DUAL_NUM):: PI=DUAL_NUM(4.0D0*ATAN(1.0D0),(/0.0D0/))
   TYPE (DUAL_NUM):: radius, area

   READ(*,*) radius
   area = PI*radius**2
   WRITE(*,*) "AREA=", area
END PROGRAM CircleArea

where all the changes happen on the second, third, and fourth lines. Change NDV_AD to 1, recompile the code, and link with the DNAD module. One also needs to make a simple change to the inputs: add one more real number after any real number as its seed. For example, for the above program, to calculate the area of a circle with radius 5.0 and the derivative of the area with respect to the radius, we just need to input 5.0, 1.0. The output will be AREA= 78.5398163397448 31.4159265358979, where the first number is the function value and the second is the corresponding derivative.
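The same seed propagation can be mimicked in a few lines of Python (a sketch only; circle_area is a hypothetical stand-in for the DNAD-differentiated program, not part of the paper's code):

```python
import math

def circle_area(r, r_seed):
    pi = 4.0 * math.atan(1.0)
    # A = pi*r**2, so the dual product rule gives dA = pi * 2*r * dr
    return pi * r ** 2, pi * 2.0 * r * r_seed

# input "5.0, 1.0" as in the example: value 5.0 with seed 1.0
area, darea = circle_area(5.0, 1.0)
print(area, darea)   # 78.53981633974483  31.41592653589793
```

The seed 1.0 attached to the radius turns the second output into dA/dr = 2*pi*r, matching the Fortran output above.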

The second example is to evaluate the derivative of arcsin(x) at x=0.5 to demonstrate the loss of accuracy of complex-step approximation.

PROGRAM TestComplex
   USE complexify
   USE Dual_Num_Auto_Diff
   REAL(8):: x
   COMPLEX(8):: xc
   TYPE(DUAL_NUM):: xd

   x = 0.5D0
   WRITE(*,*) ASIN(x), 1.0D0/SQRT(1.0D0-x*x)

   xd = DUAL_NUM(0.5D0,(/1.0D0/))
   WRITE(*,*) ASIN(xd)

   xc = (0.5D0,1.0D-20)
   WRITE(*,*) ASIN(xc)
END PROGRAM

where complexify is the module for computing sensitivities using the complex-step approximation, downloaded directly from Prof. Martins' website [15]. The code is compiled using gfortran 4.7.0 with the -O3 -static options. The outputs are

0.52359877559829893  1.1547005383792517
0.52359877559829893  1.1547005383792517
(0.52359879016876221, 1.1547005142714115E-020)

Clearly, DNAD computes the derivative identically to the analytical expression up to machine precision, while the result of the complex-step approximation is accurate only up to the seventh digit after the decimal point. The derivative from the complex-step approximation is the imaginary part of the result divided by the small imaginary part of the independent variable (1.0D-20 in this case).
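The cancelation can be reproduced outside Fortran. The Python sketch below (our own illustration, using the naive complex formula of Eq. (12) rather than the patched routines in complexify) shows that with an imaginary step of 1.0e-20 the −u′ term inside the complex logarithm is entirely absorbed by rounding, destroying the derivative estimate, while the dual rule of Eq. (13) reduces to the exact analytic expression:

```python
import cmath
import math

u, h = 0.5, 1.0e-20
exact = 1.0 / math.sqrt(1.0 - u * u)      # analytic d/du arcsin(u)

# dual rule, Eq. (13), with seed u' = 1: exact by construction
dual_der = 1.0 / math.sqrt(1.0 - u * u)

# naive complex arcsin, Eq. (12), applied at z = u + i*h
z = complex(u, h)
asin_z = -1j * cmath.log(1j * z + cmath.sqrt(1.0 - z * z))
cs_der = asin_z.imag / h                  # imaginary part / step

print(dual_der, cs_der)
```

With this naive formula the complex-step estimate is far from the true value; library implementations of complex arcsin do better, but as the Fortran output shows, they still lose accuracy around the seventh digit.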

The third example is used to compare the efficiency of different AD tools.

PROGRAM TestSpeed
   IMPLICIT NONE
   INTEGER:: i
   REAL(8):: x, y, z, ftot, f
   INTEGER:: nloops=50000000
   REAL:: start_time, end_time

   CALL CPU_TIME(start_time)

   x=1.0D0; y=2.0D0; z=3.0D0
   ftot=0.0D0

   DO i=1,nloops
      f = x*y - x*SIN(y)*LOG(i*z)
      ftot = (ftot - f)/EXP(z)
   ENDDO

   CALL CPU_TIME(end_time)

   WRITE(*,*) ftot, end_time-start_time, "seconds"
END PROGRAM

Table 1
Running time of the sample code differentiated by different AD tools (s)

AD Tools      1 variable (x)   2 variables (x, y)   3 variables (x, y, z)
DNAD               1.732             1.747                 1.966
Complex            2.901             5.772                 8.954
AUTO_DERIV         9.968            12.854                15.226
ADF95              9.750            15.475                15.553
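The derivative that DNAD produces for this loop can be cross-checked in any language. Below is a Python sketch (our own construction, not part of the benchmark; ftot_and_dx is a hypothetical helper) that hand-propagates the dual component for x through the loop body and compares it against a central finite difference:

```python
import math

def ftot_and_dx(x, y=2.0, z=3.0, nloops=1000):
    """Loop body of TestSpeed with the derivative w.r.t. x carried along."""
    ftot, dftot = 0.0, 0.0
    for i in range(1, nloops + 1):
        f = x * y - x * math.sin(y) * math.log(i * z)
        df = y - math.sin(y) * math.log(i * z)   # d f / d x
        ftot = (ftot - f) / math.exp(z)
        dftot = (dftot - df) / math.exp(z)       # same recurrence, differentiated
    return ftot, dftot

val, der = ftot_and_dx(1.0)
h = 1.0e-6
fd = (ftot_and_dx(1.0 + h)[0] - ftot_and_dx(1.0 - h)[0]) / (2.0 * h)
print(der, fd)   # the two derivative estimates agree closely
```

Because f is linear in x here, the finite difference is essentially exact, so the agreement confirms the propagated derivative; DNAD performs exactly this extra per-iteration work inside its overloaded operators.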

The computing times (in seconds) for the code differentiated by complex-step approximation [15], DNAD, AUTO_DERIV [10], and ADF95 [11] are reported in Table 1. The tests are performed on a Dell Latitude laptop with a 2.70 GHz Intel Core i7-2620 CPU running the Windows 7 operating system. The code is compiled with gfortran 4.7.0 with the options -O3 -static. To allow maximum optimization, the automatic differentiation module and the differentiated code are put into the same file for compiling. What is surprising is that the original, undifferentiated code runs for 1.981 seconds, which seems impossible, as the DNAD-differentiated codes run even faster; theoretically, the differentiated codes require more calculations than the original code. After consulting the gfortran developers, this mystery can be explained by the fact that the compiler optimizes the differentiated codes in a different and more efficient way because DUAL_NUM is a new data type. We can clearly observe that DNAD is several times more efficient than the other AD tools for this example. It is noted that DNAD, AUTO_DERIV, and ADF95 provide the same results up to double precision, while complex-step approximation is slightly inaccurate. For unknown reasons, HSL_AD02 [8] cannot provide correct results for the function value and the derivatives of this code; hence, its running time is not reported. It is also pointed out that, although ADF95 was shown to be more efficient than AUTO_DERIV for the examples tested in [11], the performance of both codes is similar for this example, and AUTO_DERIV is even faster than ADF95 for this problem with two independent variables. It is not clear to us why there is a large jump between one variable and two variables for ADF95. We tested the differentiated code on several different computers, and the same phenomenon was observed.
This reveals that the relative efficiency of automatic differentiation tools is not only computer and compiler dependent but also problem dependent. Interested readers are encouraged to test the AD tools themselves to find the best one for differentiating their own codes. The reason for this significant advantage of DNAD over other tools is its simple data and code structures, which enable gfortran to compile more optimized differentiated codes.

To test the performance of the AD tools for computing derivatives with respect

to a relatively large number of independent variables, we slightly modified the previous sample code used for obtaining the results in Table 1 by changing all the input variables and function variables x, y, z, ftot, f to arrays of size 10, and we also reduced nloops by a factor of ten so that the original code runs for 1.977 seconds, similar to the previous code. The code used for this test is entitled TestSpeed2 and is shown below:

PROGRAM TestSpeed2
   IMPLICIT NONE
   INTEGER, PARAMETER:: N=10
   INTEGER:: i, j
   REAL(8):: x(N), y(N), z(N), ftot(N), f(N)
   INTEGER:: nloops=5000000
   REAL:: start_time, end_time

   CALL CPU_TIME(start_time)

   x=1.0D0; y=2.0D0; z=3.0D0
   ftot=0.0D0

   DO j=1,N
      DO i=1,nloops
         f(j) = x(j)*y(j) - x(j)*SIN(y(j))*LOG(i*z(j))
         ftot(j) = (ftot(j) - f(j))/EXP(z(j))
      ENDDO
   ENDDO

   CALL CPU_TIME(end_time)

   WRITE(*,*) end_time-start_time, "seconds"
   DO i=1,N
      WRITE(*,*) ftot(i)
   ENDDO
END PROGRAM TestSpeed2

This code can be correctly differentiated using DNAD, ADF95 [11], and complex-step approximation [15]; their running times with respect to different numbers of independent variables are plotted in Fig. 1. Note that 3 variables implies we use x(1), y(1), z(1); 6 variables implies we use x(1), y(1), z(1), x(2), y(2), z(2); and so on. We can observe from the plot that the running time of complex-step approximation is linearly proportional to the number of independent variables, and that this approach is less efficient in comparison to both DNAD and ADF95, particularly when there is a large number of independent variables. DNAD is much more efficient than both complex-step approximation and ADF95 when the number of independent variables is small, which confirms what we have observed in Table 1. Even when the number of independent variables becomes relatively large, DNAD only takes about 60% of the time ADF95 takes. Neither HSL_AD02 [8] nor AUTO_DERIV [10] can correctly differentiate this code; hence, their running times are not available.

[Fig. 1. Running time of a sample code differentiated by DNAD and ADF95 with respect to number of independent variables.]

DNAD has been used to differentiate several engineering analysis codes, including GEBT [16], a general-purpose nonlinear composite beam analysis code; VABS [17], an internationally known cross-sectional analysis code for modeling composite blades; VAMUCH [18], a general-purpose, multiphysics micromechanics code; JET [13,14], a computational fluid dynamics code accompanying a popular turbulence modeling textbook; and ZEUS [19], an industrial CFD-based aerodynamic solver. These codes are used in real-world settings, and they contain thousands of lines of F77/90/95/2003 code along with several supporting libraries, including BLAS, LAPACK, ARPACK, and the HSL libraries. Very little effort, usually a couple of hours, is needed to differentiate these codes. Very satisfactory results have been obtained as far as efficiency and accuracy are concerned.

6 Conclusion

Exploiting the operator overloading feature of F90/95/2003, we have developed DNAD as a simple, general-purpose automatic differentiation tool for Fortran

codes. It is based on dual number arithmetic and achieves the same accuracy as analytic differentiation. In comparison to existing Fortran AD tools, DNAD minimizes the changes to the source codes and is more efficient than existing tools. In comparison to the complex-step approximation method, DNAD is more efficient and accurate, and does not require more changes to existing source codes. DNAD has been used to differentiate several analysis codes used in the real world, and very satisfactory results have been obtained.

7 Acknowledgements

The development of DNAD is supported, in part, by the Chief Scientist In- novative Research Fund at AFRL/RB WPAFB. The views and conclusions contained herein are those of the authors and should not be interpreted as necessarily representing the official policies or endorsement, either expressed or implied, of the funding agency.

References

[1] J. R. R. A. Martins. A Coupled-Adjoint Method for High-Fidelity Aero- Structural Optimization. PhD thesis, Aerospace Engineering, Stanford University, October 2002.

[2] A. Griewank. Evaluating Derivatives: Principles and Techniques of Algorithmic Differentiation. SIAM, Philadelphia, 2000.

[3] J. R. R. A. Martins, P. Sturdza, and J. J. Alonso. The connection between the complex-step derivative approximation and algorithmic differentiation. In Proceedings of the 39th Aerospace Sciences Meeting, Reno, NV, Jan. 2001. AIAA Paper 2001-0921.

[4] J. R. R. A. Martins, I. M. Kroo, and J. J. Alonso. An automated method for sensitivity analysis using complex variables. In Proceedings of the 38th Aerospace Sciences Meeting, Reno, NV, Jan. 2000. AIAA Paper 2000-0689.

[5] Automatic differentiation. Technical Report http://en.wikipedia.org/ wiki/Automatic_differentiation, Wikipedia, 2009.

[6] J. R. R. A. Martins, P. Sturdza, and J. J. Alonso. The complex-step derivative approximation. ACM Transactions on Mathematical Software, 29:245–262, 2003.

[7] C. Bischof, A. Carle, P. Khademi, and A. Mauer. ADIFOR 2.0: Automatic differentiation of Fortran 77 programs. IEEE Computational Science & Engineering, 3:18–32, 1996.

[8] J. D. Pryce and J. K. Reid. AD01, a Fortran 90 code for automatic differentiation. Technical Report ftp://matisa.cc.rl.ac.uk/pub/reports/prRAL98057.ps.gz, Rutherford Appleton Laboratory, 1998.

[9] S. Stamatiadis, R. Prosmiti, and S. C. Farantos. auto_deriv: Tool for automatic differentiation of a Fortran code. Computer Physics Communications, 127:343–355, 2000.

[10] S. Stamatiadis and S. C. Farantos. auto_deriv: Tool for automatic differentiation of a Fortran code. Computer Physics Communications, 181:1818–1819, 2010.

[11] C. W. Straka. ADF95: Tool for automatic differentiation of a Fortran code designed for large numbers of independent variables. Computer Physics Communications, 168:123–139, 2005.

[12] M. Blair. SCOOT: sensitivity class with operator overloaded types. In Proceedings of the 51st AIAA/ASME/ASCE/AHS/ASC Structures, Structural Dynamics and Materials Conference, Orlando, Florida, April 2010. AIAA Paper 2010-2918.

[13] R. E. Spall and W. Yu. Imbedded dual-number automatic differentiation for CFD sensitivity analysis. In Proceedings of the Fluids Engineering Division Summer Conference, San Juan, Puerto Rico, July 8-12, 2012.

[14] R. E. Spall and W. Yu. Imbedded dual-number automatic differentiation for CFD sensitivity analysis. Journal of Fluids Engineering, 2012. In press.

[15] J. R. R. A. Martins. Complex-step approximation. Technical Report http://mdolab.engin.umich.edu/sites/default/files/complexify.f90.txt, University of Michigan, 2012.

[16] W. Yu and M. Blair. GEBT: a general-purpose tool for non-linear analysis of composite beams. In Proceedings of the 51st AIAA/ASME/ASCE/AHS/ASC Structures, Structural Dynamics and Materials Conference, Orlando, Florida, April 2010. AIAA Paper 2010-3019.

[17] W. Yu, D. H. Hodges, V. V. Volovoi, and C. E. S. Cesnik. On Timoshenko-like modeling of initially curved and twisted composite beams. International Journal of Solids and Structures, 39(19):5101–5121, 2002.

[18] W. Yu and T. Tang. Variational asymptotic method for unit cell homogenization of periodically heterogeneous materials. International Journal of Solids and Structures, 44:3738–3755, 2007.

[19] Z. Wang, P. C. Chen, D. Sarhaddi, Z. Zhang, and W. Yu. Enabling sensitivity analysis capability for a CFD-based unsteady aerodynamic/aeroelastic solver. In Proceedings of the 53rd Structures, Structural Dynamics, and Materials Conference, Honolulu, Hawaii, April 23-26, 2012.
