A New Class of Oscillatory Radial Basis Functions

Total Page:16

File Type:pdf, Size:1020Kb

A New Class of Oscillatory Radial Basis Functions A NEW CLASS OF OSCILLATORY RADIAL BASIS FUNCTIONS BENGT FORNBERG ∗, ELISABETH LARSSON † , AND GRADY WRIGHT ‡ Abstract. Radial basis functions (RBFs) form a primary tool for multivariate interpolation, and they are also receiving increased attention for solving PDEs on irregular domains. Traditionally, only non-oscillatory radial functions have been considered. We find here that a certain class of oscillatory radial functions (including Gaussians as a special case) leads to non-singular interpolants with in- triguing features especially as they are scaled to become increasingly flat. This flat limit is important in that it generalizes traditional spectral methods to completely general node layouts. Interpolants based on the new radial functions appear immune to many or possibly all cases of divergence that in this limit can arise with other standard types of radial functions (such as multiquadrics and inverse multiquadratics). Key words. Radial basis functions, RBF, multivariate interpolation, Bessel functions. 1. Introduction. A radial basis function (RBF) interpolant of multivariate data (xk,yk), k =1, 2,...,n takes the form n s(x)= λ φ( x x ) . (1.1) k k − kk Xk=1 Here denotes the standard Euclidean vector norm, φ(r) is some radial function, and k·k an underline denotes that quantity to be a vector. The coefficients λk are determined in such a way that s(xk)= yk, k =1, 2,...,n, i.e. as the solution to the linear system λ1 y1 . . A . = . (1.2) λ y n n where the entries of the matrix A are A = φ x x , i = 1,...,n, ,j = i,j i − j 1,...,n. Numerous choices for φ(r) have been used in the past. Table 1 shows a few cases for which existence and uniqueness of the interpolants s(x) have been discussed in the literature; see for ex. [2], [3], [14], and [15]. For many of the radial functions in Table 1, existence and uniqueness are ensured for arbitrary point distributions. However, there are some that require the form of (1.1) to be augmented by some low-order polynomial terms. In the infinitely smooth cases, we have included a shape parameter ε in such a way that ε 0 corresponds to the basis functions becoming flat (as discussed extensively in for→ example [4], [7], [8], [11], [12]). The primary interest in this limit lies in the fact that it reproduces all the classical pseudospectral (PS) methods [6], such as Fourier, Chebyshev, and Legendre, whenever the data point locations are ∗University of Colorado, Department of Applied Mathematics, 526 UCB, Boulder, CO 80309, USA ([email protected]). The work was supported by NSF grants DMS-9810751 (VIGRE) and DMS-0309803. †Uppsala University, Department of Information Technology, Scientific Computing, Box 337, SE- 751 05 Uppsala, Sweden ([email protected]). The work was supported by a grant from The Swedish Research Council. ‡University of Utah, Department of Mathematics, 155 South 1400 East, JWB 233, Salt Lake City, UT 84112-0090, USA ([email protected]). The work was supported by NSF VIGRE grant DMS-0091675. 1 Type of basis function φ(r) Piecewise smooth RBFs Generalized Duchon spline (GDS) r2k log r, k N ∈ r2ν , ν > 0 and ν N 6∈ Wendland (1 r)k p(r), p a polynomial, k N − + ∈ 21 ν Mat´ern − rν K (r), ν > 0 Γ(ν) ν Infinitely smooth RBFs 2 (εr) Gaussian (GA) e− Generalized multiquadric (GMQ) (1 + (εr)2)ν/2, ν = 0 and ν 2N 6 6∈ Multiquadric (MQ) (1 + (εr)2)1/2 • 2 1/2 Inverse multiquadric (IMQ) (1 + (εr) )− • 2 1 Inverse quadratic (IQ) (1 + (εr) )− • Table 1.1 Some commonly used radial basis functions. Note: in all cases, ε> 0. distributed in a corresponding manner. The interpolant (1.1) can therefore be seen as a major generalization of the PS approach, allowing scattered points in arbitrary numbers of dimensions, a much wider functional choice, and a free shape parameter ε that can be optimized. The RBF literature has so far been strongly focused on radial functions φ(r) that are non-oscillatory. We are not aware of any compelling reason for why this needs to be the case. Although we will show that φ(r) oscillatory implies that the interpolation problem can become singular in a sufficiently high dimension, we will also show that this need not be of any concern when the dimension is fixed. The present study focuses on the radial functions J d (εr) 2 1 φd(r)= − , d =1, 2,..., (1.3) d 1 (εr) 2 − where Jα(r) denotes the J Bessel function of order α. For odd values of d, φd(r) can be alternatively expressed by means of regular trigonometric functions: 2 φ (r)= cos(εr) 1 rπ 2 sin(εr) φ (r)= 3 rπ εr 2 sin(εr) εr cos(εr) φ (r)= − 5 rπ (εr)3 . We will later find it useful to note that these φ (r) functions can also be ex- d − pressed in terms of the hypergeometric 0F1 function: 2 d 1 d φd(r)=2 − Γ 2 ψd(r) 2 1 d = 3 d = 5 0.8 d = 10 2 e− r 0.6 0.4 0.2 0 −0.2 −0.4 −8 −6 −4 −2 0 2 4 6 8 J √δr Fig. 1.1. Comparison between 2δδ! δ (2 ) for d = 3, 5, 10 (i.e. δ = 3 , 5 , 5) and the d →∞ (2√δr)δ 2 2 r2 limit e− . where ψ (r)= F ( d , 1 (εr)2). (1.4) d 0 1 2 − 4 In the d limit, the oscillations of φd(r) vanish, and Gaussian (GA) radial functions are→ recovered, ∞ as follows from the relation 2 δ Jδ(2√δr) r lim 2 δ! = e− . (1.5) δ √ δ →∞ (2 δr) Comparing the ratio above with (1.3), we have here written δ in place of d 1 and 2 − chosen ε = 2√δ. Figure 1.1 illustrates (1.5), comparing the curves for d = 3, 5, and 10 with the Gaussian limit. For these radial functions φd(r), we will prove non-singularity for arbitrarily scat- tered data in up to d dimensions (when d > 1). However, numerous other types of radial functions share this property. What makes the present class of Bessel-type basis functions outstanding relates to the flat basis function limit as ε 0. As a consequence of the limit (when it exists) taking the form of an interpolating→ poly- nomial, it connects pseudospectral (PS) methods [6] with RBF interpolants [8]. It was conjectured in [8] and shown in [16] that GA (in contrast to, say, MQ, IMQ, and IQ) will never diverge in this limit, no matter how the data points are located. The results in this study raise the question whether the present class of Bessel-type radial functions might represent the most general class possible of radial functions with this highly desirable feature. The radial functions φd(r) have previously been considered in [17] (where (1.5) and the positive semi-definiteness of the φd(r)-functions were noted), and in an example in [9] (in the different context of frequency optimization). They were also noted very 3 briefly in [8] as appearing immune to a certain type of ε 0 divergence – the main topic of this present study. → 2. Some observations regarding oscillatory radial functions. Expansions in different types of basis functions are ubiquitous in computational mathematics. It is often desirable that such functions are orthogonal to each other with regard to some type of scalar product. A sequence of such basis functions then needs to be increasingly oscillatory, as is the case for example with Fourier and Chebyshev functions. It can be shown that no such fixed set of basis functions can feature guaranteed non-singularity in more than 1-D when the data points are scattered [13]. The RBF approach circumvents this problem by making the basis functions dependent on the data point locations. It uses different translates of one single radially symmetric function, centered at each data point in turn. Numerous generalizations of this approach are possible (such as using different basis functions at the different data point locations, or not requiring that the basis functions be radially symmetric). The first question we raise here is why it has become customary to consider only non-oscillatory radial functions (with a partial exception being GDS φ(r)= r2k log r which changes sign at r = 1). One reason might be the requirements in the primary theorem that guarantees non-singularity for quite a wide class of RBF interpolants [3], [15]: Theorem 2.1. If Φ(r) = φ(√r) is completely monotone but not constant on [0, ), then for any points x in Rd, the matrix A in (1.2) is positive definite. ∞ k The requirement for φ(√r) to be completely monotone is far more restrictive than φ(r) merely being non-oscillatory: Definition 2.2. A function Φ(r) is completely monotone on [0, ), if (i) Φ(r) C[0, ) ∞ ∈ ∞ (ii) Φ(r) C∞(0, ) ∈ k ∞ (iii) ( 1)k d Φ(r) 0 for r> 0 and k =0, 1, 2,... − drk ≥ An additional result that might discourage the use of oscillatory radial functions is the following: Theorem 2.3. If φ(r) C[0, ) with φ(0) > 0 and φ(ρ) < 0 for some ρ > 0, then there is an upper limit∈ on the∞ dimension d for which the interpolation problem is non-singular for all point distributions. Proof. Consider the point distributions shown in Figure 2.1. The first row in the A matrix will have the d + 1 entries − [φ(0), φ(ρ), φ(ρ), φ(ρ),..., φ(ρ)] .
Recommended publications
  • Orthogonal Functions, Radial Basis Functions (RBF) and Cross-Validation (CV)
    Exercise 6: Orthogonal Functions, Radial Basis Functions (RBF) and Cross-Validation (CV) CSElab Computational Science & Engineering Laboratory Outline 1. Goals 2. Theory/ Examples 3. Questions Goals ⚫ Orthonormal Basis Functions ⚫ How to construct an orthonormal basis ? ⚫ Benefit of using an orthonormal basis? ⚫ Radial Basis Functions (RBF) ⚫ Cross Validation (CV) Motivation: Orthonormal Basis Functions ⚫ Computation of interpolation coefficients can be compute intensive ⚫ Adding or removing basis functions normally requires a re-computation of the coefficients ⚫ Overcome by orthonormal basis functions Gram-Schmidt Orthonormalization 푛 ⚫ Given: A set of vectors 푣1, … , 푣푘 ⊂ 푅 푛 ⚫ Goal: Generate a set of vectors 푢1, … , 푢푘 ⊂ 푅 such that 푢1, … , 푢푘 is orthonormal and spans the same subspace as 푣1, … , 푣푘 ⚫ Click here if the animation is not playing Recap: Projection of Vectors ⚫ Notation: Denote dot product as 푎Ԧ ⋅ 푏 = 푎Ԧ, 푏 1 (bilinear form). This implies the norm 푢 = 푎Ԧ, 푎Ԧ 2 ⚫ Define the scalar projection of v onto u as 푃푢 푣 = 푢,푣 | 푢 | 푢,푣 푢 ⚫ Hence, is the part of v pointing in direction of u | 푢 | | 푢 | 푢 푢 and 푣∗ = 푣 − , 푣 is orthogonal to 푢 푢 | 푢 | Gram-Schmidt for Vectors 푛 ⚫ Given: A set of vectors 푣1, … , 푣푘 ⊂ 푅 푣1 푢1 = | 푣1 | 푣2,푢1 푢1 푢2 = 푣2 − = 푣2 − 푣2, 푢1 푢1 as 푢1 normalized 푢1 푢1 푢෤3 = 푣3 − 푣3, 푢1 푢1 What’s missing? Gram-Schmidt for Vectors 푛 ⚫ Given: A set of vectors 푣1, … , 푣푘 ⊂ 푅 푣1 푢1 = | 푣1 | 푣2,푢1 푢1 푢2 = 푣2 − = 푣2 − 푣2, 푢1 푢1 as 푢1 normalized 푢1 푢1 푢෤3 = 푣3 − 푣3, 푢1 푢1 What’s missing? 푢෤3is orthogonal to 푢1, but not yet to 푢2 푢3 = 푢෤3 − 푢෤3, 푢2 푢2 If the animation is not playing, please click here Gram-Schmidt for Functions ⚫ Given: A set of functions 푔1, … , 푔푘 .We want 휙1, … , 휙푘 ⚫ Bridge to Gram-Schmidt for vectors: ∞ ⚫ Define 푔푖, 푔푗 = ׬−∞ 푔푖 푥 푔푗 푥 푑푥 ⚫ What’s the corresponding norm? Gram-Schmidt for Functions ⚫ Given: A set of functions 푔1, … , 푔푘 .We want 휙1, … , 휙푘 ⚫ Bridge to Gram-Schmidt for vectors: ∞ ⚫ 푔 , 푔 = 푔 푥 푔 푥 푑푥 .
    [Show full text]
  • Anatomically Informed Basis Functions — Anatomisch Informierte Basisfunktionen
    Anatomically Informed Basis Functions | Anatomisch Informierte Basisfunktionen Dissertation zur Erlangung des akademischen Grades doctor rerum naturalium (Dr.rer.nat.), genehmigt durch die Fakult¨atf¨urNaturwissenschaften der Otto{von{Guericke{Universit¨atMagdeburg von Diplom-Informatiker Stefan Kiebel geb. am 10. Juni 1968 in Wadern/Saar Gutachter: Prof. Dr. Lutz J¨ancke Prof. Dr. Hermann Hinrichs Prof. Dr. Cornelius Weiller Eingereicht am: 27. Februar 2001 Verteidigung am: 9. August 2001 Abstract In this thesis, a method is presented that incorporates anatomical information into the statistical analysis of functional neuroimaging data. Available anatomical informa- tion is used to explicitly specify spatial components within a functional volume that are assumed to carry evidence of functional activation. After estimating the activity by fitting the same spatial model to each functional volume and projecting the estimates back into voxel-space, one can proceed with a conventional time-series analysis such as statistical parametric mapping (SPM). The anatomical information used in this work comprised the reconstructed grey matter surface, derived from high-resolution T1-weighted magnetic resonance images (MRI). The spatial components specified in the model were of low spatial frequency and confined to the grey matter surface. By explaining the observed activity in terms of these components, one efficiently captures spatially smooth response components induced by underlying neuronal activations lo- calised close to or within the grey matter sheet. Effectively, the method implements a spatially variable anatomically informed deconvolution and consequently the method was named anatomically informed basis functions (AIBF). AIBF can be used for the analysis of any functional imaging modality. In this thesis it was applied to simu- lated and real functional MRI (fMRI) and positron emission tomography (PET) data.
    [Show full text]
  • A Local Radial Basis Function Method for the Numerical Solution of Partial Differential Equations Maggie Elizabeth Chenoweth [email protected]
    Marshall University Marshall Digital Scholar Theses, Dissertations and Capstones 1-1-2012 A Local Radial Basis Function Method for the Numerical Solution of Partial Differential Equations Maggie Elizabeth Chenoweth [email protected] Follow this and additional works at: http://mds.marshall.edu/etd Part of the Numerical Analysis and Computation Commons Recommended Citation Chenoweth, Maggie Elizabeth, "A Local Radial Basis Function Method for the Numerical Solution of Partial Differential Equations" (2012). Theses, Dissertations and Capstones. Paper 243. This Thesis is brought to you for free and open access by Marshall Digital Scholar. It has been accepted for inclusion in Theses, Dissertations and Capstones by an authorized administrator of Marshall Digital Scholar. For more information, please contact [email protected]. A LOCAL RADIAL BASIS FUNCTION METHOD FOR THE NUMERICAL SOLUTION OF PARTIAL DIFFERENTIAL EQUATIONS Athesissubmittedto the Graduate College of Marshall University In partial fulfillment of the requirements for the degree of Master of Arts in Mathematics by Maggie Elizabeth Chenoweth Approved by Dr. Scott Sarra, Committee Chairperson Dr. Anna Mummert Dr. Carl Mummert Marshall University May 2012 Copyright by Maggie Elizabeth Chenoweth 2012 ii ACKNOWLEDGMENTS I would like to begin by expressing my sincerest appreciation to my thesis advisor, Dr. Scott Sarra. His knowledge and expertise have guided me during my research endeavors and the process of writing this thesis. Dr. Sarra has also served as my teaching mentor, and I am grateful for all of his encouragement and advice. It has been an honor to work with him. I would also like to thank the other members of my thesis committee, Dr.
    [Show full text]
  • Linear Basis Function Models
    Linear basis function models Linear basis function models Linear models for regression Francesco Corona FC - Fortaleza Linear basis function models Linear models for regression Linear basis function models Linear models for regression The focus so far on unsupervised learning, we turn now to supervised learning ◮ Regression The goal of regression is to predict the value of one or more continuous target variables t, given the value of a D-dimensional vector x of input variables ◮ e.g., polynomial curve fitting FC - Fortaleza Linear basis function models Linear basis function models Linear models for regression (cont.) Training data of N = 10 points, blue circles ◮ each comprising an observation of the input variable x along with the 1 corresponding target variable t t 0 The unknown function sin(2πx) is used to generate the data, green curve ◮ −1 Goal: Predict the value of t for some new value of x ◮ 0 x 1 without knowledge of the green curve The input training data x was generated by choosing values of xn, for n = 1,..., N, that are spaced uniformly in the range [0, 1] The target training data t was obtained by computing values sin(2πxn) of the function and adding a small level of Gaussian noise FC - Fortaleza Linear basis function models Linear basis function models Linear models for regression (cont.) ◮ We shall fit the data using a polynomial function of the form M 2 M j y(x, w)= w0 + w1x + w2x + · · · + wM x = wj x (1) Xj=0 ◮ M is the polynomial order, x j is x raised to the power of j ◮ Polynomial coefficients w0,..., wM are collected
    [Show full text]
  • Introduction to Radial Basis Function Networks
    Intro duction to Radial Basis Function Networks Mark J L Orr Centre for Cognitive Science University of Edinburgh Buccleuch Place Edinburgh EH LW Scotland April Abstract This do cumentisanintro duction to radial basis function RBF networks a typ e of articial neural network for application to problems of sup ervised learning eg regression classication and time series prediction It is now only available in PostScript an older and now unsupp orted hyp ertext ver sion maybeavailable for a while longer The do cumentwas rst published in along with a package of Matlab functions implementing the metho ds describ ed In a new do cument Recent Advances in Radial Basis Function Networks b ecame available with a second and improved version of the Matlab package mjoancedacuk wwwancedacukmjopapersintrops wwwancedacukmjointrointrohtml wwwancedacukmjosoftwarerbfzip wwwancedacukmjopapersrecadps wwwancedacukmjosoftwarerbfzip Contents Intro duction Sup ervised Learning Nonparametric Regression Classication and Time Series Prediction Linear Mo dels Radial Functions Radial Basis Function Networks Least Squares The Optimal WeightVector The Pro jection Matrix Incremental Op erations The Eective NumberofParameters Example Mo del Selection Criteria CrossValidation Generalised
    [Show full text]
  • Linear Models for Regression
    Machine Learning Srihari Linear Models for Regression Sargur Srihari [email protected] 1 Machine Learning Srihari Topics in Linear Regression • What is regression? – Polynomial Curve Fitting with Scalar input – Linear Basis Function Models • Maximum Likelihood and Least Squares • Stochastic Gradient Descent • Regularized Least Squares 2 Machine Learning Srihari The regression task • It is a supervised learning task • Goal of regression: – predict value of one or more target variables t – given d-dimensional vector x of input variables – With dataset of known inputs and outputs • (x1,t1), ..(xN,tN) • Where xi is an input (possibly a vector) known as the predictor • ti is the target output (or response) for case i which is real-valued – Goal is to predict t from x for some future test case • We are not trying to model the distribution of x – We dont expect predictor to be a linear function of x • So ordinary linear regression of inputs will not work • We need to allow for a nonlinear function of x • We don’t have a theory of what form this function to take 3 Machine Learning Srihari An example problem • Fifty points generated (one-dimensional problem) – With x uniform from (0,1) – y generated from formula y=sin(1+x2)+noise • Where noise has N(0,0.032) distribution • Noise-free true function and data points are as shown 4 Machine Learning Srihari Applications of Regression 1. Expected claim amount an insured person will make (used to set insurance premiums) or prediction of future prices of securities 2. Also used for algorithmic
    [Show full text]
  • Radial Basis Functions
    Acta Numerica (2000), pp. 1–38 c Cambridge University Press, 2000 Radial basis functions M. D. Buhmann Mathematical Institute, Justus Liebig University, 35392 Giessen, Germany E-mail: [email protected] Radial basis function methods are modern ways to approximate multivariate functions, especially in the absence of grid data. They have been known, tested and analysed for several years now and many positive properties have been identified. This paper gives a selective but up-to-date survey of several recent developments that explains their usefulness from the theoretical point of view and contributes useful new classes of radial basis function. We consider particularly the new results on convergence rates of interpolation with radial basis functions, as well as some of the various achievements on approximation on spheres, and the efficient numerical computation of interpolants for very large sets of data. Several examples of useful applications are stated at the end of the paper. CONTENTS 1 Introduction 1 2 Convergence rates 5 3 Compact support 11 4 Iterative methods for implementation 16 5 Interpolation on spheres 25 6 Applications 28 References 34 1. Introduction There is a multitude of ways to approximate a function of many variables: multivariate polynomials, splines, tensor product methods, local methods and global methods. All of these approaches have many advantages and some disadvantages, but if the dimensionality of the problem (the number of variables) is large, which is often the case in many applications from stat- istics to neural networks, our choice of methods is greatly reduced, unless 2 M. D. Buhmann we resort solely to tensor product methods.
    [Show full text]
  • Basis Functions
    Basis Functions Volker Tresp Summer 2017 1 Nonlinear Mappings and Nonlinear Classifiers • Regression: { Linearity is often a good assumption when many inputs influence the output { Some natural laws are (approximately) linear F = ma { But in general, it is rather unlikely that a true function is linear • Classification: { Linear classifiers also often work well when many inputs influence the output { But also for classifiers, it is often not reasonable to assume that the classification boundaries are linear hyper planes 2 Trick • We simply transform the input into a high-dimensional space where the regressi- on/classification is again linear! • Other view: let's define appropriate features • Other view: let's define appropriate basis functions • XOR-type problem with patterns 0 0 ! +1 1 0 ! −1 0 1 ! −1 1 1 ! +1 3 XOR is not linearly separable 4 Trick: Let's Add Basis Functions • Linear Model: input vector: 1; x1; x2 • Let's consider x1x2 in addition • The interaction term x1x2 couples two inputs nonlinearly 5 With a Third Input z3 = x1x2 the XOR Becomes Linearly Separable f(x) = 1 − 2x1 − 2x2 + 4x1x2 = φ1(x) − 2φ2(x) − 2φ3(x) + 4φ4(x) with φ1(x) = 1; φ2(x) = x1; φ3(x) = x2; φ4(x) = x1x2 6 f(x) = 1 − 2x1 − 2x2 + 4x1x2 7 Separating Planes 8 A Nonlinear Function 9 f(x) = x − 0:3x3 2 3 Basis functions φ1(x) = 1; φ2(x) = x; φ3(x) = x ; φ4(x) = x und w = (0; 1; 0; −0:3) 10 Basic Idea • The simple idea: in addition to the original inputs, we add inputs that are calculated as deterministic functions of the existing inputs, and treat them as additional inputs
    [Show full text]
  • Lecture 6 – Orthonormal Wavelet Bases
    Lecture 6 – Orthonormal Wavelet Bases David Walnut Department of Mathematical Sciences George Mason University Fairfax, VA USA Chapman Lectures, Chapman University, Orange, CA 6-10 November 2017 Walnut (GMU) Lecture 6 – Orthonormal Wavelet Bases Outline Wavelet systems Example 1: The Haar Wavelet basis Characterizations of wavelet bases Example 2: The Shannon Wavelet basis Example 3: The Meyer Wavelet basis Walnut (GMU) Lecture 6 – Orthonormal Wavelet Bases Wavelet systems Definition A wavelet system in L2(R) is a collection of functions of the form j/2 j D2j Tk j,k Z = 2 (2 x k) j,k Z = j,k j,k Z { } 2 { − } 2 { } 2 where L2(R) is a fixed function sometimes called the 2 mother wavelet. A wavelet system that forms an orthonormal basis for L2(R) is called a wavelet orthonormal basis for L2(R). Walnut (GMU) Lecture 6 – Orthonormal Wavelet Bases If the mother wavelet (x) is concentrated around 0 then j j,k (x) is concentrated around 2− k. If (x) is essentially supported on an interval of length L, then j,k (x) is essentially j supported on an interval of length 2− L. Walnut (GMU) Lecture 6 – Orthonormal Wavelet Bases Since (D j T ) (γ)=D j M (γ) it follows that if is 2 k ^ 2− k concentrated on the interval I then j,k is concentrated on the interval 2j I. b b d Walnut (GMU) Lecture 6 – Orthonormal Wavelet Bases Dyadic time-frequency tiling Walnut (GMU) Lecture 6 – Orthonormal Wavelet Bases Example 1: The Haar System This is historically the first orthonormal wavelet basis, described by A.
    [Show full text]
  • Finite Element Methods
    Part I Finite Element Methods Chapter 1 Approximation of Functions 1.1 Approximation of Functions Many successful numerical methods for differential equations aim at approx- imating the unknown function by a sum N u(x)= ciϕi(x), (1.1) i=0 X where ϕi(x) are prescribed functions and ci, i = 0,...,N, are unknown coffi- cients to be determined. Solution methods for differential equations utilizing (1.1) must have a principle for constructing N + 1 equations to determine c0,...,cN . Then there is a machinery regarding the actual constructions of the equations for c0,...,cN in a particular problem. Finally, there is a solve phase for computing solution c0,...,cN of the N + 1 equations. Especially in the finite element method, the machinery for constructing the equations is quite comprehensive, with many mathematical and imple- mentational details entering the scene at the same time. From a pedagogical point of view it can therefore be wise to introduce the computational ma- chinery for a trivial equation, namely u = f. Solving this equation with f given and u on the form (1.1) means that we seek an approximation u to f. This approximation problem has the advantage of introducing most of the finite element toolbox, but with postponing variational forms, integration by parts, boundary conditions, and coordinate mappings. It is therefore from a pedagogical point of view advantageous to become familiar with finite ele- ment approximation before addressing finite element methods for differential equations. First, we refresh some linear algebra concepts about approximating vec- tors in vector spaces. Second, we extend these concepts to approximating functions in function spaces, using the same principles and the same nota- tion.
    [Show full text]
  • Basis Functions
    Basis Functions Volker Tresp Summer 2015 1 I am an AI optimist. We've got a lot of work in machine learning, which is sort of the polite term for AI nowadays because it got so broad that it's not that well defined. Bill Gates (Scientific American Interview, 2004) \If you invent a breakthrough in artificial intelligence, so machines can learn," Mr. Gates responded,\that is worth 10 Microsofts."(Quoted in NY Times, Monday March 3, 2004) 2 Nonlinear Mappings and Nonlinear Classifiers • Regression: { Linearity is often a good assumption when many inputs influence the output { Some natural laws are (approximately) linear F = ma { But in general, it is rather unlikely that a true function is linear • Classification: { Similarly, it is often not reasonable to assume that the classification boundaries are linear hyper planes 3 Trick • We simply transform the input into a high-dimensional space where the regressi- on/classification is again linear! • Other view: let's define appropriate features • Other view: let's define appropriate basis functions 4 XOR is not linearly separable 5 Trick: Let's Add Basis Functions • Linear Model: input vector: 1; x1; x2 • Let's consider x1x2 in addition • The interaction term x1x2 couples two inputs nonlinearly 6 With a Third Input z3 = x1x2 the XOR Becomes Linearly Separable f(x) = 1 − 2x1 − 2x2 + 4x1x2 = φ1(x) − 2φ2(x) − 2φ3(x) + 4φ4(x) with φ1(x) = 1; φ2(x) = x1; φ3(x) = x2; φ4(x) = x1x2 7 f(x) = 1 − 2x1 − 2x2 + 4x1x2 8 Separating Planes 9 A Nonlinear Function 10 f(x) = x − 0:3x3 2 3 Basis functions φ1(x) = 1; φ2(x)
    [Show full text]
  • Choosing Basis Functions and Shape Parameters for Radial Basis Function Methods
    Choosing Basis Functions and Shape Parameters for Radial Basis Function Methods Michael Mongillo October 25, 2011 Abstract Radial basis function (RBF) methods have broad applications in numerical analysis and statistics. They have found uses in the numerical solution of PDEs, data mining, machine learning, and kriging methods in statistics. This work examines the use of radial basis func- tions in scattered data approximation. In particular, the experiments in this paper test the properties of shape parameters in RBF methods, as well as methods for finding an optimal shape parameter. Locating an optimal shape parameter is a difficult problem and a topic of current research. Some experiments also consider whether the same methods can be applied to the more general problem of selecting basis functions. 1 Introduction Radial basis function (RBF) methods are an active area of mathematical research. The initial motivation for RBF methods came from geodesy, mapping, and meteorology. They have since found applications in other areas, such as the numerical solution of PDEs, machine learning, and statistics. In the 1970s, Rolland Hardy suggested what he called the multiquadric method for ap- plications in cartography because he was not satisfied with the results of polynomial interpolation. His new method for fitting data could handle unevenly distributed data sites with greater consis- tency than previous methods [Har90]. The method of RBF interpolation used in this paper is a generalization of Hardy’s multiquadric and inverse multiquadric methods. Many RBFs are defined by a constant called the shape parameter. The choice of basis function and shape parameter have a significant impact on the accuracy of an RBF method.
    [Show full text]