The Fast Fourier Transform and Applications to Multiplication

Total Page:16

File Type:pdf, Size:1020Kb

The Fast Fourier Transform and Applications to Multiplication The Fast Fourier Transform and Applications to Multiplication Analysis of Algorithms Prepared by John Reif, Ph.D. Topics and Readings: - The Fast Fourier Transform • Reading Selection: • CLR, Chapter 30 Advanced Material : - Using FFT to solve other Multipoint Evaluation Problems - Applications to Multiplication Nth Roots of Unity • Assume Commutative Ring (R,+,·, 0,1) • ω is principal nth root of unity if – ωk ≠ 1 for k = 1, …, n-1 – ωn = 1, and n-1 ∑ω jp = 0 for 1≤ p ≤ n j=0 2πi/n • Example: ω = e for complex numbers Example of nth Root of Unity for Complex Numbers ω = e 2 π i / 8 is the 8th root of unity Fourier Matrix #1 1 … 1 $ % n−1 & %1 ω … ω & M (ω) = %1 ω 2 … ω 2(n−1) & n % & % & % n−1 (n−1)(n−1) & '1 ω … ω ( ij so M(ω)ij = ω for 0 ≤ i, j<n #a0 $ % & given a =% & % & 'a n-1 ( Discrete Fourier Transform T Input a column n-vector a = (a0, …, an-1) Output an n-vector which is the product of the Fourier matrix times the input vector DFTn (a) = M(ω) x a "f0 # $ % = $ % where $ % &fn-1 ' n-1 f = a ik i ∑ kω k=0 Inverse Fourier Transform -1 -1 DFTn (a) = M(ω) x a 1 Theorem M(ω)-1 = ω -ij ij n proof We must show M(ω)⋅M(ω)-1 = I 1 n-1 1 n-1 ∑ωikω -kj = ∑ω k(i-j) n k=0 n k=0 $0 if i-j ≠ 0 = % &1 if i-j = 0 n-1 using identity ∑ω kp = 0, for 1≤ p < n k=0 Fourier Transform is Polynomial Evaluation at the Roots of Unity T Input a column n-vector a = (a0, …, an-1) T Output an n-vector (f0, …, fn-1) which are the values polynomial f(x)at the n roots of unity "f0 # DFT (a) = $ % where n $ % $ % &fn-1 ' i fi = f(ω ) and n-1 f(x) = a x j ∑ j ⋅ j=0 Fast Fourier Transform • Viewed as Evaluation Problem: naïve algorithm takes n2 ops • Divide and Conquer gives FFT with O(n log n) ops for n a power of 2 • Key Idea: • If ω is nth root of unity then ω2 is n/2th root of unity • So can reduce the problem to two subproblems of size n/2 Algorithm FFTn T • Input a = (a0, …, an-1) , n a power of 2 [1] If n=1 then ouput T ⎛⎞'' T [2] ⎜⎟f0n ,...,f← FFTnn (( a 022 , a ,..., a − ) ) −1 ⎝⎠22 T ⎛⎞"" T ⎜⎟f0n ,...,f← FFTnn (( a 131 , a ,..., a − ) ) −1 ⎝⎠22 n [3] For i=0, ...,−← 1 do f f'"+ωi f 2 ii i '"i fni←− fω f i i+ 2 [4] Output (f0 , f 1 , ..., f n-1 ) FFT Circuit (also known as Butterfly Network) • Total Recursion depth = log n • Communication Distance 2d at depth d i 2i (n-1)i fi = a 0 + a 1ωω + a 2 +...+a n-1 ω 'i" fii = f + ω f i where i(n-2) ' 2 i 2 2i 2 2 fi = a 0 + a 2 (ωω ) + a 4 ( ) +...+ a n-2 ( ω ) i(n-2) "2i22 fi = a 1 + a 3 (ωω ) +...+ a n-1 ( ) ⎡⎤' ⎡⎤a0 ⎢⎥f0 ⎢a ⎥ ⎢⎥ M = M()ω 2 ⎢ 2 ⎥ = DFT(( a , a ,..., a )T ) ⎢⎥ n ⎢ M ⎥ nn02− 2 ' 2 2 ⎢⎥f n ⎢ ⎥ −1 a ⎣⎦⎢⎥2 ⎣⎦n−2 ⎡⎤" ⎡⎤a1 ⎢⎥f0 ⎢⎥a ⎢⎥ M ==M (ω 2 )⎢⎥3 DFT (( a , a ,..., a )T ) ⎢⎥ n131⎢⎥ M nn− " 22 ⎢⎥f n ⎢⎥ −1 a ⎣⎦⎢⎥2 ⎣⎦n−1 ' ' " " n Note: f n = fi , f n = fi , i=0, ..., −1 +i +i 2 2 2 n +1 But ω n =1, so (ω 2 ) 2 = ω n ⋅(ω 2 )i = ω 2i n for i=0, ..., −1 2 n Thus, f = f ' + ωi f " for i=0, ..., −1 i i i 2 n i + ' 2 " and f n = fi +ω ⋅ fi i + 2 n = f ' - ωi f " for i=0, ..., −1 i i 2 n n since (ω 2 )2 = ω n =1, so ω 2 = -1 Operation Counts for FFT Algorithm • Assume n = 2k • # additions Add(n) = 2· Add(n/2) + n = n log n • # multiplications Mult(n) = 2· Mult(n/2) + n/2 = ½ n log n • Total Time O(n log n) • Note in complex FFT, # real ops is 5 n log n Multipoint Polynomial Evaluation n−1 f (x) a xi • Input polynomial = ∑ i i=0 • Problem evaluate f(x) at x0, x1, …, xn-1 • Easy Cases: eval at roots of unity k FFT Case xk = ω for k=0,…,n Multipoint Polynomial Evaluation (cont’d) Summary of FFT: method f() x=+ f '() y x f "() y where yx= 2 fxf'( ), "( x ) both degree halved ⇒ needed to only evaluate at half as many points Other Polynomial Evaluation Problems Solved by FFT Each costs O(n log n) time i • Evaluate at points Xi = ba + d for i=0,…, n-1(Chirp Transfom) – Reduced to FFT • Single point evaluation of all derivatives of a polynomial – Solve by reduction to above Chirp Transform of case 2) i 2 i • Evaluate at points Xi = b(a ) + ca + d for i=0,…, n-1 – Solve by divide and conquer similar to FFT Single Point Evaluation of all Derivatives of Polynomial • Input n−1 f ( x ) a x i and point x = ∑ i 0 i=0 • output d k f (x) f k (x) = x = x for k = 0,...,n −1 dx 0 Single Point Evaluation of all Derivatives of Polynomial (cont’d) • Taylor Series Representation of n−1 f (x) c (x x )i = ∑ i − 0 i=0 Then (k ) f (x0 ) = k!ck This reduces to case of evaluation at points i xi == ab for i0,..., n − 1 • Solve this Chirp Transform problem by reduction FFT Advanced Material: Further Applications of FFT 1) Convolution: Products and Powers of Polynomials • Used for for Integer Multiplication Algorithms • Also used for Filtering on infinite input streams 2) Division and Inverse of Polynomials 3) Multipoint Evaluation and Interpolation Advanced Material: Products and Powers of Polynomials • Input vectors T a = (a0, a1, …, an-1) T b = (b0, b1, …, bn-1) • Definition of Convolution c = a b ⊗ n-1 c = a b Where i ∑ j i- j for i=0, …, 2n-1 j=0 define ak = bk = 0 if k< 0 or k>n Products and Powers of Polynomials (cont’d) • Convolution Theorem a b = FFT-1 FFT (a) FFT (b) ⊗ 2n ( 2n ⋅ 2n ) • Application to Polynomial Products: n-1 p(x) = a xi ∑ i i=0 n-1 q(x) = b x j ∑ j j=0 2n-2 n-1 p(x) q(x) = c xi where c = a b ⋅ ∑ i i ∑ j i-j i=0 j=0 Products of m Polynomials n-1 for k=1, ..., m let P (x) = a xi k ∑ k,i i=0 m m(n-1) m P (x) = c xi , where c = a ∏ k ∑ i i ∑∏ k,jk k=1 i=0 k=1 ∑ jk =1 • Generalized Convolution Theorem a12⊗⊗⊗ a... a= m -1 FFTn m( FFT n m (a 1 )••• FFT n m (a 2 ) ... FFT n m (a m )) Wrapped Convolutions T T • a = (a0, a1, …, an-1) , b = (b0, b1, …, bn-1) • Positive wrapped convolution is T c = (c0, c1, …, cn-1) i n-1 c = a b + a b i ∑ j i-j ∑ j n+i-j j=0 j=i+1 • Negative wrapped convolution is T d = (d0, d1, …, dn-1) i n-1 d = a b - a b i ∑ j i-j ∑ j n+i-j j=0 j=i+1 Application of Wrapped Convolution to Modular Polynomial Products n-1 p(x) = a xi ∑ i i=0 n-1 q(x) = b x j ∑ j j=0 p(x)⋅q(x) mod(xn +1) n-1 = d xi since xn = -1 mod(xn +1) ∑ i i=0 Computing Positive Wrapped Convolution • Let ω = principal nth root of unity • Assume n has multiplicative inverse, Theorem -1 c = FFTn (FFTn (a)⋅FFTn (b)) is the positive wrapped convolution of n-vectors a and b. Computing Negative Wrapped Convolution • Also dˆ = FFT-1 FFT (aˆ) FFT (bˆ) n ( n ⋅ n ) is the negatively wrapped convolution of n-vectors a and b T where ˆ n-1 a = (a0 , Ψa1, ...,Ψ a n-1 ) T ˆ n-1 b = b0 , Ψb1, ...,Ψ bn-1 ( ) and Ψ2 = ω = principal nth root of unity Integer Multiplication by Polynomial Product (solved via FFT) • Input n bit integers a,b define polynomials degree k = n/L k-1 a(x) = a xi , 0 a 2L ∑ i ≤ i ≤ i=0 k-1 b(x) = b xi , 0 b 2L ∑ i ≤ i ≤ i=0 so a = a(2L ), b = b(2L ) Integer Multiplication by Polynomial Product (cont’d) • Idea 1) Compute c(x) = a(x)· b(x) by convolution 2) Evaluate c(2L) = a· b Integer Multiplication Algorithms using Reduction to Polynomial Product • Pollard Mult Algorithm O(n(logn)2 )(loglogn)∈ use L = logn • Karp Mult Algorithm O(n(logn)2 ) use L = n • Schönage-Strassen Mult Algorithm O(n(logn)(loglogn)) use L = n and wrapped convolution Pollard Multiplication Algorithm • n = kL, L = 1 + log k 1) Choose primes P1, P2, P3 where 3 P1 ⋅P2 ⋅P3 ≥ 4⋅k L and Pi = á i ⋅2 + 1, á i = O(1) 2) Compute C(x) by convolution over finite field Zpi for i =1,2,3 (requires k mults on 2L bit integers) Pollard Multiplication Algorithm (cont’d) 3) Evaluate C(2L) • Time Bounds recursive mults FFT T(n) = 3kT(2L) + O(k logk)⋅O(L) = 3kT (2(1 + logk)) + O(k(log k)2 ) ≤ O(n(log n)2 (log log n)∈) for any ∈ > 0 Korp Multiplication Algorithm n = 2s = kL ! s 22 if s even k = " # (s-1) " 2 $2 else 1) Compute C(x) modulo k by convolution 2) Compute C(x) modulo (22L+1) by convolution 3) Compute C(x) coefficients from C(x) mod k, C(x) mod (22L+1) by Chinese remaindering Korp Multiplication Algorithm (cont’d) 4) Compute C(2L) • Time recursive mults FFT T(n) = 2kT(2L) + O(k logk)O(L) = 2 nT (2 n ) + O(n log n) = O(n(log n)2 ) Schönage-Strassen Multiplication Algorithm (2’) Compute C(x) mod (xk+1) modulo (22L+1) by wrapped convolution Requires only k recursive mults on 2L bit numbers • Time recursive mults FFT T(n) = kT(2L) + O(k logk)O(L) = nT (2 n ) + O(n log n) = O(n log n)(log log n) Still Open Problem: How Fast Can You Multiply Integers? • Can you mult n bit integers in O(n log n) time? The Fast Fourier Transform and Applications to Multiplication Analysis of Algorithms Prepared by John Reif, Ph.D.
Recommended publications
  • Polynomial Evaluation Engl2
    Alexander Rostovtsev, Alexey Bogdanov and Mikhail Mikhaylov St. Petersburg state Polytechnic University [email protected] Secure evaluation of polynomial using privacy ring homomorphisms Method of secure evaluation of polynomial y = F(x1, …, xk) over some rings on untrusted computer is proposed. Two models of untrusted computer are considered: passive and active. In passive model untrusted computer correctly computes polyno- mial F and tries to know secret input (x1, …, xk) and output y. In active model un- trusted computer tries to know input and output and tries to change correct output y so that this change cannot be determined. Secure computation is proposed by using one-time privacy ring homomorphism /n ® /n[z]/(f(z)), n = pq, generated by trusted computer. In the case of active model secret check point v = F(u1, …, uk) is used. Trusted computer generates poly- nomial f(z) = (z - t)(z + t), t Î /n and input Xi(z) Î /n[z]/(f(z)) such that Xi(t) º xi (mod n) for passive model, and f(z) = (z - t1)(z - t2)(z - t3), ti Î /n and input Xi(z) Î /n[z]/(f(z)) such that Xi(t1) º xi (mod n), Xi(t2) º ui (mod n) for active model. Untrusted computer computes function Y(z) = F(X1(z), …, Xk(z)) in the ring /n[z]/(f(z)). For passive model trusted computer determines secret output y º Y(t) (mod n). For active model trusted computer checks that Y(t2) º v (mod n), then deter- mines correct output y º Y(t1) (mod n).
    [Show full text]
  • Generalized Kakeya Sets for Polynomial Evaluation and Faster Computation of Fermionants
    Generalized Kakeya sets for polynomial evaluation and faster computation of fermionants Andreas Björklund1, Petteri Kaski2, and Ryan Williams3 1 Department of Computer Science, Lund University, Lund, Sweden 2 Department of Computer Science, Aalto University, Helsinki, Finland 3 Department of Electrical Engineering and Computer Science & CSAIL, MIT, Cambridge, MA, USA Abstract We present two new data structures for computing values of an n-variate polynomial P of degree at most d over a finite field of q elements. Assuming that d divides q − 1, our first data structure relies on (d + 1)n+2 tabulated values of P to produce the value of P at any of the qn points using O(nqd2) arithmetic operations in the finite field. Assuming that s divides d and d/s divides q − 1, our second data structure assumes that P satisfies a degree-separability condition and relies on (d/s+1)n+s tabulated values to produce the value of P at any point using O(nqssq) arithmetic operations. Our data structures are based on generalizing upper-bound constructions due to Mockenhaupt and Tao (2004), Saraf and Sudan (2008), and Dvir (2009) for Kakeya sets in finite vector spaces from linear to higher-degree polynomial curves. As an application we show that the new data structures enable a faster algorithm for com- puting integer-valued fermionants, a family of self-reducible polynomial functions introduced by Chandrasekharan and Wiese (2011) that captures numerous fundamental algebraic and combin- atorial invariants such as the determinant, the permanent, the number of Hamiltonian cycles in a directed multigraph, as well as certain partition functions of strongly correlated electron systems in statistical physics.
    [Show full text]
  • Factoring Polynomials Over Finite Fields
    Factoring Polynomials over Finite Fields More precisely: Factoring and testing irreduciblity of sparse polynomials over small finite fields Richard P. Brent MSI, ANU joint work with Paul Zimmermann INRIA, Nancy 27 August 2009 Richard Brent (ANU) Factoring Polynomials over Finite Fields 27 August 2009 1 / 64 Outline Introduction I Polynomials over finite fields I Irreducible and primitive polynomials I Mersenne primes Part 1: Testing irreducibility I Irreducibility criteria I Modular composition I Three algorithms I Comparison of the algorithms I The “best” algorithm I Some computational results Part 2: Factoring polynomials I Distinct degree factorization I Avoiding GCDs, blocking I Another level of blocking I Average-case complexity I New primitive trinomials Richard Brent (ANU) Factoring Polynomials over Finite Fields 27 August 2009 2 / 64 Polynomials over finite fields We consider univariate polynomials P(x) over a finite field F. The algorithms apply, with minor changes, for any small positive characteristic, but since time is limited we assume that the characteristic is two, and F = Z=2Z = GF(2). P(x) is irreducible if it has no nontrivial factors. If P(x) is irreducible of degree r, then [Gauss] r x2 = x mod P(x): 2r Thus P(x) divides the polynomial Pr (x) = x − x. In fact, Pr (x) is the product of all irreducible polynomials of degree d, where d runs over the divisors of r. Richard Brent (ANU) Factoring Polynomials over Finite Fields 27 August 2009 3 / 64 Counting irreducible polynomials Let N(d) be the number of irreducible polynomials of degree d. Thus X r dN(d) = deg(Pr ) = 2 : djr By Möbius inversion we see that X rN(r) = µ(d)2r=d : djr Thus, the number of irreducible polynomials of degree r is ! 2r 2r=2 N(r) = + O : r r Since there are 2r polynomials of degree r, the probability that a randomly selected polynomial is irreducible is ∼ 1=r ! 0 as r ! +1.
    [Show full text]
  • Arxiv:0911.1393V5 [Cs.CC] 1 Jul 2013 Author’S Addresses: C
    0 Most Tensor Problems are NP-Hard CHRISTOPHER J. HILLAR, Mathematical Sciences Research Institute LEK-HENG LIM, University of Chicago We prove that multilinear (tensor) analogues of many efficiently computable problems in numerical linear algebra are NP-hard. Our list here includes: determining the feasibility of a system of bilinear equations, deciding whether a 3-tensor possesses a given eigenvalue, singular value, or spectral norm; approximating an eigenvalue, eigenvector, singular vector, or the spectral norm; and determining the rank or best rank-1 approximation of a 3-tensor. Furthermore, we show that restricting these problems to symmetric tensors does not alleviate their NP-hardness. We also explain how deciding nonnegative definiteness of a symmetric 4-tensor is NP-hard and how computing the combinatorial hyperdeterminant of a 4-tensor is NP-, #P-, and VNP-hard. We shall argue that our results provide another view of the boundary separating the computa- tional tractability of linear/convex problems from the intractability of nonlinear/nonconvex ones. Categories and Subject Descriptors: G.1.3 [Numerical Analysis]: Numerical Linear Algebra General Terms: Tensors, Decidability, Complexity, Approximability Additional Key Words and Phrases: Numerical multilinear algebra, tensor rank, tensor eigenvalue, tensor singular value, tensor spectral norm, system of multilinear equations, hyperdeterminants, symmetric ten- sors, nonnegative definite tensors, bivariate matrix polynomials, NP-hardness, #P-hardness, VNP-hardness, undecidability, polynomial time approximation schemes ACM Reference Format: Hillar, C. J., and Lim, L.-H. 2012. Most tensor problems are NP-hard J. ACM 0, 0, Article 0 (June 2013), 38 pages. DOI = 10.1145/0000000.0000000 http://doi.acm.org/10.1145/0000000.0000000 1.
    [Show full text]
  • Polynomial Evaluation and Interpolation on Special Sets of Points
    View metadata, citation and similar papers at core.ac.uk brought to you by CORE provided by Elsevier - Publisher Connector Journal of Complexity 21 (2005) 420–446 www.elsevier.com/locate/jco Polynomial evaluation and interpolation on special sets of points Alin Bostan∗, Éric Schost Laboratoire STIX, École polytechnique, 91128 Palaiseau, France Received 31 January 2004; accepted 6 September 2004 Available online 10 February 2005 Abstract We give complexity estimates for the problems of evaluation and interpolation on various polyno- mial bases. We focus on the particular cases when the sample points form an arithmetic or a geometric sequence, and we discuss applications, respectively, to computations with linear differential operators and to polynomial matrix multiplication. © 2005 Elsevier Inc. All rights reserved. Keywords: Polynomial evaluation and interpolation; Transposition principle; Polynomial matrix multiplication; Complexity 1. Introduction Let k be a field and let x = x0,...,xn−1 be n pairwise distinct points in k. Given arbitrary values v = v0,...,vn−1 in k, there exists a unique polynomial F in k[x] of degree less than n such that F(xi) = vi, for i = 0,...,n− 1. Having fixed a basis of the vector space of polynomials of degree at most n − 1, interpolation and evaluation questions consist in computing the coefficients of F on this basis from the values v, and conversely. ∗ Corresponding author. E-mail addresses: [email protected] (A. Bostan), [email protected] (É. Schost). 0885-064X/$ - see front matter © 2005 Elsevier Inc. All rights reserved. doi:10.1016/j.jco.2004.09.009 A.
    [Show full text]
  • Some Parallel Algorithms for Integer Factorisation
    Some Parallel Algorithms for Integer Factorisation Richard P. Brent Oxford University Computing Laboratory, Wolfson Building, Parks Road, Oxford OX1 3QD, UK [email protected] http://www.comlab.ox.ac.uk/ oucl/people/richard.brent.html Abstract. Algorithms for finding the prime factors of large composite numbers are of practical importance because of the widespread use of public key cryptosystems whose security depends on the presumed diffi- culty of the factorisation problem. In recent years the limits of the best integer factorisation algorithms have been extended greatly, due in part to Moore’s law and in part to algorithmic improvements. It is now rou- tine to factor 100-decimal digit numbers, and feasible to factor numbers of 155 decimal digits (512 bits). We describe several integer factorisa- tion algorithms, consider their suitability for implementation on parallel machines, and give examples of their current capabilities. 1 Introduction Any positive integer N has a unique prime power decomposition α α α 1 2 ··· k N = p1 p2 pk (p1 <p2 < ··· <pk primes, αj > 0). This result is easy to prove, but the standard proof gives no hint of an efficient algorithm for computing the prime power decomposition. In order to compute it, we need – 1. An algorithm to test if an integer N is prime. 2. An algorithm to find a nontrivial factor f of a composite integer N. Given these components there is a simple recursive algorithm to compute the prime power decomposition. Fortunately or unfortunately, depending on one’s point of view, problem 2 is generally believed to be hard.
    [Show full text]
  • Some Integer Factorization Algorithms Using Elliptic Curves
    Some Integer Factorization Algorithms using Elliptic Curves Richard P. Brent Computer Sciences Laboratory Australian National University 24 September 1985 Revised 10 Dec. 1985 Republished 7 Nov. 1998 Abstract Lenstra’s integer factorization algorithm is asymptotically one of the fastest known al- gorithms, and is also ideally suited for parallel computation. We suggest a way in which the algorithm can be speeded up by the addition of a second phase. Under some plausible assumptions, the speedup is of order log(p), where p is the factor which is found. In practice the speedup is significant. We mention some refinements which give greater speedup, an alternative way of implementing a second phase, and the connection with Pollard’s “p − 1” factorization algorithm. 1 Introduction Recently H.W. Lenstra Jr. proposed a new integer factorization algorithm, which we shall call “Lenstra’s algorithm” or the “one-phase elliptic curve algorithm” [17]. Under some plausible assumptions Lenstra’s algorithm finds a prime factor p of a large composite integer N in expected time q T1(p) = exp (2 + o(1)) ln p ln ln p , (1.1) where “o(1)” means a term which tends to zero as p → ∞. Previously algorithms with run- ning time exp p(1 + o(1)) ln N ln ln N were known [27]. However, since p2 ≤ N, Lenstra’s algorithm is comparable in the worst case and often much better, since it often happens that 2 ln p ln N. The Brent-Pollard “rho” algorithm [5] is similar to Lenstra’s algorithm in that its expected 1/2 1/2 running time depends on p, in fact it is of order p .
    [Show full text]
  • Polynomial Evaluation and Side Channel Analysis Claude Carlet, Emmanuel Prouff
    Polynomial Evaluation and Side Channel Analysis Claude Carlet, Emmanuel Prouff To cite this version: Claude Carlet, Emmanuel Prouff. Polynomial Evaluation and Side Channel Analysis. TheNew Codebreakers, 9100, Springer, pp.315 - 341, 2016, Lecture Notes in Computer Science, 978-3-662- 49300-7. 10.1007/978-3-662-49301-4_20. hal-01399573 HAL Id: hal-01399573 https://hal.archives-ouvertes.fr/hal-01399573 Submitted on 19 Nov 2016 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. Polynomial Evaluation and Side Channel Analysis Claude Carlet1 and Emmanuel Prouff2;3 1 LAGA, UMR 7539, CNRS, Department of Mathematics, University of Paris XIII and University of Paris VIII [email protected] 2 ANSSI, FRANCE [email protected] 3 POLSYS, UMR 7606, LIP6, Sorbonne Universities, UPMC University Paris VI Abstract. Side Channel Analysis (SCA) is a class of attacks that ex- ploits leakage of information from a cryptographic implementation dur- ing execution. To thwart it, masking is a common countermeasure. The principle is to randomly split every sensitive intermediate variable oc- curring in the computation into several shares and the number of shares, called the masking order, plays the role of a security parameter.
    [Show full text]
  • Polynomial Evaluation Schemes
    MATHEMATICS OF COMPUTATION, VOLUME 29, NUMBER 132 OCTOBER 1975, PAGES 1046-1050 Polynomial Evaluation Schemes By A. C. R. Newbery* Abstract. An attempt is made to define a polynomial evaluation algorithm that is more resistant to accumulated round-off error than the schemes of Horner and Clenshaw under conditions of floating-point arithmetic. An algorithm is presented which generally compares favorably with both. Some suggestions are made, which could plausibly lead to substantial further improvements. In [1] it was established that the total effect of rounding error for Horner's (nested multiplication) evaluation scheme is bounded by (e + no) P(\a\)/(1 - no), where the arithmetic is performed in floating-point, e is the smallest number which significantly adds to 1, a = e(2 + e), and P(x) = 'Ln)\pr\xr. It was noted that both the theoretical and the empirically observed errors vary steeply with the magnitude of the argument. In view of this, it seems appropriate to look into the possibility of replacing a given polynomial evaluation problem by an equivalent one for which the argument has a smaller magnitude. We are presupposing a situation in which the over- head costs (for transforming one problem into another) may be neglected, e.g. where the polynomial approximates an elementary function on a library tape. Let the given problem be to evaluate P(x) = E^ p,xr at any argument in [-1,1]. It was observed in [1] that any finite-domain problem can be scaled into this form. For convenience we assume n = 2n '. (The analysis for odd-degree polynomials is only trivially different.) First, we split P into its even and odd components P(x) = E(x2) + x4>(x2), where E and 4> are polynomials of degree n' ,n' — 1 in the argument x2\ secondly, by writing x2 = xk + t with \t\ < J4, we can rewrite E(x2) and <ï>(x2)as Ê(t) and q>(r) with |r| < lk.
    [Show full text]
  • Fast Fourier Transform Algorithms with Applications Todd Mateer Clemson University, [email protected]
    Clemson University TigerPrints All Dissertations Dissertations 8-2008 Fast Fourier Transform Algorithms with Applications Todd Mateer Clemson University, [email protected] Follow this and additional works at: https://tigerprints.clemson.edu/all_dissertations Part of the Applied Mathematics Commons Recommended Citation Mateer, Todd, "Fast Fourier Transform Algorithms with Applications" (2008). All Dissertations. 231. https://tigerprints.clemson.edu/all_dissertations/231 This Dissertation is brought to you for free and open access by the Dissertations at TigerPrints. It has been accepted for inclusion in All Dissertations by an authorized administrator of TigerPrints. For more information, please contact [email protected]. FAST FOURIER TRANSFORM ALGORITHMS WITH APPLICATIONS A Dissertation Presented to the Graduate School of Clemson University In Partial Fulfillment of the Requirements for the Degree Doctor of Philosophy Mathematical Sciences by Todd Mateer August 2008 Accepted by: Dr. Shuhong Gao, Committee Chair Dr. Joel Brawley Dr. Neil Calkin Dr. Kevin James ABSTRACT This manuscript describes a number of algorithms that can be used to quickly evaluate a polynomial over a collection of points and interpolate these evaluations back into a polynomial. Engineers define the “Fast Fourier Transform” as a method of solving the interpolation problem where the coefficient ring used to construct the polynomials has a special multiplicative structure. Mathematicians define the “Fast Fourier Transform” as a method of solving the multipoint evaluation problem. One purpose of the document is to provide a mathematical treatment of the topic of the “Fast Fourier Transform” that can also be understood by someone who has an understanding of the topic from the engineering perspective. The manuscript will also introduce several new algorithms that efficiently solve the multipoint evaluation problem over certain finite fields and require fewer finite field operations than existing techniques.
    [Show full text]
  • Fast Polynomial Factorization and Modular Composition∗
    Fast polynomial factorization and modular composition∗ Kiran S. Kedlayay Christopher Umansz MIT Caltech June 13, 2011 Abstract We obtain randomized algorithms for factoring degree n univariate polynomials over Fq requiring O(n1:5+o(1) log1+o(1) q + n1+o(1) log2+o(1) q) bit operations. When log q < n, this is asymptotically faster than the best previous algorithms (von zur Gathen & Shoup (1992) and Kaltofen & Shoup (1998)); for log q ≥ n, it matches the asymptotic running time of the best known algorithms. The improvements come from new algorithms for modular composition of degree n univariate poly- nomials, which is the asymptotic bottleneck in fast algorithms for factoring polynomials over finite fields. The best previous algorithms for modular composition use O(n(!+1)=2) field operations, where ! is the exponent of matrix multiplication (Brent & Kung (1978)), with a slight improvement in the exponent achieved by employing fast rectangular matrix multiplication (Huang & Pan (1997)). We show that modular composition and multipoint evaluation of multivariate polynomials are essen- tially equivalent, in the sense that an algorithm for one achieving exponent α implies an algorithm for the other with exponent α + o(1), and vice versa. We then give two new algorithms that solve the prob- lem near-optimally: an algebraic algorithm for fields of characteristic at most no(1), and a nonalgebraic algorithm that works in arbitrary characteristic. The latter algorithm works by lifting to characteristic 0, applying a small number of rounds of multimodular reduction, and finishing with a small number of multidimensional FFTs. The final evaluations are reconstructed using the Chinese Remainder Theorem.
    [Show full text]
  • Oblivious Polynomial Evaluation and Secure Set-Intersection from Algebraic Prfs∗
    Oblivious Polynomial Evaluation and Secure Set-Intersection from Algebraic PRFs∗ Carmit Hazayy Abstract In this paper we study the two fundamental functionalities oblivious polynomial evaluation in the exponent and set-intersection, and introduce a new technique for designing efficient secure protocols for these problems (and others). Our starting point is the [BGV11] technique (CRYPTO 2011) for verifiable delegation of polynomial evaluations, using algebraic PRFs. We use this tool, that is useful to achieve verifiability in the outsourced setting, in order to achieve privacy in the standard two-party setting. Our results imply new simple and efficient oblivious polynomial evaluation (OPE) protocols. We further show that our OPE protocols are readily used for secure set-intersection, implying much simpler protocols in the plain model. As a side result, we demonstrate the usefulness of algebraic PRFs for various search functionalities, such as keyword search and oblivious transfer with adaptive queries. Our protocols are secure under full simulation-based definitions in the presence of malicious adversaries. Keywords: Efficient Secure Computation, Oblivious Polynomial Evaluation, Secure Set-Intersection, Committed Oblivious PRF ∗A preliminary version of this paper appeared in the proceedings of TCC 2015 [Haz15]. yFaculty of Engineering, Bar-Ilan University, Israel. Email: [email protected]. Research partially supported by a grant from the Israel Ministry of Science and Technology (grant No. 3-10883). 1 Introduction Efficient secure two-party computation. Secure two-party computation enables two parties to mutually run a protocol that computes some function f on their private inputs, while preserving a number of security properties. Two of the most important properties are privacy and correctness.
    [Show full text]