An Algorithm for Prime Factorization the Complexity of Factoring

An Algorithm for Prime Factorization The Complexity of Factoring Fact: If a is the smallest number > 1 that divides n, Algorithm PF runs in exponential time: then a is prime. We’re checking every number up to √n Proof: By contradiction. (Left to the reader.) • Can we do better? A multiset is like a set, except repetitions are allowed We don’t know. • • 2, 2, 3, 3, 5 is a multiset, not a set ◦ {{ }} Modern-day cryptography implicitly depends on the • fact that we can’t! PF(n): A prime factorization procedure Input: n N + Output: PFS∈ - a multiset of n’s prime factors PFS := for a =2to∅ √n do if a n then PFS := PF(n/a) a return PFS if PFS| = then PFS := n ∪{{[n is}} prime] ∅ {{ }} Example: PF(7007) = 7 PF(1001) = {{7,}}∪7 PF(143) = {{7, 7,}}∪11 PF(13) = {{7, 7, 11,}}∪13 . {{ }} 1 2 How Many Primes Are There? The distribution of primes Theorem 4: [Euclid] There are infinitely many primes. There are quite a few primes out there: Proof: By contradiction. Roughly one in every log(n) numbers is prime • Suppose that there are only finitely many primes: • Formally: let π(n) be the number of primes n: p1,...,pn. ≤ Prime Number Theorem Theorem: π(n) n/ log(n); Consider q = p p +1 ∼ • 1 ×···× n that is, Clearly q>p1,...,pn, so it can’t be prime. nlim π(n)/(n/ log(n))=1 • →∞ So q must have a prime factor, which must be one of • Why is this important? p1,...,pn (since these are the only primes). Cryptosystems like RSA use a secret key that is the Suppose it is p . • • i product of two large (100-digit) primes. Then p q and p p p ◦ i | i | 1 ×···× n How do you find two large primes? So p (q p p ); i.e., p 1 (Corollary 1) • ◦ i | − 1 ×···× n i | Roughly one of every 100 100-digit numbers is prime Contradiction! ◦ ◦ To find a 100-digit prime; ◦ Largest currently-known prime (as of 5/04): Keep choosing odd numbers at random 24036583 ∗ 2 1: 7235733 digits Check if they are prime (using fast randomized • − ∗ Check www.utm.edu/research/primes primality test) • Keep trying until you find one Primes of the form 2p 1 where p is prime are called ∗ − Roughly 100 attempts should do it Mersenne primes. ∗ Search for large primes focuses on Mersenne primes • 3 4 (Some) Open Problems Involving Greatest Common Divisor (gcd) Primes Definition: For a Z let D(a)= k N : k a ∈ { ∈ | } D(a)= divisors of a . Are there infinitely many Mersenne primes? • { } • Claim. D(a) < if (and only if) a = 0. | | ∞ 6 Goldbach’s Conjecture: every even number greater Proof: If a = 0 and k a, then 0 <k<a. • than 2 is the sum of two primes. 6 | Definition: For a,b Z, CD(a,b) = D(a) D(b) is E.g., 6=3+3, 20=17+3, 28=17+11 ∈ ∩ ◦ the set of common divisors of a, b. This has been checked out to 6 1016 (as of 2003) ◦ × Definition: The greatest common divisor of a and b Every sufficiently large integer (> 1043,000!) is the is ◦ sum of four primes gcd(a,b) = max(CD(a,b)). Two prime numbers that differ by two are twin primes • Examples: E.g.: (3,5), (5,7), (11,13), (17,19), (41,43) gcd(6, 9)=3 ◦ 60,000 • also 4, 648, 619, 711, 505 2 1! gcd(13, 100) = 1 ◦ × ± • Are there infinitely many twin primes? gcd(6, 45) = 3 • All these conjectures are believed to be true, but no one Def.: a and b are relatively prime if gcd(a,b)=1. has proved them. Example: 4 and 9 are relatively prime. • Two numbers are relatively prime iff they have no • common prime factors. Efficient computation of gcd(a,b) lies at the heart of com- mercial cryptography. 5 6 Least Common Multiple (lcm) Computing the GCD Definition: The least common multiple of a,b N +, There is a method for calculating the gcd that goes back lcm(a,b), is the smallest n N + such that a ∈n and to Euclid: ∈ | b n. Recall: if n > m and q divides both n and m, then | • Formally, M(a)= ka k N – the multiples of a q divides n m and n + m. • { | ∈ } − Define CM(a,b)= M(a) M(b) – the common mul- Therefore gcd(n, m) = gcd(m, n m). • ∩ − tiples of a and b Proof: Show CD(m, n)= CD(m, n m); i.e., that q • − lcm(a,b) = min(CM(a,b)) divides both n and m iff q divides both m and n m. • (If q divides n and m, then q divides n m by− the Examples: lcm(4, 9) = 36, lcm(4, 10) = 20. − • argument above. If q divides m and n m, then q divides m +(n m)= n.) − − This allows us to reduce the gcd computation to a • simpler case. We can do even better: gcd(n, m) = gcd(m, n m) = gcd(m, n 2m)= ... • − − keep going as long as n qm 0— n/m steps • − ≥ b c Consider gcd(6, 45): 45/6 = 7; remainder is 3 (45 3 (mod 6)) •b c ≡ gcd(6, 45) = gcd(6, 45 7 6) = gcd(6, 3)=3 • − × 7 8 We can keep this up this procedure to compute gcd(n1,n2): Euclid’s Algorithm If n n , write n as q n + r , where 0 r <n • 1 ≥ 2 1 1 2 1 ≤ 1 2 q1 = n1/n2 ◦ b c Input m, n [m, n natural numbers, m n] gcd(n1,n2) = gcd(r1,n2); r1 = n1 mod n2 num m; denom n [Initialize num and denom≥ ] • ← ← Now r1 <n2, so switch their roles: repeat until denom =0 • q num/denom n2 = q2r1 + r2, where 0 r2 <r1 ←b c • ≤ rem num (q denom)[num mod denom = rem] ← − ∗ gcd(r1,n2) = gcd(r1,r2) num denom [New num] • denom← rem [New denom; note num denom] Notice that max(n1,n2) > max(r1,n2) > max(r1,r2) • endrepeat← ≥ Keep going until we have a remainder of 0 (i.e., some- • Output num [num = gcd(m, n)] thing of the form gcd(rk, 0) or (gcd(0,rk)) This is bound to happen sooner or later Example: gcd(84, 33) ◦ Iteration 1: num = 84, denom = 33, q = 2, rem = 18 Iteration 2: num = 33, denom = 18, q = 1, rem = 15 Iteration 3: num = 18, denom = 15, q = 1, rem =3 Iteration 4: num = 15, denom = 3, q = 5, rem =0 Iteration 5: num = 3, denom =0 gcd(84, 33) = 3 ⇒ gcd(84, 33) = gcd(33, 18) = gcd(18, 15) = gcd(15, 3) = gcd(3, 0)=3 9 10 Euclid’s Algorithm: Correctness Euclid’s Algorithm: Complexity How do we know this works? Input m, n [m, n natural numbers, m n] We have two loop invariants, which are true each num m; denom n [Initialize num and denom≥ ] • time we start the loop: repeat← until denom← =0 gcd(m, n) = gcd(num, denom) q num/denom ◦ rem←b num (q cdenom) num denom ← − ∗ ◦ ≥ num denom [New num] At the end, denom = 0, so denom← rem [New denom; note num denom] • gcd(num, denom)= num. endrepeat← ≥ Output num [num = gcd(m, n)] How many times do we go through the loop in the Eu- clidean algorithm: Best case: Easy. Never! • Average case: Too hard • Worst case: Can’t answer this exactly, but we can get • a good upper bound. See how fast denom goes down in each iteration. ◦ 11 12 Claim: After two iterations, denom is halved: The Extended Euclidean Algorithm Recall num = q denom + rem. Use denom0 and • ∗ denom00 to denote value of denom after 1 and 2 iterations. Two cases: Theorem 5: For a,b N, not both 0, we can compute s, t Z such that ∈ 1. rem denom/2 denom0 denom/2 and ∈ ≤ ⇒ ≤ denom00 < denom/2. gcd(a,b)= sa + tb. 2. rem > denom/2. But then num0 = denom, Example: gcd(9, 4)=1=1 9+( 2) 4. • · − · denom0 = rem. At next iteration, q = 1, and Proof: By strong induction on max(a,b). Suppose with- denom00 = rem0 = num0 denom0 < denom/2 − out loss of generality a b. How long until denom is 1? ≤ • ≤ If max(a,b) = 1, then must have b = 1, gcd(a,b)=1 < 2log2(m) steps! • ◦ gcd(a,b)=0 a +1 b. After at most 2log2(m) steps, denom = 0. ◦ · · • If max(a,b) > 1, there are three cases: • If a = 0, then gcd(a,b)= b =0 a +1 b ◦ · · If a = b, then gcd(a,b)= a =1 a +0 b ◦ · · If 0 <a<b, then gcd(a,b) = gcd(a,b a). ◦ Moreover, b = max(a,b) > max(a,b a). Thus,− by IH, we can compute s, t such that− gcd(a,b) = gcd(a,a b)= sa+t(b a)=(s t)a+tb. − − − Note: this computation basically follows the “recipe” of Euclid’s algorithm. 13 14 Example Some Consequences Recall that gcd(84, 33) = gcd(33, 18) = gcd(18, 15) = Corollary 2: If a and b are relatively prime, then there gcd(15, 3) = gcd(3, 0)=3 exist s and t such that as + bt = 1. We work backwards to write 3 as a linear combination of Corollary 3: If gcd(a,b)=1 and a bc, then a c. | | 84 and 33: Proof: 3 =18 15 Exist s, t Z such that sa + tb =1 [Now− 3 is a linear combination of 18 and 15] • ∈ = 18 (33 18) Multiply both sides by c: sac + tbc = c − − • = 2(18) 33 Since a bc, a sac + tbc, so a c [Now− 3 is a linear combination of 18 and 33] • | | | Corollary 4: If p is prime and p Πn a , then p a = 2(84 2 33)) 33 | i=1 i | i − × − for some 1 i n.

An Algorithm for Prime Factorization the Complexity of Factoring

Algorithmic Factorization of Polynomials Over Number Fields

Number Theory Learning Module 3 — the Greatest Common Divisor 1

Subclass Discriminant Nonnegative Matrix Factorization for Facial Image Analysis

Primality Testing for Beginners

1. the Euclidean Algorithm the Common Divisors of Two Numbers Are the Numbers That Are Divisors of Both of Them

Finite Fields: Further Properties

The Pollard's Rho Method for Factoring Numbers

RSA and Primality Testing

Prime Factorization

Performance and Difficulties of Students in Formulating and Solving Quadratic Equations with One Unknown* Makbule Gozde Didisa Gaziosmanpasa University

Algebraic Number Theory Summary of Notes

DISCRIMINANTS in TOWERS Let a Be a Dedekind Domain with Fraction