It's As Easy As
Total Page:16
File Type:pdf, Size:1020Kb
It’s As Easy As abc Andrew Granville and Thomas J. Tucker Fermat’s Last Theorem (2) xp−1x + yp−1y = zp−1z. In this age in which mathematicians are supposed to bring their research into the classroom, even at We now have two linear equations (1) and (2) (think- ing of xp−1, yp−1, and zp−1 as our variables), which the most elementary level, it is rare that we can turn suggests using linear algebra to eliminate a vari- the tables and use our elementary teaching to help able: Multiply (1) by y and (2) by y, and subtract in our research. However, in giving a proof of Fer- to get mat’s Last Theorem, it turns out that we can use tools from calculus and linear algebra only. This xp−1(xy − yx)=zp−1(zy − yz). may strike some readers as unlikely, but bear with p−1 p−1 − us for a few moments as we give our proof. Therefore x divides z (zy yz ), but since x and z have no common factors, this implies that Fermat claimed that there are no solutions to (3) xp−1 divides zy − yz. (1) xp + yp = zp This is a little surprising, for if zy − yz is nonzero, for p ≥ 3, with x, y, and z all nonzero. If we assume then a high power of x divides zy − yz, something that there are solutions to (1), then we can assume that does not seem consistent with (1). that x, y, and z have no common factor, else we We want to be a little more precise. Since we dif- can divide out by that factor. Our first step will be ferentiated, we evidently never were working with to differentiate (1) to get integers x, y, z, but rather with polynomials. Thus if zy − yz =0, then (y/z) =0, and so y is a con- − − − pxp 1x + pyp 1y = pzp 1z , stant multiple of z, contradicting our statement that y and z have no common factor. Therefore (3) im- and after dividing out the common factor p, this plies that leaves us with (p − 1) degree(x) ≤ degree(zy − yz) Andrew Granville is a Canadian Research Chair of math- ematics at the Université de Montréal. His email address ≤ degree(y) + degree(z) − 1, is [email protected]. since degree(y)= degree(y) − 1 and degree(z)= Thomas J. Tucker is currently a visiting assistant profes- degree(z) − 1. Adding degree(x) to both sides gives sor at the City University of New York Graduate Center. His email address is [email protected]. (4) p degree(x) < degree(x) + degree(y) + degree(z). This article is based on the first named author’s AMS- MAA invited lecture at the Orlando annual meeting in Jan- The right side of (4) is symmetric in x, y, and z. uary 1996. It was prepared when both authors worked at The left side is a function of x simply because of the University of Georgia. The work of both authors is sup- the order in which we chose to do things above. We ported by the NSF. could just as easily have derived the same 1224 NOTICES OF THE AMS VOLUME 49, NUMBER 10 statement with y or z in place of x on the left side Then if we add the first column to the second, we of (4), so that get a(t) c(t) ∆(t)= , p degree(y) < degree(x) + degree(y) + degree(z) a(t) c(t) and and similarly p degree(z) < degree(x) + degree(y) + degree(z). c(t) b(t) ∆(t)= Adding these last three equations together and then c(t) b(t) dividing out by degree(x) + degree(y) + degree(z) implies p<3, by adding the second column to the first, a beau- tiful symmetry. and so Fermat’s Last Theorem is proved! We note that ∆(t) =0 , else ab − ab =0, so b is Well, not quite, but what we have proved (and a scalar multiple of a (with the same argument as so simply) is still of great interest: above), contradicting the hypothesis. Proposition 1. There are no genuine polynomial so- To find the appropriate analogy to (3), we in- lutions x(t),y(t),z(t) ∈ C[t] to x(t)p + y(t)p = z(t)p terpret that as stating that the factors of x (as well with p ≥ 3. By “genuine” we mean that the triple as of y and z) divide our determinant to a high (x(t),y(t),z(t)) is not a polynomial multiple of a power. So now suppose that α is a root of a(t) and solution of (1) in C. that (t − α)e is the highest power of (t − α) which divides a(t). Evidently (t − α)e−1 is the highest That Fermat’s Last Theorem is easy to prove for power of (t − α) which divides a(t), and thus it polynomials is an old result, going back certainly is the highest power of (t − α) which divides as far as Liouville (1851), although his proof, which ∆(t)=a(t)b(t) − a(t)b(t) (since α is not a root of goes through integration, is much more involved b(t)). Therefore (t − α)e divides ∆(t)(t − α). Multi- than that given here. The proof we have presented plying all such (t − α)e together, we obtain above is certainly some years old; for instance, a variant can be found in standard textbooks of fifty a(t) divides ∆(t) (t − α). years ago. After reading through it, one sees that a(α)=0 this argument is easily generalizable to other Dio- phantine problems, though it is not obvious what In fact, a(t) appears on the left side of this equation would be the ultimate generalization. only because we studied the linear factors of a; anal- Mason’s Generalization ogous statements for b(t) and c(t) are also true, and It takes a certain genius to generalize to some- since a(t), b(t), c(t) have no common roots, we can thing far simpler than the original. But what could combine those statements to read possibly be more simply stated, yet more general, (6) a(t)b(t)c(t) divides ∆(t) (t − α). than Fermat’s Last Theorem? It was Richard C. (abc)(α)=0 Mason (1983) who gave us that insight: Look for solutions to The next step is to take the degrees of both sides and see what that gives. Using the three different (5) a + b = c. representations of ∆ above, we have We will just follow through the proof above and degree(a) + degree(b) − 1, see where it leads: Start by assuming, with no loss ≤ − of generality, that a, b, and c are all nonzero poly- degree(∆) degree(a) + degree(c) 1, nomials without common factors (else all three degree(c) + degree(b) − 1. share the common factor and we can divide it out). Then we differentiate to get − The degree of (abc)(α)=0(t α) is precisely the a + b = c . total number of distinct roots of a(t)b(t)c(t). In- Next we need to do linear algebra. It is not quite serting all this into (6) we find that so obvious how to proceed analogously, but what max{degree(a), degree(b), degree(c)} we do learn in a linear algebra course is to put our < #{α ∈ C :(abc)(α)=0}. coefficients in a matrix, and solutions follow if the determinant is nonzero. This suggests defining Put another way, this result can be read as: a(t) b(t) Proposition 2. If a(t),b(t),c(t) ∈ C[t] do not have ∆(t):= . a(t) b(t) any common roots and provide a genuine polyno- mial solution to a(t)+b(t)=c(t), then the maximum NOVEMBER 2002 NOTICES OF THE AMS 1225 of the degrees of a(t),b(t),c(t) is less than the num- their central importance. He showed, amongst ber of distinct roots of a(t)b(t)c(t)=0. other things, that for any finite subset S of Q there is a map π : C ∪ {∞} → C ∪ {∞} for which This is a “best possible” result in the sense that − π(S) ⊆{0, 1, ∞} , and π 1(z) = degree(π) for we can find infinitely many examples where there every z/∈{0, 1, ∞}. We can reinterpret this in is exactly one more zero of a(t)b(t)c(t)=0 than terms of polynomials as follows. the largest of the degrees: for example, the famil- iar identity Proposition 3. For any f (t) ∈ Z[t] there exist a(t),b(t),c(t) ∈ Z[t] which do not have any com- 2 2 − 2 2 2 (2t) +(t 1) =(t +1) mon roots and provide a genuine polynomial so- lution to a(t)+b(t)=c(t) for which f (t) divides or the rather less interesting a(t)b(t)c(t), and such that the maximum of the de- tn +1=(tn +1). grees of a(t),b(t),c(t) is exactly one less than the number of distinct roots of a(t)b(t)c(t)=0. Classifying such polynomial identities leads us naturally to the study of a special class of rational Thus we can use Belyı˘ maps to construct many functions, as we shall see next. “best possible examples” in Proposition 2. As we shall see later, this elegant construction is central Silverman’s Proof to several important results.