Matematisk Institut Mat 3AL 2.1 Chapter II. Rings and polynomials
PREREQUISITES ABOUT RINGS AND IDEALS.
A ring R is a non-empty set with two compositions + and · such that 1) (R, +) is an abelian group; 2) (R, ·) is associative; 3) The distributive laws hold: a · (b + c)=a · b + a · c (a + b) · c = a · c + b · c. The neutral element in R with respect to addition + is denoted by 0. A neutral element with respect to multiplication · is uniquely determined and is called the identity element or unit element and is usually denoted by 1 (or sometimes by e). If (R, ·) is commutative, R is called commutative In the following we shall only consider commutative rings with identity element. We recall some basic definitions and theorems which should be known from Anders Thorup Mat 2ALG. For any element a of R the distributive law implies:
a · 0=a · (0 + 0) = a · 0+a · 0, hence a · 0=0.Thusa · b = 0 if either a =0orb =0. A commutative ring for which the converse holds, i.e. a commutative ring for which the product of two elements a and b can only be 0 if either a =0orb =0,is called an integral domain. A commutative ring R (with at least two elements) is called a field, if every non- zero element a in R has an inverse, i.e. an element a−1 ∈ R such that a · a−1 =1. Such an inverse is necessarily uniquely determined. (Why?) A field is necessarily an integral domain. Indeed, assume a =0and a · b =0.Ifa has an inverse a−1 the equation a · b = 0 implies 1 · b = a · a−1 · b = 0; hence b =0. Exercise 2.1. Prove that an integral domain R (with at least two elements) is afieldifR has only finitely many elements. (Hint: if a is a non-zero element prove that the mapping x → a · x from R into itself is injective.) For any integral domain R there exists a fraction field K, i.e. a field K containing R as a subring such that every element x ∈ K is a quotient between two elements of R, i.e. there exist elements a and b of R, a =0forwhich a · x = b. Example 2.2. The rational numbers form a field Q which is the fraction field of the integral domain Z of all ordinary integers. Example 2.3.LetK be a field. The polynomial ring K[x]isanintegraldomain. K[x] is not a field; the field of rational functions K(x) is the fraction field of K[x]. Matematisk Institut Mat 3AL 2.2
The notion of “ideal” is fundamental in ring theory. Definition 2.4. An additive subgroup I of (R, +), R being a commutative ring with an identity element, is called an ideal,ifRI ⊆ I, i.e. if r · a ∈ I for every r ∈ R and every a ∈ I. If I is an ideal of R the cosets (or residue classes) r + I = {r + a | a ∈ I} form a commutative ring, called the residue class ring (or quotient ring)ofR with respect to I. This ring is denoted R/I. If r is an element of R the residue class containing r is often denoted byr ¯.If r¯ ands ¯ are residue classes containing the elements r and s from R the sum, resp. product ofr ¯ ands ¯ is defined byr ¯ +¯s = r + s,resp.¯r · s¯ = r · s. Amapφ from one ring R to another ring S is called a homomorphism if φ sends sums into sums and products to products: φ(r1 + r2)=φ(r1)+φ(r2)and φ(r1 · r2)=φ(r1) · φ(r2) for all r1 and r2 in R. The subset I := {r ∈ R | φ(r)=0} of R is an ideal of R, called the kernel of φ and denoted by Ker(φ). For elements in the residue class ring R/I we use the bar notation (as in group theory (cf. Remark 1.51) by lettingr ¯ denote the residue class r + I The map R → R/I, defined by r → r¯ is obviously a homomorphism whose kernel is I. This map is called the canonical epimorphism from R to R/I. For ring homomorphisms there is an isomorphism theorem quite analogous to Theorem 1.65 for groups. The proof is omitted since it is an obvious modification of the corresponding proof in the group theoretical case. Theorem 2.5. Let φ be a surjective homomorphism from the ring R onto the ring S.IfI is the kernel of φ, then there is an isomorphism ψ from R/I to S.Ifκ is the canonical homomorphism from R to R/I the isomorphism ψ can be chosen such that ψ = φ ◦ κ.
Exercise 2.6. In every ring R clearly R and the set {0} are ideals. Prove that a commutative ring R (with at least two elements) is a field if and only if R and {0} are the only ideals of R. Definition 2.7.AnidealI of R is called a principal ideal if there exists an element a in I such that I = {ra | r ∈ R}.ThisisusuallywrittenI = Ra. Example 2.8. In the ring Z of integers, every ideal has the form Zn for some n in Z. Hence every ideal of Z is principal. Example 2.9.IfK is a field every ideal in the polynomial ring K[x] has the form K[x] · f(x) for some polynomial f(x)inK[x]. Hence every ideal of K[x]isprincipal. Definition 2.10. R is called a principal ideal ring (”PIR”) if every ideal of R is a principal ideal. Matematisk Institut Mat 3AL 2.3
Definition 2.11. R is called a principal ideal domain,(”PID”)ifR is an integral domain and every ideal of R is a principal ideal. Example 2.12. Z and the polynomial ring K[x]overafieldK are principal ideal domains. The residue class ring Z/4Z is not an integral domain, but every ideal is principal. Hence Z/4Z is a principal ideal ring, but not a principal ideal domain. While the polynomial ring in one variable over a field is a principal deal domain, the polynomial ring in more than one variable is not a principal ideal domain. Actu- ally, in that case the ideal consisting of all polynomials with constant term 0 is not principal. Next we introduce some important notions for ideals reflecting properties of the corresponding residue class rings. Definition 2.13.AnidealI of the ring R, I = R, is called a prime ideal if the product of two elements a and b in R can only belong to I if either a ∈ I or b ∈ I. Theorem 2.14. An ideal I of the ring R is a prime ideal if and only if R/I is an integral domain. Proof. ”if”: Assume R/I is an integral domain. Let a and b be any two elements of R not lying in I. The corresponding residue classes in R/I represented by a and b are non-zero in R/I;sinceR/I is an integral domain the product of these two residue classes is non-zero. This product is represented by a · b, which therefore cannot lie in I. Hence I is a prime ideal. ”only if”: Assume I is a prime ideal. Let a and b in R represent two non-zero residue classesa ¯ and ¯b in R/I.SinceI is a prime ideal, a · b is not in I.Nowa · b represents the producta ¯ · ¯b of the two residue classes which cannot be zero because a · b/∈ I. Hence R/I is an integral domain.
Example 2.15. In the ring Z of ordinary integers the prime ideals are the principal ideals generated by prime numbers and the zero ideal. Definition 2.16.AnidealI of the ring R, I R, is called maximal,ifI is maximal among the ideals of R that are = R. Theorem 2.17. An ideal I of the ring R is a maximal ideal if and only if R/I is a field. Proof. ”if”: Assume R/I is a field. Let M be an ideal of R containing I as a proper subset. Choose an element a ∈ M \ I. The residue classa ¯ (with respect to I)is=0, hence has an inverse, say ¯b,(b ∈ R) in the field R/I. Therefore a · b can be written as 1+(an element ∈ I). But this means that M contains 1 and therefore equals R. ”Only if”: Assume I is a maximal ideal. Leta ¯,(a ∈ R) be a non-zero residue class in R/I. Clearly a is not in I.ThesetM of elements of the form a · r + c, r running through the elements of R and c running through the elements of I,isanidealM of R.Sincea is not in I,theidealM contains I strictly and hence M = R.This Matematisk Institut Mat 3AL 2.4
implies that 1 = a · r + c for some r in R and some c in I. But this in turn means that 1=¯¯ a · r¯ in R/I which therefore is a field.
Example 2.18. In the ring R = Z[x]theidealI = {f(x) ∈ Z[x] | f(0) = 0} is a non-maximal prime ideal. In this connection the following holds. We omit the proof. Theorem 2.19. Every non-zero prime ideal in a principal ideal domain is maximal.
Important example 2.20 In the ring Z every non-zero prime ideal is the principal ideal generated by a prime number p. By the above result the residue class ring Z/Zp is a field. This can also be checked immediately since every finite integral domain (with more than one element) is necessarily a field. This field is sometimes denoted Zp and sometimes Fp. We shall also need the concept ”factorial ring” or ”ring with unique factorization”, denoted by UFD. We shall refrain from giving detailed proofs, but just explain the notions and the basic results in this subject. Let R be an integral domain. A non-zero element a in R which is not invertible is called an irreducible element if a has only trivial factorizations, i.e. a = r · s, r, s ∈ R implies that either r or s is invertible in R. Two non-zero elements a and b of an integral domain R are called associate if they generate the same principal ideal, i.e. if Ra = Rb. It is easily checked that a and b are associate if and only if a = b·(an invertible element) if and only if b = a·(an invertible element). R is called a UFD,ifR is an integral domain and every non-invertible non- zero element in a ”basically unique” way can be written as a product of irreducible elements. Here “basically unique” means the following: Assume an element a has two factorizations of irreducible elements:
a = π1 · π2 ···πn and a =˜π1 · π˜2 ···π˜m, then m = n and after a suitable permutation of the π’s every πi , i =1, 2,...,n, is associate toπ ˜i. A characteristic property of UFD’s is the following: If an irreducible element π divides a product a · b then π divides one of the factors a and b.Inotherwordsthe principal ideal generated by an irreducible element is a prime ideal. The classical examples of UFD’s are the ring Z of ordinary integers and the ring of polynomials in one variable over a field. Those two rings are PID’s. A general result is the following which we state without proof: Matematisk Institut Mat 3AL 2.5
Theorem 2.21. Every principal ideal domain is a unique factorization domain.
Remark 2.22. The converse is not true. The ring of polynomials in more than one variable over a field is not a principal ideal domain, but it can be shown to be a unique factorization domain.
In a UFD the principal ideal generated by an irreducible element is a non-zero prime ideal. If R is a PID an ideal I of R is a non-zero prime ideal exactly when I = Rπ for some irreducible element π in R. Combining this remark with Theorem 2.19 we get: Theorem 2.23. Let R be a principal ideal domain. A non-zero ideal of R is a prime ideal if and only if it has the form Rπ for some irreducible element π. For any non-zero prime ideal Rπ the residue class ring R/Rπ is a field.
FACTORIZATIONS OF POLYNOMIALS.
We consider the polynomial ring R[x], where R is either a field or a PID, (in particular for instance R being the ring of integers). The following observation is often useful: Let ϕ be a homomorphism of R onto R∗,whereR and R∗ are commutative rings. The mapping
n n Φ(r0 + r1x + ···+ rnx )=ϕ(r0)+ϕ(r1)x + ···+ ϕ(rn)x defines a homomorphism Φ of R[x]ontoR∗[x]. n Definition 2.24.Apolynomialf(x)=a0 + ···+ anx ∈ Z[x] is called primitive,if n the greatest common divisor (a0,...,an) is 1. In particular, if f(x)=a0+···+anx ∈ Z[x], is monic, i.e. an =1,thenf(x) is primitive. Theorem 2.25. (Gauss). The product of two primitive polynomials f(x) and g(x) is primitive. Proof. Indirectly. Assume f(x) · g(x) were not primitive; then there would exist a prime number p such that p divides all coefficients of f(x)g(x). Consider the homomorphism ϕ : Z → Zp = Fp defined by sending an integer into its residue class modulo p.ThenΦ(f) =0andΦ( g) = 0, but Φ(f · g)=0,whichis a contradiction since Φ is a homomorphism and Zp[x] is an integral domain. Lemma 2.26. Let f(x) be a primitive polynomial in Z[x],andq ∈ Q.Thenq = ±1 if qf(x) is a primitive polynomial. r ··· Proof. Write q = s ,wherer and s are mutually prime integers. Let f(x)=a0 + + n r n ra0 ran n anx .Thens (a0 + ···+ anx )= s + ···+ s x ∈ Z[x] ⇒ s|rai ∀i ⇒ s|ai ∀i ⇒ Matematisk Institut Mat 3AL 2.6 r a0 an n s = ±1(sincef(x) is primitive). Hence s f(x)=r s + ···+ s x is primitive ⇒ r = ±1.
The proofs of the following lemmas are quite straightforward and are left to the reader.
Lemma 2.27. For any non-zero polynomial f(x) in Q[x] there exists a rational number q such that qf(x) is primitive.
Lemma 2.28. Let f(x) be a polynomial in Z[x].Ifq is a rational number such that qf(x) is primitive then 1/q is an integer.
Lemma 2.29. If f(x) is primitive and f(x)=g(x)h(x), g(x) and h(x) being poly- nomials in Z[x],theng(x) and h(x) are primitive.
Remark 2.30. The invertible elements in Z[x]are±1.
Theorem 2.31. Let f(x) be a polynomial in Z[x].Then: i) Degree f(x)=0: f(x) is irreducible in Z[x] ⇔ f(x)=±p, p for a prime number p. ii) Degree f(x) > 0: f(x) is irreducible in Z[x] ⇔ f(x) is primitive and f(x) is irreducible in Q[x].
Proof. i) is clear. ii) ⇐ assume f(x) had a non-trivial factorization f(x)=g(x)h(x), g(x)and h(x) ∈ Z[x], where g(x), h(x)arenotinvertibleiZ[x]. Since f(x) is primitive the degrees of g(x)andh(x)mustbe> 0, hence f(x) would have a non-trivial decomposition in Q[x]. Contradiction! ⇒ It is clear, that f(x) must be primitive. Assume f(x) had a non-trivial factor- ization in Q[x]: f(x)=g(x)h(x), g(x)andh(x) not being constants. By lemma 2.27 there exist rational numbers q1 and q2 such that q1g(x)andq2h(x) are primitive. q1q2f(x)=[q1g(x)][q2h(x)] .
By Gauss’ theorem q1q2f(x) is primitive. Since f(x) is primitive, lemma 2.26 implies that q1q2 = ±1, and hence
f(x)=(± q1g(x)) · (q2h(x))
Thus f(x) would have a non-trivial factorization in Z[x]. Consequently f(x)must be irreducible as an element in Q[x]. Matematisk Institut Mat 3AL 2.7
Theorem 2.32. Z[x] is a UFD. Proof. 1) Every f(x) ∈ Z[x] is a product of irreducible polynomials. Indeed, if f(x) is constant, i.e. is an ordinary integer, then the assertion follows since Z is a UFD. If a polynomial in Z[x] has positive degree, it can be written as (a non-zero integer)·(a primitive polynomial). Therefore it suffices to show that any primitive polynomial f(x) is a product of irreducible polynomials. We proceed by induction on the degree of the polynomial. If the degree is one the assertion is clear. Assume the assertion has been proved for all primitive polynomials of degree the homomorphism ϕ : Z → Zp and the homomorphism Φ : Z[x] → Zp[x] Then 0 = Φ(f(x)g(x)) = Φ(f(x)) · Φ(g(x)). Since Zp[x]isanintegraldomain,Φ(f(x)) or Φ(g(x)) must be 0, i.e.: p|f(x)or p|g(x). ad ii) Since p(x)|f(x)g(x)inZ[x] it follows that p(x)|f(x)g(x)inQ[x]. Because p(x) is irreducible in Q[x]andQ[x]isUFD, we conclude that p(x)|f(x) or p(x)|g(x)inQ[x]. Assume for instance that p(x)|f(x)inQ[x]. Then we have f(x)=p(x)h(x) ,h(x) ∈ Q[x] . By lemma 2.27 there exists q in Q such that qh(x) is primitive. Consequently we get qf(x)=p(x) · (qh(x)) . As before (Gauss’ Theorem) qf(x) must be primitive. By lemma 2.28 we can write q =1/s for some integer s. Hence: h(x)=s(qh(x)) ∈ Z[x], and thus p(x)|f(x)in Z[x]. Similarly if p(x)|g(x)inQ[x] we conclude that p(x)|g(x)inZ[x]. Remark 2.33. With minor obvious modifications the above proof carries over to the case where Z is replaced by an arbitrary UFD. In other words, generally we have: Matematisk Institut Mat 3AL 2.8 Theorem 2.34. R is a UFD ⇒ R[x] is a UFD. Corollary to Theorem 2.34. For any field K the polynomial ring K[x1,...,xn] is a UFD. Remark 2.35. For later explicit applications a special case of Theorem 2.31 is very important: A monic polynomial in Z[x] is irreducible as an element in Q[x]ifand only if it irreducible as an element in Z[x]. Theorem 2.36 (Sch¨onemann-Eisenstein’s irreducibility criterion). Apoly- n n−1 nomial f(x) ∈ Z[x] of the form f(x)=x + an−1x + ···+ a1x + a0 is irreducible 2 in Q[x], if there exists a prime number p, such that p|an−1,...,p|a1, p|a0, p a0. Proof. By Theorem 2.31 (cf. the above remark 2.35) it suffices to show, that f(x)is irreducible in Z[x]. Assume there were a factorization f(x)=g(x) · h(x), where g(x)andh(x) ∈ Z[x] and g(x)andh(x) both were of degree Φ(f(x)) = xn =Φ(g(x)) · Φ(h(x)) ⇒ Φ(g(x)) = xm ;Φ(h(x)) = xn−m for a suitable m,1≤ m ≤ n − 1. This implies that all coefficients of g(x)andh(x) except the highest one are divisible by p;inparticularp would divide g(0) and h(0) 2 and thereby p |g(0)h(0) = f(0) = a0. Contradiction ! xn−1 Example 2.37 . x−1 is irreducible in Q[x] ⇔ n = prime number. Proof. ⇒: assume n is composite n = n1n2,1 xn − 1 xn1n2 − 1 xn1 − 1 = = (xn1 )n2−1 + ···+ xn1 +1 x − 1 x − 1 x − 1 is reducible. ⇐ xp−1 ⇔ assume n is a prime number p.Thenf(x)= x−1 is irreducible f(x +1)is irreducible. Now by explicit expansion we get (x +1)p − 1 p p p f(x +1)= = xp−1 + xp−2 + ···+ x + . x 1 p − 2 p − 1 p 2 p Since p is a prime number, p divides j for 1 ≤ j ≤ p − 1, and p p−1 = p. xp−1 Hence Theorem 2.36 shows the irreducibility of x−1 . Matematisk Institut Mat 3AL 2.9 Remark 2.38. Theorem 2.36 easily carries over to polynomials over a UFD: Let R be UFD and K its field of fractions. Let π be an irreducible element in R n n−1 and f(x)=x + rn−1x + ···+ r0 a polynomial in R[x]forwhichπ divides ri, 2 0 ≤ i ≤ n − 1, while r0 is not divisible by π ,thenf(x) is irreducible in K[X]. Example 2.39. f(x, y)=xn +yn −1 is irreducible in Q[x, y] for any natural number n. Indeed, just consider f(x, y +1)asanelementof(Q[y])[x] and use the above remark with π = y. n Exercise 2.40. f(x)=Πi=1(x − ai) − 1, where a1,...,an are distinct numbers in n Z, is irreducible in Q[x]. (Is there a similar result for Πi=1(x − ai)+1?) Finally we give an example where we use that every polynomial over Q has roots in the complex field C: n n−1 Example 2.41.Letf(x)=x + a1x + ···+ an−1x ± p be a polynomial in Z[x] where p is a prime number. Then f(x) is irreducible, if p>1+ | a1 | + ···+ | an−1 |. Proof. Assume f(x) had a non-trivial factorization f(x)=g(x)h(x)inZ[x]. Then f(0) = g(0)h(0) = ±p would imply that one of the factors g(0) or h(0), say g(0), were ±1. The product of the roots of g(x)wouldthenbe±1. In particular, there wouldbeatleastonerootα of absolute value 1 .Since α isalsoarootoff(x)we n n−1 would get: α + a1α + ···+ an−1α = ±p, which - by taking absolute values - would imply 1+ | a1 | + ···+ | an−1 | p. Contradiction! Two philosophical questions: Let n be a “random” natural number. What is most probable: That n is a prime number or that n is a composite number? Let f(x) be a “random” polynomial in Q[x]. What is most probable: That f(x) is an irreducible polynomial or that f(x) is a reducible polynomial? SYMMETRIC POLYNOMIALS. Let K be a field. A polynomial f(x1, ..., xn) ∈ K[x1, ..., xn] is called symmetric, if it is invariant under every permutation of the indeterminates x1, ...xn, i.e. if f(x1, ..., xn)=f(xσ(1), ..., xσ(n)) for every permutation σ ∈ Sn. Example 2.42. The polynomials ··· n s1 = x1 + + xn 1 terms ··· n s2 = x1x2 + x1x3 + + xn−1xn 2 terms ··· n s3 = x1x2x3 + + xn−2xn−1xn 3 terms ... n sn = x1x2 ···xn n terms Matematisk Institut Mat 3AL 2.10 are symmetric. This may either been seen directly or by considering the polynomial n n−1 n−2 n (T − x1)(T − x2) ···(T − xn)=T − s1T + s2T + ···+(−1) sn which is invariant under every permutation of x1..., xn; therefore all the coefficients are also invariant under every permutation of x1, ..., xn. These polynomials s1, ..., sn are called the elementary symmetric polynomials in x1, ..., xn. It is clear, that every polynomial in the elementary symmetric polynomial is symmetric in x1, ..., xn. The converse also holds true, as the following important theorem shows. Theorem 2.43. The main theorem about symmetric polynomials. Every symmetric polynomial f(x1, ..., xn) ∈ K[x1, ..., xn],whereK is an arbitrary field, can in exactly one way be written as a polynomial (with coefficients in K)in the elementary symmetric polynomials s1, ..., sn. In other words: for every symmetric polynomial f(x1, .., xn) ∈ K[x1, .., xn] there exists a unique polynomial g(y1, ..., yn) ∈ K[y1, ..., yn], such that f(x1, ..., xn)=g(x1 + ···+ xn,x1x2 + ··· , ..., x1 ···xn). Before giving the proof here are some general remarks about polynomials in several indeterminates. Every polynomial in x1, ..., xn can be written i1 in f(x1, ..., xn)= ai1,...,in x1 ...xn i1,...,in where ai1,··· ,in are elements in K. i 1 in ··· The degree of a term ai1,...,in x1 ...xn ,ai1,...,in = 0 is defined as i1 + + in . By the degree of a non-zero polynomial f we mean the highest degree of a term (=0)in f. For polynomials in more than one indeterminate the degree of the terms is not enough to determine an ordering of these. Therefore we introduce the concept sig- i 1 ··· in nature. By the signature of a term ai1,...,in x1 ...xn we mean the n-tuple i =(i1, ..., in). We order the set of signatures by first ordering them according to the degree and then ordering terms of the same degree lexicographically. This means that for two different signatures (i1, ..., in)and(j1, ..., jn)wehave (i1, ..., in) ≺ (j1, ..., jn) if either i1 + ···+ in Lemma 2.44. The product of two non-zero polynomials in K[x1, ..., xn],whereK is a field, is non-zero, and the term of highest signature in product is the product of the terms of highest signature in the two polynomials. Proof. Exercise. We now return to the proof of the main theorem. First the existence: Let f ∈ K[x1, ..., xn] be symmetric. We may assume f =0. i1 i2 in Let A = cx1 x2 ···xn be the term in f of highest signature. We remark that since f is symmetric, we get i1 ≥ i2 ≥ ... ≥ in. Indeed, assume iμ >iν for some μ>ν.Sincef is left fixed when permuting xμ and xν the polynomial i i1 i2 μ iν in f must also contain the term cx1 x2 ···xν ···xμ ···xn . But the signature of that i i1 i2 iν μ in term is higher than that of A = cx1 x2 ···xν ···xμ ···xn . The terms of highest signature in the elementary symmetric polynomials s1,s2, ..., sn are x1, x1x2,..., x1x2 ···xn. By virtue of the above lemma for any non-negative integers j1,j2, ..., jn the term of j1 j2 jn highest signature in the product s1 s2 ···sn , viewed as a polynomial in x1,x2, ..., xn, is j1 j2 jn x1 (x1x2) ···(x1x2 ···xn) The corresponding signature is (j1 + j2 + ···+ jn,j2 + ···+ jn, ..., jn) If we now choose j1 = i1 − i2,j2 = i2 − i3, ..., jn−1 = in−1 − in and jn = in, (which in view of the above remark are non-negative integers), this signature is just (i1,i2, ..., in). j1 j2 jn The difference f − cs1 s2 ...sn therefore is either 0 or a symmetric polynomial (= 0) for which the term of highest signature is less than (i1,i2, ..., in). If this difference j1 j2 jn is 0 the proof is done. Otherwise we apply the same procedure on f − cs1 s2 ...sn etc.. In this way we eventually reach the zero polynomial, since there are only finitely many signatures less than a given one. Now the uniqueness: It clearly suffices to show that for g(y1, ..., yn) ∈ K[y1, ..., yn],g(y1, ..., yn) =0the polynomial g(s1, ..., sn)=g(x1 + x2 + ···+ xn,x1x2 + ··· , ..., x1x2 ···xn)isnotthe zero polynomial. t1 t2 tn Let dy1 y2 ...yn ,d =0beatermin g.Ifwefory1,y2, ..., yn substitute the elementary symmetric polynomials s1,s2, ...sn the argument above shows that we get a polynomial in x1,x2, ..., xn in which the term of highest signature is t1 t2 tn dx1 (x1x2) ···(x1x2 ···xn) . Matematisk Institut Mat 3AL 2.12 The signature of this term is (t1 + t2 + ···+ tn,t2 + ···+ tn, ..., tn). If we now first consider the terms in g for which t1 + t2 + ···+ tn is biggest, then among these terms those for which t2 + ···+ tn is biggest, etc. in g we get t1 t2 tn separated a certain term dy1 y2 ...yn with the property that this and only this term by substitution of the elementary symmetric polynomials gives rise to a term with signature (t1 + t2 + ···+ tn,t2 ···+ tn, ..., tn). This term can not be cancelled away against any other term. Consequently g(x1 + x2 + ···+ xn,x1x2 + ··· ..., ..., x1x2 ···xn) is not the zero polynomial. Matematisk Institut Mat 3AL 2.13 ALGEBRAIC EXTENSIONS. Let K be a subfield of the field L. For an element α ∈ L we denote by K[α] the smallest subring of L containing K and α. The ring K[α] consists of all elements in n L that can be written in the form k0 + k1α + ···+ knα , ki ∈ K, n ∈ N. By K(α)wedenotethe smallest subfield of L containing K and α. The field K(α) consists of all elements in L that can be written in the form n k0 + k1α + ···+ knα n , k0 + k1α + ···+ knα ki, ki ∈ K, n ∈ N and the denominator =0. K(α) is clearly the field of fractions of K[α]. For a given α ∈ L, L K we consider the homomorphism Φ : K[x] → K[α] defined by Φ(f(x)) = f(α). Clearly Φ is surjective. We distinguish between two cases: 1) Ker Φ = 0, i.e. K[α] K[x] which is not a field. In this case K(x) is isomorphic to K(α) and we say that α is transcendent over K. 2) Ker Φ = 0. Since Ker Φ is an ideal in K[x]andK[x]isaPID,thekernel Ker Φ is the principal ideal K[x]p(x) generated by a polynomial p(x) =0.Bythe epimorphism theorem for rings (Theorem 2.5) we get K[x]/ Ker Φ K[α]. Ker Φ is therefore a prime ideal =0.Since K[x] is a PID, Ker Φ is a maximal ideal (Theorem 2.19), therefore (Theorem 2.17) K[α] is a field and hence K(α)=K[α]. In this case α is called algebraic over K.Thatα is algebraic over K thus means, that α is root of a proper polynomial (i.e. = 0) with coefficients in K. Now Ker Φ = K[x]p(x), where p(x) is uniquely determined up to an invertible factor in K[x], i.e.: up to an constant in K. The uniquely determined monic (i.e.: highest coefficient = 1) polynomial in K[x] generates Ker Φ and is denoted Irr(α, K). This polynomial is irreducible in K[x], (cf. Theorem 2.23). Let f(x) ∈ K[x] be a polynomial for which f(α)=0.Thenf(x) = Irr(α, K) · g(x)forsomeg(x) ∈ K[x]. This implies, that Irr(α, K) can be characterized as the uniquely determined monic irreducible polynomial in K[x] having α as a root. Irr(α, K) can also be characterized as the uniquely determined monic polynomial in K[x] of lowest degree having α as a root. Therefore Irr(α, K) is sometimes called α’s minimal polynomial w.r.t. K. From the above we, in particular, conclude that a polynomial in K[x]hasα as a root if and only if it is divisible by Irr(α, K). Matematisk Institut Mat 3AL 2.14 Using the division algorithm every polynomial f(x) ∈ K[x] can be written uniquely in the form f(x) = Irr(α, K) · g(x)+r(x) , degree r(x) If we in this equation insert α for x we get f(α)=r(α), hence every element in K[α] can be written n−1 a0 + a1α + ···+ an−1α ,a0,...,an−1 ∈ K. This representation is unique. Indeed, assume that an element β in K[α]hadthe representations n−1 β = a0 + a1α + ···+ an−1α and n−1 β = a0 + a1α + ···+ an−1α , where a0,a1, ..., a0,a1, ... belong to K,thenα would be root of the polynomial n−1 a0 − a0 +(a1 − a1)x + ···+(an−1 − an−1)x .Butα is not a root in any non- zero polynomial i K[x]ofdegree Definition 2.45.AnextensionL ⊇ K of fields is called algebraic, if every element in L is algebraic over K. We often just write: L/K is algebraic. Before continuing we bring some general remarks, which we shall need a lot of times. If L is an extension field of K and α1, ··· ,αs are elements in L we denote by K(α1,... ,αs) the smallest subfield of L containing K and α1,... ,αs.Itisclear, that K(α1,... ,αs)=K(α1) ...(αs). An extension field L of K can be considered as a vector space over K and thus has a dimension (i.e. the number of elements of a basis), which is denoted [L : K]. If this dimension is a finite number n any family of m elements ω1,... ,ωm, m>n, will be linearly dependent over K, i.e. there exist elements k1, ··· ,km ∈ K, not all 0 such that k1ω1 + ···+ kmωm =0. Matematisk Institut Mat 3AL 2.15 Theorem 2.46. Let L be a field extension of K for which [L : K] < ∞.ThenL/K is algebraic. Proof. Assume [L : K]=n<∞. For every element α in L the elements 1,α,α2,...,αn are linearly dependent over K; hence there exist elements k0,k1,...,kn in K, not all 0, such that n k0 + k1α + ···+ knα =0. Consequently α is a root of the non-zero polynomial n k0 + k1x + ···+ knx ∈ K[x] . Thus every element α in L is algebraic over K. Theorem 2.47. (Transitivity Theorem). Let K L M be fields. Then [M : K]=[M : L][L : K],where[M : L] and [L : K] are assumed to be finite. Proof. If α1,α2,...,αs form a K-basis for L and β1,β2,...,βt a L-basis for M,then αiβj,1 i s,1 j t,formaK-basis for M. For this we have to show two things: i) The elements αiβj ,1 i s,1 j t, are linearly independent over K. ii) Every element in M canbewrittenasaK-linear combination of the elements αiβj,1 i s,1 j t. Ad i) Assume kij αiβj =0, i,j where the kij ’s belong to K. This equation can be written ( kij αi)βj =0 j i For each j the inner sum is an element in L. Since the elements βj, 1 j n, are independent over L,weget kij αi =0 i for every j,1 j n. Since the elements αi, 1 i s, are linearly independent over K, we conclude that kij =0foralli, 1 i s and all j, 1 j t. Ad ii) Let ξ be an element in M.Sinceβj, 1 j t form a L-basis for M,we can write ξ as ξ = jβj , j Matematisk Institut Mat 3AL 2.16 where each j lies in L.Sinceαi, 1 i s,formaK-basis for L,each j can be written j = kij αi, i where every kij lies in K. This implies ξ = kij αiβj . i,j Thus ii) has been proved. Remark 2.48. The transitivity theorem is extremely important in explicit compu- tations. A particular application is the following: Let M/K be a finite extension and L any field between K and M, K L M.Then[L : K] divides [M : K]. To take an explicit example√ let α be the real root of any irreducible polynomial in Q[x]ofdegree3.Then 2 is not contained in Q(α). (A direct verification without use of the above remark is not completely trivial.) Another example is the following. Let a and b be rational√ numbers√ such√ that neither a, b or ab is the square of a rational number. Then a/∈ Q and b/∈ (Q( a)) (why?).√ √ This implies that√ √ √ √ [Q( a, b):Q]=[Q( a, b):Q( a)] · [Q( a):Q]=2· 2=4. Theorem 2.49. Let L beafieldextensionofK,andletα and β be elements in L which are algebraic over K.Thenα + β, α − β, α · β and α/β,(β =0 )arealso algebraic over K. Proof. K(α)=K[α] is a finite dimensional vector space over K.Sinceβ in particular is algebraic over K(α)=K[α], it follows that (K(α))(β)=K(α, β) is finite dimen- sional over K(α). By theorem 2.47 K(α, β) has finite dimension over K. Therefore by theorem 2.46 the field K(α, β) is algebraic over K. Since α + β, α − β, α · β and α/β,(β = 0) lie in K(α, β), these elements are algebraic over K. Definition 2.50.LetL be a field extension of K. The subset of L consisting of the elements that are algebraic over K (which by the above theorem is a subfield of L) is called the algebraic closure of K in L and is denoted K. Example 2.51. There exist infinite dimensional algebraic field extensions.√ Let L = R and K = Q. For every natural number n theorem 2.36 implies that [Q( n 2) : Q]=n. Therefore the algebraic closure of Q in L = R has infinite dimension over Q. Theorem 2.52. Let K ⊆ L ⊆ M be fields. If L/K is algebraic, every element ξ ∈ M that is algebraic over L is also algebraic over K. Matematisk Institut Mat 3AL 2.17 Proof. ξ is root of a non-zero polynomial in L[x]: n n−1 ξ + a0ξ + ···+ an−1 =0,a0,...,an−1 ∈ L. Since a0 is algebraic over K, the dimension [K(a0):K] is finite. Since a1 is algebraic over K,inparticularoverK(a0), the dimension [K(a0,a1): K(a0)] is finite. Since a2 is algebraic over K,inparticularoverK(a0,a1), the dimension [K(a0,a1,a2): K(a0,a1)] is finite. etc. Since an−1 is algebraic over K,inparticularoverK(a0,...,an−2), the dimension [K(a0,...,an−1):K(a0,...,an−2)] is finite. Since ξ is algebraic over K(a0,...,an−1), the dimension K(ξ,a0,...,an−1): K(a0,...,an−1)] is finite. By successive application of the transitivity theorem it follows that the dimension [K(ξ,a0,...,an−1):K] is finite; therefore Theorem 2.46 shows that ξ is algebraic over K. Corollary 2.53. (Transitivity for algebraic extensions). Let K ⊆ L ⊆ M be fields. Then M/L algebraic ∧ L/K algebraic ⇒ M/K algebraic. = Corollary 2.54.. Let K ⊆ L be fields. Then K = K. Exercise 2.55.Letα and β be complex numbers, that are algebraic over Q of degree p,resp. q. Show that α + β is algebraic over Q of degree pq,ifp and q are distinct prime numbers. −−−−−−−−−−−−−−− ADJUNCTION OF A ROOT. SPLITTING FIELDS. So far we have considered given existing field extensions and looked at polynomials vanishing on certain elements. Now we conversely consider a field K and a polynomial p(x) ∈ K[x] and are looking for extensions of K in which p(x)hasaroot. Theorem 2.56. Existence theorem concerning adjunction of a root of an irreducible polynomial. Let K be an arbitrary field and p(x)=a0 + a1x + ···+ n−1 n ∗ an−1x + x an irreducible monic polynomial in K[x]. Then there exists a field L containing a subfield K∗ with the following properties: 1) there exists an element α∗ in L∗ which is algebraic over K∗ such that L = K∗(α∗); 2) there exists an isomorphism ϕ of K onto K∗ such that Irr(α∗,K∗)=Φ(p(x)), where Φ is the isomorphism of K[x] onto K∗[x] induced by ϕ. Matematisk Institut Mat 3AL 2.18 Proof. Let L∗ be the residue class ring K[x]/K[x]p(x), which is a field, since p(x) is irreducible and K[x] is an PID. For a polynomial f(x) ∈ K[x] the corresponding residue class in L∗ is denoted f(x). Then: ∗ n−1 L = {k0 + k1x + ···+ kn−1x | k0,k1,...,kn−1 ∈ K} n−1 = {k0 + k1x + ···+ kn−1x | k0,k1,...,kn−1 ∈ K} . We set K∗ = {k | k ∈ K}, which becomes a subfield of L∗. The map defined by ϕ(k)=k is an isomorphism of K onto K∗.Nowx is a root of the polynomial n−1 n a0 + a1x + ···+ an−1x + x which is irreducible in K∗[x]; hence ∗ n−1 n Irr(x, K )=a0 + a1x + ···+ an−1x + x =Φ(p(x)). This proves the theorem, since x can be used as α∗. Remark to theorem 2.56 By identifying K with K∗ we may consider L∗ as an extension field of K in which p(x) has a root. An extension (in the more immediate sense of the word) can be obtained by considering the (set theoretical) disjoint union of K and L∗ \ K∗ and by the above construction ”transplanting” the operations addition and multiplication in this union; in this way one obtains a field containing K as a subfield and containing a root of p(x). Theorem 2.57. Uniqueness concerning adjunction of a root of an ir- reducible polynomial. Assume we have fields L, K, L∗,K∗, such that L ⊇ K, L∗ ⊇ K∗, L = K(α), L∗ = K∗(α∗),whereα is algebraic over K and α∗ algebraic over K∗. Assume further that there is an isomorphism ϕ from K onto K∗ such that Φ Irr(α, K) = Irr(α∗,K∗),whereΦ denotes the isomorphism from K[x] onto K∗[x] induced by ϕ. Then there exists a uniquely determined isomorphism ϕ from L onto L∗ such that 1) ϕ Res,K = ϕ; 2) ϕ (α)=α∗. Proof. Let p(x) = Irr(α, K)andp∗(x) = Irr(α∗,K∗) and assume these polynomials have the degree n. We first prove the uniqueness of ϕ .Evidently n−1 L = {k0 + k1α + ···+ kn−1α | k0,k1,...,kn−1 ∈ K} . If there exists an isomorphism ϕ with the properties 1) and 2), then n−1 ∗ ∗ n−1 ϕ (k0 + k1α + ···+ kn−1α )=ϕ(k0)+ϕ(k1)α + ···+ ϕ(kn−1)(α ) . Matematisk Institut Mat 3AL 2.19 Therefore there is at most one possibility for ϕ . Next we prove the existence of ϕ . There is an isomorphism ϕ1 : K[x]/(p(x) → L defined by ϕ1(f(x)) = f(α), f(x) ∈ K[x] and an isomorphism ∗ ∗ ∗ ϕ2 : K [x]/(p (x)) → L ∗ ∗ defined by ϕ2(g(x)) = g(α ) ,g(x) ∈ K [x]. Further the isomorphism K[x] Φ- K∗[x] induces an isomorphism e K[x]/(p(x)) Φ- K∗[x]/(p∗(x)) by Φ( f(x) modulo p(x)) = Φf(x) modulo p∗(x) , thus in particular we have Φ(( x) modulo p(x)) = x modulo p∗(x) .