An Elementary "Proof" of Fermat's Last Theorem
|
In the three centuries since its proposal, Fermat's Last Theorem has led to the development an enormous amount of mathematics, both good and bad. On the good side, it would not be too far-fetched to assert that much of modern algebraic number theory has sprung from attempts to prove what Fermat is alleged to have written in the margin of his copy of Diophantus in 1637. However, because of the problem's simple statement it has also generated a mountain of material which as best was wrong and at worst not even classifiable as mathematics. For the past seven years, James Harris has been a regular fixture on the Usenet
newsgroup, |
|
We begin by establishing four preliminary results. | |
|
Lemma 1. If p is an odd prime, p(r + s)rs | (r + s)p - rp - sp. Denote by Hp the polynomial | |
Hp = ((r + s)p - rp - sp) / (p(r + s)rs). | |
|
Then Hp(r, s) = Gp(r + s, rs), where Gp is a polynomial with integer coefficients. Proof. We observe that each term of (r + s)p - rp - sp has as its coefficient an order-p binomial coefficient, and it is well known that if p is prime, all of these will be divisible by p. It is easy to verify that both rs and r + s divide (r + s)p - rp - sp. To establish the last statement of the Lemma, we observe that H is a symmetric polynomial and that any such polynomial may be expressed in terms of the elementary symmetric functions, which in this 2-variable case are r + s and rs. [Note 1.] Lemma 2. Let p be an odd prime and x, y, z be integers satisfying xp + yp = zp and let v be an arbitrary integer. Then | |
|
(vp + 1)z2p - pv(xy)2 z2 Gp(vz2, (xy)2) - 2 (xy)p = 0 mod (x2 + y2 + vz2), | |
|
where Gp is defined in Lemma 1. Proof. For any integer v we have, trivially, | |
|
x2 + y2 = -vz2 mod (x2 + y2 + vz2). | |
|
Raising both sides to the power p gives | |
|
(x2 + y2)p = -(vz2)p mod (x2 + y2 + vz2). | |
|
Expanding the left side and using Lemma 1 gives | |
|
x2p + p(xy)2(x2 + y2)Hp(x2, y2) + y2p = -(vz2)p mod (x2 + y2 + vz2). | |
|
Since we assume that x, y, and z satisfy xp + yp = zp we can use (xp + yp)2= z2p to eliminate x2p and y2p in the congruence above, yielding | |
|
(vp + 1)z2p - p(xy)2(x2 + y2)Hp(x2, y2) - 2(xy)p = 0 mod (x2 + y2 + vz2). | |
|
Again referring to Lemma 1, we see that Hp(x2, y2) may be expressed as a polynomial in x2 + y2 and (xy)2, so modulo x2 + y2 + vz2 may be denoted by Gp(vz2, (xy)2), yielding the desired result. Lemma 3. Let p be an odd prime. Let x, y, z be pairwise coprime positive integers satisfying xp + yp = zp. Then
Proof. First, it is easy to show that no two of z - y, z - x, x + y can be 1. If, for example, z - y = z - x = 1, then we would have x = y, contradicting the relative primality of x and y (since we obviously couldn't have x = y = 1). The other two cases are handled similarly. It is also easy to show that no two of z - y, z - x, x + y can be divisible by p If, for example, z = y mod p and z = x mod p, then, applying Fermat's little theorem to xp + yp = zp, we would have x + y = z mod p, so we would conclude that x = 0 mod p and y = 0 mod p, again contradicting the relative primality of x and y. The remaining two cases are handled similarly. Combining these two results, we see that at least one of z - y, z - x, or x + y must have a prime factor q unequal to p. Suppose that q | z - y. Then, since | |
xp = zp - yp = (z - y)(zp-1 + zp-2y + ... + zyp-2 + yp-1) | |
|
it is clear that z - y | xp, so q | x. It is an old result that for prime p, z - y and (zp - yp) / (z - y) have no factors in common except for p, so if x = qju, with q not dividing u, then by inspecting the orders of q in the terms x, xp, and z - y, we conclude that qjp | z - y. Lemma 4. Let R be a commutative ring containing the integers. Suppose that q is a rational prime and t is an integer not divisible by q. Suppose also that q | A(t + qC) in R, for some A and C in R. Then, q | A in R. Proof. Since q and t are relatively prime in Z, there exist integers
r and s such that rq + st = 1. Since q | A(t + qC) in R,
we have q | At in R and obviously q | Arq in R, so
q | Ats + Arq = A(ts + rq) in R and hence
q | A in R, as required. | |
|
With the preliminaries out of the way, we establish the two congruences that constitute what James calls "Area One." 1.1. Divisibility results Let p be an odd prime and let x, y, z be pairwise coprime nonzero integers satisfying xp + yp = zp. From Lemma 3, without loss of generality we may suppose that q ≠ p is a prime factor of x and z - y. [Note 2.] Then q must be a factor of x2 + y2 - z2 as well. Let x = qju, where q does not divide u. Theorem 1. With the assumptions above, for any positive integer n there exists an integer m for which | ||||||||||||
| u2 + (y2 - z2) / q2j + mz2 = 0 mod qn. | [1] | |||||||||||
|
Proof. Observe that from Lemma 3, qpj | z - y, so we have that q2j divides y2 - z2. Since q does not divide z (otherwise we would contradict the relative primality of x and z), we can solve the congruence [1] for m, as required. From this, we have an immediate consequence, Corollary 1. For any integer k > 2j, there exists m such that if v = -1 + m2j then | ||||||||||||
| qk | x2 + y2 + vz2 | ||||||||||||
|
| ||||||||||||
|
1.2. The polynomial Fp, v(z2, xy) Denote the left hand side of the congruence of Lemma 1 by F = Fp, v(z2, xy), so that | ||||||||||||
| Fp, v(z2, xy) = (vp + 1)z2p - pvz2(xy)2Gp(vz2, (xy)2) - 2(xy)p. | ||||||||||||
|
For any fixed values of p and v, with v ≠ -1 we may consider F as a degree p polynomial in the indeterminants z2 and xy, so we may factor F into terms linear in z2 and xy, over the complex numbers: | ||||||||||||
| Fp, v(z2, xy) = (a1z2 + b1xy) ... (apz2 + bpxy). | [2] | |||||||||||
|
Equating coefficients, we observe that a1 ... ap = (vp + 1). By appropriate multiplication and division we may write [2] as | ||||||||||||
| ||||||||||||
|
In what follows, we will write the factorization of F as | ||||||||||||
| Fp, v(z2, xy) = (c1z2 + d1xy) ... (cpz2 + dpxy), | [3] | |||||||||||
|
where c1 = c2 = (v + 1)1/2, c3 = (vp + 1) / (v + 1), and ci = 1 for 3 < i ≤ p, and the d values are defined as above. Now from the polynomial identity [3], we return to regarding Fp, v as an integer, using the original values for x, y, and z, namely a Fermat counterexample for a specific p. | ||||||||||||
|
| ||||||||||||
|
1.3. Divisibility in the overring Let k > 2j be an arbitrary integer and fix v to be -1 + mq2j where m is as in Corollary 1. Define the ring R = Z[m1/2, d1, d2, ... , dp] for the ds in [3] resulting from this choice of v. [Note 3.] So, since qk | x2 + y2 + vz2 from Corollary 1 and x2 + y2 + vz2 | Fp, v from Lemma 2 we have (considering all subsequent congruences in the context of R, which we shall denote by =R for emphasis), | ||||||||||||
| Fp, v = (c1z2 + d1qjuy) (c2z2 + d2qjuy) ... (cpz2 + dpqjuy) =R 0 mod qk. | ||||||||||||
|
We then have Theorem 2. Suppose Fp, v = (c1z2 + d1xy)(c2z2 + d2xy) ... (cpz2 + d1xy) as in [3]. Then, for any integer n > 0 there exists an integer m such that with v = -1 + m2j, | ||||||||||||
| (m1/2z2 + d1uy)(m1/2z2 + d2uy) =R 0 mod qn | ||||||||||||
|
Proof. Using the values of ci established in [3], we have | ||||||||||||
| (m1/2qjz2 + d1qjuy)(m1/2qjz2 + d2qjuy) ... (z2 + dpqjuy) =R 0 mod qk | ||||||||||||
|
and dividing out qj from each of the first two factors yields | ||||||||||||
| (m1/2z2 + d1uy)(m1/2z2 + d2uy) ... (z2 + dpqjuy) =R 0 mod qk - 2j. | [4] | |||||||||||
|
Denote by A the product of the first 2 factors in [4] and by B the product of the remaining p - 2 factors, we have, after expanding B, | ||||||||||||
| A = (m1/2z2 + d1uy)(m1/2z2 + d2uy) | ||||||||||||
|
and | ||||||||||||
| B = ((vp + 1) / (v + 1))z2(p - 2) + qjC. | ||||||||||||
|
Letting t = ((vp + 1) / (v + 1))z2(p - 2) (and observing that in Z, q does not divide the integer t) we can rewrite [4] as | ||||||||||||
| A(t + qjC) =R 0 mod qk - 2j, | ||||||||||||
|
and by repeated applications of Lemma 4 we conclude that | ||||||||||||
| A = (m1/2z2 + d1uy)(m1/2z2 + d2uy) =R 0 mod qk - 2j. | [5] | |||||||||||
|
Writing k - 2j as n establishes our result. | ||||||||||||
|
| ||||||||||||
|
1.4. Consequences For convenience, we recall our terminology.
Finally, recall that by Theorems 1 and 2, for any n > 0 we can find an m = m(n) so that | ||||||||||||
| S(m) = 0 mod qn | ||||||||||||
|
and in R = R(m) | ||||||||||||
| T(m) =R 0 mod qn. | ||||||||||||
|
Note that changing the exponent r in the congruence modulo qr that defines m will in general change m, which will change v, which will change Fp, v, which will change the c and d values, which will give rise to a different ring R. However, if we have defined m(r), then in any congruence modulo qs for s ≥ r, m(s) will serve equally well as a solution for m(r) with no change in the ring R. | ||||||||||||
|
The next result and its corollaries establishes some further (and somewhat surprising) relations among the d values and the nature of q in the ring R. Theorem 3. Let m = m(j). Then, in the ring R = R(m) we will have d1 + d2 =R 0 mod qj. Proof. Let Fp, v be the polynomial defined in [3] for v = -1 + mq2j. Observe that in Fp, v = (vp + 1)z2j - pvz2(xy)2Gp(z2, xy) - 2(xy)p, Gp is a degree-(p - 3) polynomial in z2, so in Fp, v the (z2)p-1 term will have coefficient 0. If we expand the right side of the factorization of Fp, v and equate coefficients we will have | ||||||||||||
| d1c2...cp + c1d2c3...cp + . . . + c1...cp-1dp = 0. | ||||||||||||
|
Using the values for the cs gives us | ||||||||||||
| (d1 + d2)(v + 1)1/2((vp + 1) / (v + 1)) + d3(v + 1) + (vp + 1)(d4 + . . . + dp) = 0. | ||||||||||||
|
So, since v ≠ -1, we have | ||||||||||||
| ((vp + 1) / (v + 1))(d1 + d2) = -(v + 1)1/2(d3 + ((vp + 1) / (v + 1))(d4 + . . . + dp)). | ||||||||||||
|
Denoting the factor on the right by D, we may write this as | ||||||||||||
| ((vp + 1) / (v + 1))(d1 + d2) = -(v + 1)1/2D. | ||||||||||||
|
Thus, in the ring R we have | ||||||||||||
| ((vp + 1) / (v + 1))(d1 + d2) =R -(v + 1)1/2D mod qj. | ||||||||||||
|
Since (vp + 1) / (v + 1) is an integer not divisible by q it has a multiplicative inverse in Z modulo qj, so we may write | ||||||||||||
| d1 + d2 =R -(v + 1)1/2((vp + 1) / (v + 1))-1D mod qj. | ||||||||||||
|
Finally, recall that v = -1 + mq2j so the factor (v + 1)1/2 on the right will be divisible (in R) by qj, and thus | ||||||||||||
| d1 + d2 =R 0 mod qj. | ||||||||||||
|
| ||||||||||||
|
Corollary 2. With R defined as above, d1d2 =R 1 mod qj. Proof. Since we have just established d1 + d2 =R 0 mod qj and we know from Theorem 2 that | ||||||||||||
| mz4 + z2m1/2(d1 + d2)uy + d1d2u2y2 =R 0 mod qj | ||||||||||||
|
we find that | ||||||||||||
| mz4 + d1d2u2y2 =R 0 mod qj. | ||||||||||||
|
Now since y = z mod qpj by Lemma 3, we may set y = z above and since neither is zero modulo q we may multiply both sides by (z2)-1 modulo q in Z, giving us | ||||||||||||
| mz2 + d1d2u2 =R 0 mod qj. | [8] | |||||||||||
|
Recall that m is defined by m = -(u2 + (y2 - z2) / q2j) / z2 mod qj and observe that qj divides y2 - z2) / q2j so we have m = -u2 / z2 mod qj. Substituting this into [8] yields | ||||||||||||
| -u2 + d1d2u2 =R 0 mod qj | ||||||||||||
|
and since u is not divisible by q in Z we may multiply both sides of the above congruence by a multiplicative inverse of u mod qj to obtain the desired result. | ||||||||||||
|
| ||||||||||||
|
In his so-called "Area 2," James now proceeds to use S(m) and T(m) to get an expression not apparently involving m. Recall that for any n > 0 we may find a rational integer m = m(n) so that | ||||||||||||
| S(m) = u2 + (y2 - z2) / q2j + mz2 = 0 mod qn | [9] | |||||||||||
|
and in R = R(m), | ||||||||||||
| T(m) = mz4 + m1/2(d1 + d2)z2uy + d1d2u2 | ||||||||||||