Lecture from: 20.11.2024 | Video: Videos ETHZ
Interesting Factorization in Rings
Factorization in a ring becomes more interesting when not every element is a unit (i.e., has a multiplicative inverse). If every non-zero element is a unit, as in the field of real numbers, any element can be trivially factored. For example, . These factorizations aren’t particularly insightful because we’re just multiplying by units.
For factorization to be meaningful, we want the group of units to be significantly smaller than the ring itself: .
Non-Trivial Factorization: Let’s define a non-trivial factorization as , where and . This ensures that neither nor are simply units being multiplied by . In fields like , there are no non-trivial factorizations.
Integers: The ring of integers offers interesting factorizations because its units are only . For example, is a non-trivial factorization.
Factorization in Gaussian Integers
Consider the Gaussian integers , complex numbers of the form where and are integers. Let’s factorize .
-
Using the Norm: The norm of a Gaussian integer is defined as . The norm is multiplicative: .
-
Norm of : .
-
Factoring the Norm: . If can be factored, the norms of the factors must multiply to 10.
-
Potential Factors: We look for Gaussian integers with norms 2 and 5. has norm 2, and has norm 5.
-
Factorization: Indeed, .
-
Irreducible Factors: and are irreducible in . This is because their norms are prime integers, and any further factorization would require Gaussian integers with norms that are not integers, which is impossible.
Units and Associates: Notice that if you find an irreducible element like , you can multiply it by any unit () to get another irreducible element. These are called associates. For example, , , and are also irreducible. Each irreducible Gaussian integer is thus one of four associates.
Gaussian Primes
Visualizing Irreducible Gaussian Integers: Plotting the Gaussian integers and highlighting the irreducible ones reveals an interesting pattern related to prime numbers and sums of squares.
Zoomed Out:
Gaussian Integers Modulo an Irreducible Element
We’ll use the example of , which is irreducible in . We’re forming the quotient ring , which consists of equivalence classes of Gaussian integers modulo .
-
Elements within the Circle: If we consider the Gaussian integers strictly within a circle of radius centered at the origin, we find 13 such elements.
These are: .
Lecturer's Mistake
In my opinion, in the lecture, it was mistakenly stated that there are 13 distinct elements in the field . This is incorrect. While there are 13 Gaussian integers within the circle of radius , many of them are equivalent modulo .
-
Modular Equivalence: In the “mod world,” many of these 13 elements are equivalent. Two Gaussian integers and are equivalent modulo if their difference is a multiple of : if for some Gaussian integer .
- Example 1: because , which is a multiple of .
- Example 2: because .
-
Distinct Representatives: When we account for these equivalences, we find only five distinct equivalence classes (or cosets) in . A convenient set of representatives for these classes is . Every Gaussian integer is congruent modulo to precisely one of these five representatives.
-
Field Structure: forms a field with five elements. As all finite fields with the same number of elements are isomorphic, is isomorphic to (or ). The modulo construction gives us an alternative way to represent this finite field.
Polynomial Fields and Euclidean Division
In polynomial rings , where is a field, we can perform polynomial long division, much like integer division. This leads to the concept of a residue polynomial and modular arithmetic with polynomials.
Euclidean Division: For any two polynomials and in (with ), there exist unique polynomials (quotient) and (remainder) in such that:
, where or .
Modular Congruence: Just as with integers, we can define modular congruence for polynomials. We say that is congruent to modulo , written as:
or
This means and have the same remainder when divided by . We can also express the remainder using the notation:
Example:
Let’s divide by in :
Here, the quotient is and the remainder is . Notice that .
Thus, we can write:
And in modular notation:
This Euclidean division and the resulting modular arithmetic are fundamental tools for working with polynomial rings and constructing extension fields.
Polynomials as Functions
While we can represent polynomials in as sequences of coefficients, it’s often beneficial to view them as functions from to . This perspective allows for a deeper understanding of polynomial properties and enables applications like polynomial interpolation and error-correcting codes.
Recall that in , addition and multiplication are defined in the natural way:
- Addition: , where the coefficients are added term-wise.
- Multiplication: , where is obtained through polynomial multiplication.
Example:
Consider the polynomial in . Here, the coefficients and the variable take values in .
We can evaluate at each element of :
As we can see, some polynomials have roots (values where the polynomial evaluates to zero) within the field, while others do not. does not have a root in .
Lemma 5.29: The Factor Theorem
A key connection between roots and factors of a polynomial is captured by the Factor Theorem:
Lemma 5.29 (Factor Theorem)
Let be a field, , and . Then divides if and only if .
Proof:
() If divides , then for some polynomial . Substituting , we get .
() If , we can use the division algorithm for polynomials. We can divide by to get a quotient and a remainder , where the degree of is less than the degree of . So is a constant, say .
Thus, . Substituting , we get . Since , we have . Therefore, , which means divides .
This lemma is fundamental for relating the roots of a polynomial to its factorization.
Polynomial Interpolation
Polynomial interpolation is the process of finding a polynomial that passes through a given set of points. Given data points , where the are distinct, there exists a unique polynomial of degree at most such that for all .
Example in
Let’s work in , the ring of polynomials with coefficients in the finite field . We want to find a polynomial of degree at most 2 (i.e., ) such that , , and .
We can set up a system of linear equations:
This gives us the system of congruences:
This system can be solved using Gaussian elimination (as discussed in 08 Gauss Elimination, Elimination Matrices and Solution Sets).
If the values come from evaluating a polynomial of degree at most at the values, the resulting matrix will be invertible. This guarantees a unique solution for the coefficients . Since our values are distinct in and has degree at most 2, the given information should allow for reconstruction.
General Rule
To interpolate a polynomial of degree at most , we need data points. This can be understood from a linear algebra perspective. We have unknown coefficients () and equations (one for each data point). This creates a square matrix. If the values are distinct, the matrix is invertible, guaranteeing a unique solution for the coefficients.
For more see: Vandermonde Matrices
Lagrange Interpolation
Lagrange interpolation provides an explicit formula for the interpolating polynomial. Given data points , where the are distinct elements of a field , we want to find a polynomial of degree at most such that for all .
The idea behind Lagrange interpolation is to construct a set of basis polynomials with the following property:
These polynomials act like “switches,” turning on (evaluating to 1) only at their corresponding and turning off (evaluating to 0) at all other .
Once we have these basis polynomials, the interpolating polynomial can be constructed as a linear combination:
This works because when we evaluate at :
since all other terms where are zero.
The Lagrange basis polynomials are defined as:
In other notation, the Lagrange basis polynomial is defined as:
Note that the term is omitted from the product. This is explicitly stated in the original notation with under the product symbol.
Each is a polynomial of degree . It’s constructed to be 1 at and 0 at all other where .
This formula ensures that (because the factors in the numerator and denominator become the same) and for (because the factor becomes zero in the numerator).
Lagrange interpolation thus provides a direct and elegant way to construct the interpolating polynomial, although for large , evaluating this formula can become computationally expensive.
Theorem 5.31: A Non-Zero Polynomial of Degree Has at Most Roots in a Field
Let be a field. A non-zero polynomial of degree has at most roots in .
Proof:
-
Distinct Roots: Let be distinct roots of in . This means for each .
-
Factorization: By the Factor Theorem (Lemma 5.29), if is a root of in , then is a factor of . Therefore, for each distinct root , we have .
-
Product of Factors: We can combine the distinct linear factors corresponding to the distinct roots into a single factor: .
-
Degree Comparison: The degree of the product is . Since has degree , and the degree of a product of polynomials is the sum of their degrees, we must have .
-
Conclusion: Thus, has at most distinct roots in .
Note
This theorem holds only for non-zero polynomials. The zero polynomial has infinitely many roots. Also note that the degree of the polynomial must be exactly equal to . Otherwise the amount of roots can only be less than d and never equal to , because the factored-out linear factors already decrease the total possible roots.
Proving Uniqueness of the Interpolated Polynomial
Now, let’s prove the uniqueness of the interpolated polynomial.
-
Assumption: Suppose we have two different polynomials and , both of degree at most , that interpolate the same data points . This means for all .
-
Difference Polynomial: Consider the polynomial . Since and have degree at most , also has degree at most .
-
Roots of b(x): For each , we have . This means has at least distinct roots ().
-
Contradiction: We have a non-zero polynomial of degree at most with distinct roots. This contradicts the theorem we just proved.
-
Conclusion: Therefore, our initial assumption must be false. There cannot be two different interpolating polynomials of degree at most for the same data points. The interpolated polynomial is unique.
Constructing Galois Fields using Polynomials
We can construct finite fields using polynomial rings. The process involves creating a quotient ring of polynomials modulo an irreducible polynomial.
-
Start with a Finite Field: Begin with a finite field (e.g., where is prime). Let . In many cases this is just a prime , meaning .
-
Polynomial Ring: Consider the polynomial ring , which consists of polynomials with coefficients in .
-
Irreducible Polynomial: Choose an irreducible polynomial of degree . An irreducible polynomial is one that cannot be factored into lower-degree polynomials within .
-
Quotient Ring (Finite Field): Construct the quotient ring . This consists of the equivalence classes of polynomials in modulo . Two polynomials are equivalent if they have the same remainder when divided by .
Representing Elements
The elements of can be uniquely represented by polynomials of degree less than . This is because any polynomial can be written as , where is the remainder and . The remainder serves as the representative of the equivalence class.
-
Size of the Field: Because elements are represented by polynomials of degree less than , and each of the coefficients can be any of the elements of , the size of the quotient ring (and thus the field) is .
-
Field Operations:
-
Addition: Addition is performed component-wise on the coefficients of the polynomials, modulo the characteristic of the field . That is, each coefficient is the sum of the corresponding coefficients in , so the coefficient of of the sum of two polynomials is the sum of their coefficients of modulo . The resulting polynomial has also a degree less than .
-
Multiplication: Multiplication is performed by multiplying the polynomials as usual and then taking the remainder when dividing by . Since has degree , this ensures the result can also be represented by a polynomial of degree less than .
-
Example: Constructing
Let and let . The polynomial is irreducible in because it has no roots in . Thus . Here .
Let’s multiply and :
Now we divide this by :
The remainder is . Thus: in .
This method allows us to construct finite fields of any prime power order . These fields are crucial in various areas like cryptography and coding theory. The irreducibility of guarantees the existence of multiplicative inverses for all non-zero elements in the quotient ring, fulfilling the requirements of a field.
Modular Arithmetic and Polynomial GCDs
The concept of modular arithmetic with integers extends naturally to polynomials. Recall that for integers , , and , the equation has a solution (meaning has a multiplicative inverse modulo ) if and only if . This is based on the ideal structure of integers.
Ideals and GCDs:
For integers and , the ideal generated by and is denoted . A fundamental result in number theory states that this ideal is principal, meaning it can be generated by a single element.
Proof that (up to units)
Assume . This means:
- can be written as a linear combination of and : for some integers and .
- Since and , there exist integers and such that and . This follows directly from the definition of the ideal . Because and can be expressed as multiples of , this shows that is a common divisor of and .
- Let be any common divisor of and . This means and for some integers and . Substituting into the equation , we have . Therefore, is a multiple of , which implies that is the greatest common divisor (since any other common divisor divides ).
In rings with units (invertible elements), is equal to the gcd up to multiplication by a unit. So, , where is a unit. In , the units are and , so .
Polynomial Analogy:
This concept translates directly to polynomials. Let and be polynomials in a field . The ideal generated by and is . Similar to integers, this ideal is principal: , where (up to multiplication by units in , which are non-zero constant polynomials).
Multiplicative Inverses for Polynomials:
The equation has a solution (i.e., has a multiplicative inverse modulo ) if and only if .
Proof of the Condition for Polynomial Inverses
- If : There exist polynomials and such that . Modulo , this becomes , making the inverse of .
- If : Then for some . So, . Any common divisor of and must divide 1, implying the gcd is 1.
Note that for any non-zero with degree less than when is irreducible. This is because an irreducible polynomial cannot share any non-trivial factors with a lower-degree polynomial. If the were not 1, it would imply that is divisible by some polynomial, which contradicts its irreducibility unless is a unit (a constant). Since we consider to have a lower degree than and since , the gcd must be 1.
Conclusion:
The existence of multiplicative inverses for polynomials modulo another polynomial hinges on their greatest common divisor, mirroring the situation with integers. This is fundamental for constructing finite fields from polynomial rings. When is irreducible, all non-zero polynomials with have inverses modulo . This is what makes a field.
These fields are also called Galois Fields and are usually denoted as , where (the size of the base field ) and is the degree of the irreducible polynomial . contains elements. Specifically is denoted as with .
Example:
The quotient ring is isomorphic to the field of complex numbers, . Elements in correspond to in , with playing the role of . Addition is component-wise, and multiplication modulo mirrors complex multiplication: , analogous to .
Continue here: 20 Generators in Finite Fields, Properties of Finite Fields, Error Correcting Codes, Reed-Solomon Codes