data:image/s3,"s3://crabby-images/f4d76/f4d76f3cdd5431ee0d2314822603c90dca87fa91" alt="Deep Learning Essentials"
Matrix properties
In this section, we will look at some of the important properties matrices which are very useful for deep learning applications.
- Norm: Norm is an important property of a vector or a matrix that measures the size of the vector or the matrix. Geometrically it can also be interpreted as the distance of a point, x, from an origin. A Lp norm is therefore defined as follows:
data:image/s3,"s3://crabby-images/a380d/a380dbce0871f591d67f41b08b0c5a3ed515fb46" alt=""
Though a norm can be computed for various orders of p, most popularly known norms are L1 and L2 norm. L1 norm is usually considered a good choice for sparse models:
data:image/s3,"s3://crabby-images/93eef/93eef6f3bff5b32134092a41baae7708d369f9d7" alt=""
Another norm popular in the deep learning community is the max norm, also referred to as L∞. This is simply equivalent to the value of the largest element in the vector:
data:image/s3,"s3://crabby-images/d71e9/d71e9f392826655a271b9d3e5f3c854f26c2b5c4" alt=""
So far, all the previously mentioned norms are applicable to vectors. When we want to compute the size of a matrix, we use Frobenius norm, defined as follows:
data:image/s3,"s3://crabby-images/8ebc1/8ebc1cd80502499bbbb6d51417c14bea2a8eefcd" alt=""
Norms are usually used as they can be used to compute the dot product of two vectors directly:
data:image/s3,"s3://crabby-images/1c291/1c291cec4c0785c9532813ba15d05e6c5047e0ef" alt=""
- Trace: Trace is an operator that is defined as the sum of all the diagonal elements of a matrix:
data:image/s3,"s3://crabby-images/9fb54/9fb54de98ba67dc9595e002c8f851dec64814627" alt=""
Trace operators are quite useful in computing the Frobenius norm of the matrix, as follows:
data:image/s3,"s3://crabby-images/b2267/b22676e8cd21f272de0fe72b25b772b66129a50f" alt=""
Another interesting property of trace operator is that it is invariant to matrix transpose operations. Hence, it is often used to manipulate matrix expressions to yield meaningful identities:
data:image/s3,"s3://crabby-images/a54e1/a54e14cd4fdcbd9c25cf1dafb6576f51639d8aa2" alt=""
- Determinant: A determinant of a matrix is defined as a scalar value which is simply a product of all the eigenvalues of a matrix. They are generally very useful in the analysis and solution of systems of linear equations. For instance, according to Cramer's rule, a system of linear equations has a unique solution, if and only if, the determinant of the matrix composed of the system of linear equations is non-zero.