Let
V be a subspace of
R
n
for some
n. A collection
B = {
v1,
v2, …,
v
r
} of vectors from
V is said to be a
basis for
V if
B is linearly independent and spans
V. If either one of these criterial is not satisfied, then the collection is not a basis for
V. If a collection of vectors spans
V, then it contains enough vectors so that every vector in
V can be written as a linear combination of those in the collection. If the collection is linearly independent, then it doesn't contain so many vectors that some become dependent on the others. Intuitively, then, a basis has just the right size: It's big enough to span the space but not so big as to be dependent.
Example 1: The collection {
i, j} is a basis for
R2, since it spans
R2 and the vectors
i and
j are linearly independent (because neither is a multiple of the other). This is called the
standard basis for
R2. Similarly, the set {
i, j, k} is called the standard basis for
R3, and, in general,
is the standard basis for
R
n
.
Example 2: The collection {
i, i+j, 2
j} is not a basis for
R2. Although it spans
R2, it is not linearly independent. No collection of 3 or more vectors from
R2 can be independent.
Example 3: The collection {
i+j, j+k} is not a basis for
R3. Although it is linearly independent, it does not span all of
R3. For example, there exists no linear combination of
i + j and
j + k that equals
i + j + k.
Example 4: The collection {
i + j, i − j} is a basis for
R2. First, it is linearly independent, since neither
i + j nor
i − j is a multiple of the other. Second, it spans all of
R2 because every vector in
R2 can be expressed as a linear combination of
i + j and
i − j. Specifically, if
a
i +
b
j is any vector in
R2, then
if
k1 = ½(
a + b) and
k2 = ½(
a − b).
A space may have many different bases. For example, both {
i, j} and {
i + j, i − j} are bases for
R2. In fact,
any collection containing exactly two linearly independent vectors from
R2 is a basis for
R2. Similarly, any collection containing exactly three linearly independent vectors from
R3 is a basis for
R3, and so on. Although no nontrivial subspace of
R
n
has a unique basis, there
is something that all bases for a given space must have in common.
Let
V be a subspace of
R
n
for some
n. If
V has a basis containing exactly
r vectors, then
every basis for
V contains exactly
r vectors. That is, the choice of basis vectors for a given space is not unique, but the
number of basis vectors
is unique. This fact permits the following notion to be well defined: The number of vectors in a basis for a vector space
V ⊆
R
n
is called the
dimension of
V, denoted dim
V.
Example 5: Since the standard basis for
R2, {
i, j}, contains exactly 2 vectors,
every basis for
R2 contains exactly 2 vectors, so dim
R2 = 2. Similarly, since {
i, j, k} is a basis for
R3 that contains exactly 3 vectors, every basis for
R3 contains exactly 3 vectors, so dim
R3 = 3. In general, dim
R
n
=
n for every natural number
n.
Example 6: In
R3, the vectors
i and
k span a subspace of dimension 2. It is the
x−z plane, as shown in Figure
1 .
Example 7: The one-element collection {
i + j = (1, 1)} is a basis for the 1-dimensional subspace
V of
R2 consisting of the line
y =
x. See Figure
2 .
Example 8: The trivial subspace, {
0}, of
R
n
is said to have dimension 0. To be consistent with the definition of dimension, then, a basis for {
0} must be a collection containing zero elements; this is the empty set, ø.
The subspaces of
R1,
R2, and
R3, some of which have been illustrated in the preceding examples, can be summarized as follows:
Example 9: Find the dimension of the subspace
V of
R4 spanned by the vectors
The collection {
v1,
v2,
v3,
v4} is not a basis for
V—and dim
V is not 4—because {
v1,
v2,
v3,
v4} is not linearly independent; see the calculation preceding the example above. Discarding
v3 and
v4 from this collection does not diminish the span of {
v1,
v2,
v3,
v4}, but the resulting collection, {
v1,
v2}, is linearly independent. Thus, {
v1,
v2} is a basis for
V, so dim
V = 2.
Example 10: Find the dimension of the span of the vectors
Since these vectors are in
R5, their span,
S, is a subspace of
R5. It is not, however, a 3-dimensional subspace of
R5, since the three vectors,
w1,
w2, and
w3 are not linearly independent. In fact, since
w3 =
3w1 +
2w2, the vector
w3 can be discarded from the collection without diminishing the span. Since the vectors
w1 and
w2 are independent—neither is a scalar multiple of the other—the collection {
w1,
w2} serves as a basis for
S, so its dimension is 2.
The most important attribute of a basis is the ability to write every vector in the space in a
unique way in terms of the basis vectors. To see why this is so, let
B = {
v1,
v2, …,
v
r
} be a basis for a vector space
V. Since a basis must span
V, every vector
v in
V can be written in at least one way as a linear combination of the vectors in
B. That is, there exist scalars
k1,
k2, …,
kr
such that
To show that no other choice of scalar multiples could give
v, assume that
is also a linear combination of the basis vectors that equals
v.
Subtracting (*) from (**) yields
This expression is a linear combination of the basis vectors that gives the zero vector. Since the basis vectors must be linearly independent, each of the scalars in (***) must be zero:
Therefore, k′1 =
k1, k′2 =
k2,…, and k′r =
kr, so the representation in (*) is indeed unique. When
v is written as the linear combination (*) of the basis vectors
v1,
v2, …,
v
r
, the uniquely determined scalar coefficients
k1,
k2, …,
kr
are called the
components of
v relative to the basis
B. The row vector (
k1,
k2, …,
kr
) is called the
component vector of
v relative to
B and is denoted (
v)
B
. Sometimes, it is convenient to write the component vector as a
column vector; in this case, the component vector (
k1,
k2, …,
kr
)T is denoted [
v]
B
.
Example 11: Consider the collection
C = {
i, i + j, 2
j} of vectors in
R2. Note that the vector
v = 3
i + 4
j can be written as a linear combination of the vectors in
C as follows:
and
The fact that there is more than one way to express the vector
v in
R2 as a linear combination of the vectors in
C provides another indication that
C cannot be a basis for
R2. If
C were a basis, the vector
v could be written as a linear combination of the vectors in
C in one
and only one way.
Example 12: Consider the basis
B = {
i +
j, 2
i −
j} of
R2. Determine the components of the vector
v = 2
i − 7
j relative to
B.
The components of
v relative to
B are the scalar coefficients
k1 and
k2 which satisfy the equation
This equation is equivalent to the system
The solution to this system is
k1 = −4 and
k2 = 3, so
Example 13: Relative to the standard basis {
i, j, k} = {
ê1,
ê2,
ê3} for
R3, the component vector of any vector
v in
R3 is equal to
v itself: (
v)
B
=
v. This same result holds for the standard basis {
ê1,
ê2,…,
ên} for every
R
n
.
Orthonormal bases. If
B = {
v1,
v2, …,
v
n
} is a basis for a vector space
V, then every vector
v in
V can be written as a linear combination of the basis vectors in one and only one way:
Finding the components of
v relative to the basis
B—the scalar coefficients
k1,
k2, …,
kn
in the representation above—generally involves solving a system of equations. However, if the basis vectors are
orthonormal, that is, mutually orthogonal unit vectors, then the calculation of the components is especially easy. Here's why. Assume that
B = {vˆ1,vˆ2,…,vˆn} is an orthonormal basis. Starting with the equation above—with vˆ1, vˆ2,…, vˆn replacing
v1,
v2, …,
v
n
to emphasize that the basis vectors are now assumed to be unit vectors—take the dot product of both sides with vˆ1:
By the linearity of the dot product, the left-hand side becomes
Now, by the orthogonality of the basis vectors, vˆi · vˆ1 = 0 for
i = 2 through
n. Furthermore, because vˆ is a unit vector, vˆ1 · vˆ1 = ‖vˆ1‖12 = 12 = 1. Therefore, the equation above simplifies to the statement
In general, if
B = {
vˆ1,
vˆ2,…,
vˆn} is an orthonormal basis for a vector space
V, then the components,
ki
, of any vector
v relative to
B are found from the simple formula
Example 14: Consider the vectors
from
R3. These vectors are mutually orthogonal, as you may easily verify by checking that
v1 ·
v2 =
v1 ·
v3 =
v2 ·
v3 = 0. Normalize these vectors, thereby obtaining an orthonormal basis for
R3 and then find the components of the vector
v = (1, 2, 3) relative to this basis.
A nonzero vector is
normalized—made into a unit vector—by dividing it by its length. Therefore,
Since
B = {
vˆ1,
vˆ2,
vˆ3} is an orthonormal basis for
R3, the result stated above guarantees that the components of
v relative to
B are found by simply taking the following dot products:
Therefore, (
v)
B
= (5/3, 11/(3√2),3/√2), which means that the unique representation of
v as a linear combination of the basis vectors reads
v = 5/3
vˆ1 + 11/(3√2)
vˆ2 + 3/√2
vˆ3, as you may verify.
Example 15: Prove that a set of mutually orthogonal, nonzero vectors is linearly independent.
Proof. Let {
v1,
v2, …,
v
r
} be a set of nonzero vectors from some
R
n
which are mutually orthogonal, which means that no
v
i
=
0 and
v
i
·
v
j
= 0 for
i ≠
j. Let
be a linear combination of the vectors in this set that gives the zero vector. The goal is to show that
k1 =
k2 = … =
kr
= 0. To this end, take the dot product of both sides of the equation with
v1:
The second equation follows from the first by the linearity of the dot product, the third equation follows from the second by the orthogonality of the vectors, and the final equation is a consequence of the fact that ‖
v1‖2 ≠ 0 (since
v1 ≠
0). It is now easy to see that taking the dot product of both sides of (*) with
v
i
yields
ki
= 0, establishing that
every scalar coefficient in (*) must be zero, thus confirming that the vectors
v1,
v2, …,
v
r
are indeed independent.