COMSOL 6.2 - The Effect of Periodicity

The 3D periodicity of crystalline materials is conventionally described by a set of lattice vectors and a basis (a specific pattern of atoms) associated with each lattice point. Consider the case of a “primitive lattice”, that is, a crystal with a single atom per unit cell, where the atomic locations are coincident with the lattice points. Such a lattice does not require a basis (or, more formally, the basis is a single atom located at (0,0,0)). The set of lattice vectors R can be written as:

$$\mathbf{R} = n_1\mathbf{a}_1 + n_2\mathbf{a}_2 + n_3\mathbf{a}_3 \tag{3-8}$$

where n1, n2, and n3 are integers (taking all values between –∞ and ∞) and a1, a2, and a3 are the lattice vectors. For a primitive lattice, the unit cell is the parallelepiped constructed from the vectors a1, a2, and a3.

A useful way to represent the lattice is by means of an array of delta functions. A physical quantity of interest (for example the electric potential) can then be represented by the convolution of the variation in the potential within a single unit cell of the lattice with the delta function array. This approach is easiest to understand in 1D, when the delta function array is known as a Dirac comb. A 1D lattice can be represented as:

$$f(x) = \sum_{n=-\infty}^{\infty} \delta(x - na)$$

where a is the lattice parameter. This periodic function can be represented by a Fourier series of the form:

$$\sum_{n=-\infty}^{\infty} \delta(x - na) = \frac{1}{a}\sum_{m=-\infty}^{\infty} \exp\left(\frac{2\pi i m x}{a}\right) \tag{3-9}$$

Of particular importance for understanding semiconductor transport is the concept of the reciprocal lattice. This is the lattice produced by taking the Fourier transform of the real space lattice. For the Dirac comb this is given by:

$$F(k) = \int_{-\infty}^{\infty} \sum_{n=-\infty}^{\infty} \delta(x - na)\, e^{-ikx}\, dx = \sum_{n=-\infty}^{\infty} e^{-ikna} = \frac{2\pi}{a}\sum_{m=-\infty}^{\infty} \delta\left(k - \frac{2\pi m}{a}\right)$$

where the final step follows from Equation 3-9. The reciprocal lattice is another Dirac comb, with a spacing proportional to the reciprocal of the real space lattice spacing.
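
The peaks of the reciprocal lattice can be checked numerically with a finite comb. The following is a minimal sketch rather than part of the original derivation: it assumes a comb truncated to a finite number of points, a transform kernel of the form exp(−ikx) with no prefactor, and a hypothetical helper name `comb_transform`.

```python
import cmath
import math

def comb_transform(k, a, n_points):
    """Fourier sum of a truncated comb: sum of exp(-i*k*n*a) for n = 0..n_points-1."""
    return sum(cmath.exp(-1j * k * n * a) for n in range(n_points))

a = 0.5          # real space spacing (arbitrary units, assumed)
n_points = 200   # truncation of the infinite comb (assumed)

k_peak = 2 * math.pi / a   # first nonzero reciprocal lattice point
k_off = 0.5 * k_peak       # halfway between two reciprocal lattice points

peak = abs(comb_transform(k_peak, a, n_points))
off = abs(comb_transform(k_off, a, n_points))
print(peak, off)
```

On a reciprocal lattice point every term adds in phase, so the magnitude grows with the number of points; halfway between two points the terms alternate in sign and cancel.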

In 3D the lattice can be represented as δ (r–R) where the summation over all combinations of lattice vectors is implied by the use of the set of vectors R. The Fourier transform of the lattice is:

$$\int \sum_{\mathbf{R}} \delta(\mathbf{r} - \mathbf{R})\, e^{-i\mathbf{k}\cdot\mathbf{r}}\, d^3r = \sum_{\mathbf{R}} e^{-i\mathbf{k}\cdot\mathbf{R}} = \frac{8\pi^3}{\Omega}\sum_{\mathbf{K}} \delta(\mathbf{k} - \mathbf{K})$$

where Ω is the volume of the real space unit cell (Ω = a1 ⋅ (a2 × a3)) and K is the set of reciprocal lattice vectors given by:

$$\mathbf{K} = n_1\mathbf{b}_1 + n_2\mathbf{b}_2 + n_3\mathbf{b}_3$$

where n1, n2, and n3 are integers (taking all values between –∞ and ∞) and b1, b2, and b3 are the reciprocal lattice vectors given by:

$$\mathbf{b}_1 = 2\pi\,\frac{\mathbf{a}_2 \times \mathbf{a}_3}{\mathbf{a}_1 \cdot (\mathbf{a}_2 \times \mathbf{a}_3)}, \qquad \mathbf{b}_2 = 2\pi\,\frac{\mathbf{a}_3 \times \mathbf{a}_1}{\mathbf{a}_1 \cdot (\mathbf{a}_2 \times \mathbf{a}_3)}, \qquad \mathbf{b}_3 = 2\pi\,\frac{\mathbf{a}_1 \times \mathbf{a}_2}{\mathbf{a}_1 \cdot (\mathbf{a}_2 \times \mathbf{a}_3)} \tag{3-10}$$

Understanding the reciprocal lattice in terms of Fourier transforms as described above is useful. The Heisenberg uncertainty principle can be seen to be related to the properties of the Fourier transform (in the time domain: ΔfΔt≈1). Also, the effect of a lattice basis can be straightforwardly introduced by taking the convolution of the basis with the lattice in real space. The convolution theorem then tells us that the result in reciprocal space (or k-space) is the product of the reciprocal lattice and the Fourier transform of the basis. The main effect of the basis is to modulate the amplitude (and phase) of the reciprocal lattice points. When zeros in the Fourier transform of the basis coincide with reciprocal lattice points, the basis leads to the elimination of these points in the reciprocal lattice. In X-ray diffraction experiments, which sample the reciprocal lattice, this is referred to as extinction.
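
Extinction can be illustrated with a short calculation. The sketch below is an illustrative example, not taken from the original text: it assumes a body-centered cubic crystal described as a simple cubic lattice with a basis of two identical point-like atoms, a Fourier kernel exp(−iK·r), and a hypothetical helper name `structure_factor`.

```python
import cmath
import math

def structure_factor(hkl, basis):
    """Fourier transform of the basis at K = h*b1 + k*b2 + l*b3.

    Atoms are identical point scatterers at fractional positions (x, y, z),
    so K.r = 2*pi*(h*x + k*y + l*z).
    """
    h, k, l = hkl
    return sum(cmath.exp(-2j * math.pi * (h * x + k * y + l * z))
               for x, y, z in basis)

# Simple cubic cell with atoms at the corner and at the body center (assumed example).
bcc_basis = [(0.0, 0.0, 0.0), (0.5, 0.5, 0.5)]

for hkl in [(1, 0, 0), (1, 1, 0), (1, 1, 1), (2, 0, 0)]:
    amplitude = abs(structure_factor(hkl, bcc_basis))
    print(hkl, round(amplitude, 6))
```

In this example the reciprocal lattice points with odd h + k + l have zero amplitude, which is the elimination of points described above.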

Importantly, any physical quantity with a periodicity that matches that of the lattice can straightforwardly be represented as the convolution of some function with the lattice in real space, or as a modulating function for the reciprocal lattice in k-space, changing the amplitudes of each of the lattice points in k-space. Since each of these points represents a single harmonic component of the quantity of interest, this construction can be thought of as a representation of a three-dimensional Fourier series. For example, consider a periodic potential V(r) = V(r + R), which can be written in the form:

$$V(\mathbf{r}) = \sum_{\mathbf{K}} V_{\mathbf{K}}\, e^{i\mathbf{K}\cdot\mathbf{r}} \tag{3-11}$$

where the summation occurs over all the reciprocal lattice vectors K. The reciprocal lattice is therefore a representation of the Fourier components required in 3D to represent a function with the periodicity of the lattice.

The Sommerfeld model did not attempt to include the effective electric potential of the crystal or the other electrons. It is clear that the effective potential must be periodic with the same periodicity as the crystal lattice. The periodicity of the lattice has important consequences for the electrons. First, since the problem has periodic symmetry, observable physical quantities must also have periodic symmetry. As a consequence |ψ|² must be periodic, so that, using the notation developed previously:

$$|\psi(\mathbf{r} + \mathbf{R})|^2 = |\psi(\mathbf{r})|^2$$

equivalently:

$$\psi(\mathbf{r} + \mathbf{R}) = e^{i\theta(\mathbf{R})}\,\psi(\mathbf{r}) \tag{3-12}$$

The translational symmetry of the lattice imposes additional restrictions on the form of θ(R). For two lattice vectors, RA and RA + RB, translational symmetry implies:

$$\theta(\mathbf{R}_A + \mathbf{R}_B) = \theta(\mathbf{R}_A) + \theta(\mathbf{R}_B)$$

which leads to the requirement that θ depends linearly on the three integers, n1, n2, and n3, which specify R (see Equation 3-8):

$$\theta(\mathbf{R}) = n_1\,\theta(\mathbf{a}_1) + n_2\,\theta(\mathbf{a}_2) + n_3\,\theta(\mathbf{a}_3)$$

θ can therefore be written in the form:

$$\theta = 2\pi\,(c_1 n_1 + c_2 n_2 + c_3 n_3)$$

for constants c1, c2, and c3. Writing k = c1b1 + c2b2 + c3b3 leads to the requirement θ = k⋅R and Equation 3-12 then becomes:

$$\psi(\mathbf{r} + \mathbf{R}) = e^{i\mathbf{k}\cdot\mathbf{R}}\,\psi(\mathbf{r}) \tag{3-13}$$

If ψ (r) is written in the form:

$$\psi(\mathbf{r}) = e^{i\mathbf{k}\cdot\mathbf{r}}\,u_{\mathbf{k}}(\mathbf{r}) \tag{3-14}$$

substituting into Equation 3-13 shows that uk(r+R) = uk(r). Equation 3-14, along with the periodicity requirement on uk(r), is known as Bloch’s theorem. It is extremely useful as it allows the wave function corresponding to a particular k vector to be expanded in a Fourier series of the same form as that of the potential (Equation 3-11):

Wave functions which satisfy Bloch’s theorem are frequently referred to as Bloch functions.
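
Bloch’s theorem can be verified numerically for a simple one-dimensional case. The following is a minimal sketch under stated assumptions: the lattice parameter, the wave number, and the Fourier coefficients of the cell-periodic function `u` are arbitrary illustrative values, and the function names are hypothetical.

```python
import cmath
import math

a = 1.0   # lattice parameter (arbitrary units, assumed)
k = 0.7   # wave number, chosen arbitrarily for illustration

def u(x):
    """Cell-periodic function built from two reciprocal lattice harmonics (assumed coefficients)."""
    K = 2 * math.pi / a
    return 1.0 + 0.3 * cmath.exp(-1j * K * x) + 0.1 * cmath.exp(-2j * K * x)

def psi(x):
    """Bloch function: a plane wave multiplied by the cell-periodic part."""
    return cmath.exp(1j * k * x) * u(x)

x = 0.37   # arbitrary position inside the cell
n = 3      # translate by three lattice parameters
lhs = psi(x + n * a)
rhs = cmath.exp(1j * k * n * a) * psi(x)
print(abs(lhs - rhs))
```

The translated wave function differs from the original only by the phase factor exp(ik·R), so its magnitude has the periodicity of the lattice.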

The time-independent Schrödinger equation can now be written using the periodic expansion for both the potential and the Bloch wave function:

where G is used for the potential reciprocal lattice vector to distinguish it from K. Simplifying this equation gives:

To obtain the equation for the coefficients of the sum, premultiply the sum by

and integrate over all space. This gives:

Equation 3-15 is valid for any periodic potential, small or large.

To obtain the small potential limit of Equation 3-15, consider the case of a single sinusoidal potential component with a small amplitude:

$$V(\mathbf{r}) = 2V_1\cos(\mathbf{K}_1\cdot\mathbf{r}) = V_1\left(e^{i\mathbf{K}_1\cdot\mathbf{r}} + e^{-i\mathbf{K}_1\cdot\mathbf{r}}\right)$$

Since the periodicity of the potential is one dimensional, it is only necessary to consider Fourier components in the direction of K1 in the expansion of uk(r), which can therefore be written in the form:

$$u_{\mathbf{k}}(\mathbf{r}) = \sum_{p=-\infty}^{\infty} C_p\, e^{-ip\mathbf{K}_1\cdot\mathbf{r}}$$

and Equation 3-15 takes the form:

$$\left[\frac{\hbar^2}{2m}\left|\mathbf{k} - p\mathbf{K}_1\right|^2 - E\right] C_p + V_1\left(C_{p-1} + C_{p+1}\right) = 0 \tag{3-16}$$

For V1 = 0 and p = 0, Equation 3-16 recovers the form of the Sommerfeld E–k relationship, Equation 3-4. However, for V1 = 0, additional solutions now also exist for nonzero values of p, which take the form of a set of parabolas with origins shifted by p times the reciprocal lattice vector K1. As a result of the periodicity of the lattice, E has become multivalued for a given k as shown in Figure 3-1. The E–k relationship is periodic, with a single repeating unit contained within the dashed lines (shown at ±K1/2). Since the parts of the plot outside the dashed lines consist of repeated information, E–k plots are conventionally drawn showing only the region between the dashed lines. This region is called the Brillouin zone and in 3D it consists of all the points in k-space closer to one particular reciprocal lattice point K than to any other reciprocal lattice point.

Writing Equation 3-16 explicitly for the case V1 = 0 gives:

$$\left[\frac{\hbar^2}{2m}\left|\mathbf{k} - p\mathbf{K}_1\right|^2 - E\right] C_p = 0 \tag{3-17}$$

Figure 3-1 shows that on the planes halfway between the reciprocal lattice points (defined by the equations k·K1 = q|K1|²/2, for integers q, and indicated by the dashed lines), the energy associated with two different values of p, for example p1 and p2, can be equal. Note that p1 and p2 are associated with two (different) periodic components of the wave function or with two different E–k curves in the figure. Away from these planes, the energy associated with each periodic component of the wave function differs from all the other components.

Next consider the energy on the plane k·K1 = q|K1|²/2, at the point corresponding to p1 = 0 and p2 = 1, as Equation 3-17 changes into Equation 3-16 by a slow increase of V1 from zero. Close to the plane it is expected that initially, as V1 is increased, a solution exists in which C0 and C1 are significant but all other coefficients are extremely small; that is, a solution in which only the p1 = 0 and p2 = 1 components of the wave function play a significant role. Making the assumption that the other coefficients are zero reduces the set of equations represented by Equation 3-16 to just two equations:

$$\left[\frac{\hbar^2}{2m}\left(k_\parallel^2 + k_\perp^2\right) - E\right] C_0 + V_1 C_1 = 0$$

$$\left[\frac{\hbar^2}{2m}\left((k_\parallel - K_1)^2 + k_\perp^2\right) - E\right] C_1 + V_1 C_0 = 0$$

where the vector k has been decomposed into components parallel (k||) and perpendicular (k⊥) to K1. Taking the product of these two equations leads to a quadratic equation that can be solved to obtain the value of E. It is most convenient to use the following nondimensional variables when solving the equation:

$$\varepsilon = \frac{8mE}{\hbar^2 K_1^2}, \qquad U = \frac{8mV_1}{\hbar^2 K_1^2}, \qquad \delta = 1 - \frac{2k_\parallel}{K_1}$$

A little algebra then leads to the equation:

$$\left[\varepsilon - (1 - \delta)^2 - \frac{4k_\perp^2}{K_1^2}\right]\left[\varepsilon - (1 + \delta)^2 - \frac{4k_\perp^2}{K_1^2}\right] = U^2$$

that has solutions:

$$\varepsilon = 1 + \delta^2 + \frac{4k_\perp^2}{K_1^2} \pm \sqrt{4\delta^2 + U^2} \tag{3-18}$$

In the case U = 0, k⊥ = 0, this equation reproduces the two parabolas centered on 0 and K1 (Figure 3-1). A nonzero value of k⊥ simply shifts the parabolas to greater energies. At the edge of the Brillouin zone (indicated by the dashed line), where the two parabolas cross, δ is zero, and as k|| decreases, δ increases, reaching a maximum of 1 at the origin. When a small but finite value of U is introduced (for example U < 0.1) there is little effect on the curve away from the Brillouin zone edge, but as δ becomes comparable to, or less than, U its effect becomes significant. At the edge of the Brillouin zone the two E–k curves (corresponding to two different Fourier components of the wave function) no longer cross but are separated by a small gap such that ε = 1 ± U. This corresponds to a gap between the lowest two E–k curves of magnitude 2V1.
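
The two-component solution can be evaluated directly to see the gap open at the zone edge. The following is a minimal sketch under stated assumptions: the two retained equations are taken to couple C0 and C1 through V1, so that the product of the two bracketed terms equals V1² (consistent with the gap of 2V1 quoted above); units are chosen so that ħ²/2m = 1; and the function name and numerical values are hypothetical.

```python
import math

def two_band_energies(k_par, k_perp, K1, V1, hbar2_over_2m=1.0):
    """Roots of (E0 - E)(E1 - E) = V1**2 for the parabolas centered on 0 and K1."""
    e0 = hbar2_over_2m * (k_par ** 2 + k_perp ** 2)
    e1 = hbar2_over_2m * ((k_par - K1) ** 2 + k_perp ** 2)
    mean = 0.5 * (e0 + e1)
    split = math.sqrt(0.25 * (e0 - e1) ** 2 + V1 ** 2)
    return mean - split, mean + split

K1 = 2 * math.pi   # reciprocal lattice vector for unit lattice parameter (assumed)
V1 = 0.2           # small potential amplitude (assumed)

# At the Brillouin zone edge the two parabolas would cross; the potential separates them.
lower, upper = two_band_energies(0.5 * K1, 0.0, K1, V1)
gap = upper - lower

# Well inside the zone the lower branch stays close to the free electron parabola.
far_lower, far_upper = two_band_energies(0.1 * K1, 0.0, K1, V1)
print(gap, far_lower)
```

With these values the gap at the zone edge equals 2V1, while at k|| = 0.1 K1 the lower branch differs from the free electron energy only slightly.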

The form of Equation 3-16 makes it clear what the effect of other harmonics in the periodic potential would be. For a potential of the form:

$$V(\mathbf{r}) = \sum_{h=1}^{\infty} V_h\left(e^{ih\mathbf{K}_1\cdot\mathbf{r}} + e^{-ih\mathbf{K}_1\cdot\mathbf{r}}\right)$$

Equation 3-16 becomes:

$$\left[\frac{\hbar^2}{2m}\left|\mathbf{k} - p\mathbf{K}_1\right|^2 - E\right] C_p + \sum_{h=1}^{\infty} V_h\left(C_{p-h} + C_{p+h}\right) = 0$$

The effect of Vh is to couple the equations involving Cp and Cp±h, and it therefore perturbs apart the parabolas centered on pK1 and on (p ± h)K1. Provided Vh is small, its effect is only significant near the edge of the Brillouin zone. Thus, for a more general periodic potential with higher harmonics, the E–k relationship takes the form shown in Figure 3-2.

The E–k diagram has now changed so that certain energies are forbidden. The allowed states exist within bands of permitted energies, with band gaps separating them. Figure 3-2 can be redrawn in various ways. In the figure the repeated zone scheme is shown. This scheme highlights the periodicity of the lattice and makes clear the concept of energy bands and band gaps. An alternative is to show only the nth band in the nth Brillouin zone (known as the extended zone scheme). This approach produces results that look similar to the equivalent (parabolic) plot for the Sommerfeld model, with gaps appearing in the curve at the edges of the Brillouin zones. Practically, the more compact reduced zone scheme is usually employed, which shows only the information in the first Brillouin zone (between the dashed lines). When the band structure information of real materials is displayed, it is typical to use the reduced zone scheme and to show the bands along several connected lines within the first Brillouin zone.
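
The formation of bands and gaps can also be reproduced without the two-component approximation. The sketch below is illustrative only and is not the method used in the original text: it assumes a 1D potential with a single harmonic that couples neighboring coefficients Cp and Cp±1 through V1, truncates the expansion to |p| ≤ `p_max`, uses units where ħ²/2m = 1, and finds the eigenvalues of the resulting symmetric tridiagonal matrix by Sturm-sequence bisection (a standard technique chosen here to avoid external libraries). All function names are hypothetical.

```python
import math

def count_eigenvalues_below(diag, off, x):
    """Sturm-sequence count of eigenvalues below x for a symmetric tridiagonal matrix."""
    count = 0
    q = 1.0
    for i, d in enumerate(diag):
        coupling = off[i - 1] ** 2 / q if i > 0 else 0.0
        q = d - x - coupling
        if q == 0.0:
            q = 1e-30  # nudge off an exact zero pivot
        if q < 0.0:
            count += 1
    return count

def eigenvalue(diag, off, index, tol=1e-10):
    """The index-th smallest eigenvalue (0-based), by bisection on the Sturm count."""
    radius = 2.0 * max(abs(v) for v in off)  # Gershgorin bound on the spectrum
    lo, hi = min(diag) - radius, max(diag) + radius
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if count_eigenvalues_below(diag, off, mid) > index:
            hi = mid
        else:
            lo = mid
    return 0.5 * (lo + hi)

def bands(k, K1, V1, p_max=5, n_bands=3):
    """Lowest n_bands energies at wave number k for the truncated set of equations."""
    diag = [(k - p * K1) ** 2 for p in range(-p_max, p_max + 1)]
    off = [V1] * (len(diag) - 1)
    return [eigenvalue(diag, off, n) for n in range(n_bands)]

K1 = 2 * math.pi   # reciprocal lattice vector for unit lattice parameter (assumed)
V1 = 0.2           # small potential amplitude (assumed)

edge = bands(0.5 * K1, K1, V1)     # Brillouin zone edge
center = bands(0.0, K1, V1)        # zone center
print(edge[1] - edge[0], center[0])
```

For a small V1 the gap between the two lowest bands at the zone edge is very close to 2V1, and the lowest band at the zone center stays close to the free electron value of zero.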

There are now several states corresponding to a given wave vector k with several associated energies. It is conventional to label the individual states with a band number, n, to distinguish between them. Thus the wave function Ψnk with energy Enk corresponds to the state in the nth band with wave vector k (note that in the above argument, the band index n corresponds to the harmonic p, which dominates in the Bloch function away from the Brillouin zone edges).

Before considering the effect of a potential varying in three dimensions, it is useful to visualize Equation 3-18 in a different way. Figure 3-3 shows two surfaces of constant energy for the solution to this equation (the nondimensionalized form of the equation is not used for the plot). In the free electron model the constant energy surfaces are spheres, centered on the origin. The periodicity of the structure means that there are now spheres centered on each of the reciprocal lattice points. The effect of the periodic potential is to split apart the spherical surfaces at the points where they would have overlapped, to form a set of nonintersecting surfaces.

In the more general case where the potential varies periodically in all three dimensions, exactly the same arguments apply, provided that the potential coefficients are small (that is, VG ≪ ħ²|G|²/8m). Where the spheres centered on the different lattice points K and K − G intersect, the effect of the potential is to cause the kind of remapping shown in Figure 3-2 and Figure 3-3: instead of crossing, the two surfaces split apart. The nearly free electron energy surfaces can therefore be constructed by drawing spheres of equal radius centered on each lattice point K and rejoining them in this manner where they intersect, to form a set of nonintersecting surfaces. Although this procedure sounds simple, in practice it produces rather complicated energy surfaces. Ref. 2 considers the example of a simple cubic material in detail, and Ref. 1 shows several examples of constant energy surfaces for different lattices.

Considering the approximations made in deriving the nearly free electron model, it is quite remarkable that the model agrees so well with the measured E–k surfaces of many real materials, particularly considering that the true potential is expected to vary rapidly in the vicinity of the atomic cores. The reason for the success of the nearly free electron theory is related to many-body effects. In materials where it is possible to divide the electrons into tightly bound core electrons and weakly bound valence electrons, the core electrons and the ions can be replaced by a pseudo-ion with a weakly varying pseudopotential, surrounded by the outermost valence electrons. The wave function for the valence states must be orthogonal to that of the core states and the pseudopotential is constructed to ensure that this is the case. The resulting potential varies much more slowly than the true potential as a result of a Pauli repulsion effect. This effect repels the valence electrons away from the core states in order to ensure that the corresponding wave functions are orthogonal. The nearly free electron model can then be applied to real materials if the pseudopotential, rather than the true potential, is used.

To compute the density of states in k-space the periodic boundary condition is applied, in this case on a crystal made up of Nc = N1× N2× N3 unit cells in the a1, a2, and a3 directions. Using the periodic boundary condition for the wave function gives the equation:

$$\psi(\mathbf{r} + N_i\mathbf{a}_i) = \psi(\mathbf{r}), \qquad i = 1, 2, 3$$

Equation 3-13 adds the additional requirements:

$$\psi(\mathbf{r} + N_i\mathbf{a}_i) = e^{iN_i\mathbf{k}\cdot\mathbf{a}_i}\,\psi(\mathbf{r})$$

Thus:

$$e^{iN_i\mathbf{k}\cdot\mathbf{a}_i} = 1$$

and the allowed values of k are:

$$\mathbf{k} = \frac{n_1}{N_1}\mathbf{b}_1 + \frac{n_2}{N_2}\mathbf{b}_2 + \frac{n_3}{N_3}\mathbf{b}_3$$

where n1, n2, and n3 are integers. The reciprocal space volume per allowed k-vector is given by:

$$\Delta^3 k = \frac{\mathbf{b}_1}{N_1}\cdot\left(\frac{\mathbf{b}_2}{N_2}\times\frac{\mathbf{b}_3}{N_3}\right) = \frac{\xi_{BZ}}{N_c}$$

where ξBZ is the volume of the Brillouin zone in k-space. Consequently, the number of allowed wave vectors in a single Brillouin zone is equal to the number of unit cells in the crystal. Since each state can accommodate two electrons of opposite spin, filling one band over the entire Brillouin zone corresponds to a crystal with two valence electrons per unit cell. The density of states in k-space is given by:

where the volume of the Brillouin zone (ξBZ = 8π³/Ωu, where Ωu is the unit cell volume) has been calculated explicitly using Equation 3-10 and vector algebra (note also that Ω = NcΩu is the volume of the crystal itself). This result is identical to that obtained by the Sommerfeld model.
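
The vector algebra behind the Brillouin zone volume can be checked numerically. The sketch below is an illustrative example under stated assumptions: it uses the standard definition of the reciprocal lattice vectors (each bi proportional to the cross product of the other two real space vectors, scaled by 2π over the unit cell volume), face-centered cubic primitive vectors with unit cube edge as sample input, an arbitrary crystal size, and hypothetical helper names.

```python
import math

def dot(u, v):
    return sum(ui * vi for ui, vi in zip(u, v))

def cross(u, v):
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])

def reciprocal_vectors(a1, a2, a3):
    """Return (b1, b2, b3) and the unit cell volume a1 . (a2 x a3)."""
    cell_volume = dot(a1, cross(a2, a3))
    scale = 2.0 * math.pi / cell_volume
    b1 = tuple(scale * c for c in cross(a2, a3))
    b2 = tuple(scale * c for c in cross(a3, a1))
    b3 = tuple(scale * c for c in cross(a1, a2))
    return (b1, b2, b3), cell_volume

# Face-centered cubic primitive vectors for a cube edge of 1 (assumed example).
a_vectors = ((0.0, 0.5, 0.5), (0.5, 0.0, 0.5), (0.5, 0.5, 0.0))
b_vectors, cell_volume = reciprocal_vectors(*a_vectors)

# Brillouin zone volume from the reciprocal lattice vectors.
bz_volume = dot(b_vectors[0], cross(b_vectors[1], b_vectors[2]))

# Crystal of N1 x N2 x N3 unit cells (assumed sizes): one allowed k per unit cell.
n_cells = (4, 5, 6)
allowed_k_count = n_cells[0] * n_cells[1] * n_cells[2]
volume_per_k = bz_volume / allowed_k_count
print(bz_volume, 8 * math.pi ** 3 / cell_volume, volume_per_k)
```

The two printed volumes agree, and dividing the zone volume by the number of unit cells gives the k-space volume associated with each allowed wave vector.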

The available states in the crystal are filled up in the same way as in the Sommerfeld model. The occupancy of the states is still given by Equation 3-5 and the Fermi level is defined by Equation 3-6. For a metal, the Fermi surface geometry in a periodic potential reflects the constant energy surfaces of the band structure at the Fermi energy (for example, surfaces like those shown in Figure 3-3). In semiconductors and insulators the Fermi energy lies within the band gap so there is no clear Fermi surface. However, in semiconductors the Fermi function slightly overlaps the band above (below) the Fermi level, known as the conduction band (or the valence band), and the states near the bottom (top) of the band have a low but significant probability of being occupied (unoccupied). Since it is these states that lead to the conductivity of semiconductors (see The Semiclassical Model), it is worth considering what form the density of states takes at the very edge of a band.

Consider first the minimum of a conduction band. Choose a new coordinate system such that the Taylor series for the E–k relationship expanded about the band minimum up to second order takes the form:

$$E = E_c + \frac{\hbar^2}{2}\left(\frac{k_x'^2}{m_x^*} + \frac{k_y'^2}{m_y^*} + \frac{k_z'^2}{m_z^*}\right) \tag{3-20}$$

Here Ec is the energy at the band minimum and the constants m∗x, m∗y, and m∗z are associated with the k′x, k′y, and k′z terms in the series, respectively. The reason for choosing this form of the (arbitrary) constants becomes apparent below. The coordinate system for the vectors k′ has its origin at the minimum of the band in k-space and is aligned so that Equation 3-20 applies in the form given. There are no first-order terms in the expansion because it is taken about a minimum of the E(k′) relationship (that is, the bottom of a band). For a fixed energy E, Equation 3-20 is the equation of an ellipsoidal surface. Close to the band edge the constant energy surfaces are therefore ellipsoids, with a semi-axis in the k′x direction given by

$$\frac{\sqrt{2m_x^*\,(E - E_c)}}{\hbar}$$

and similarly for the k′y and k′z directions. A given constant energy surface contains a volume given by

$$V_k = \frac{4\pi}{3}\,\frac{\left(8\,m_x^* m_y^* m_z^*\right)^{1/2}}{\hbar^3}\,(E - E_c)^{3/2}$$

The number of states enclosed by the constant energy surfaces is therefore

and the density of states gc (E) is given by:

$$g_c(E) = \frac{\sqrt{2}}{\pi^2\hbar^3}\left(m_x^* m_y^* m_z^*\right)^{1/2}(E - E_c)^{1/2} \tag{3-21}$$

This result is identical in form to Equation 3-7, except that the mass has been replaced by an effective mass, m∗ = (m∗x m∗y m∗z)^(1/3) (the constants in Equation 3-20 are named in a manner consistent with this result).
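
The role of the density-of-states effective mass can be checked numerically. The sketch below is a minimal illustration under stated assumptions: states are counted per unit volume of crystal with a k-space density of 1/(4π³) including spin, units are chosen so that ħ = 1, the band minimum is at zero energy, and the three masses are illustrative values only. The function names are hypothetical.

```python
import math

HBAR = 1.0  # units with hbar = 1 (assumed)

def states_below(energy, masses, e_c=0.0):
    """States per unit volume inside the constant energy ellipsoid.

    Assumes a k-space density of states of 1/(4*pi**3), spin included.
    """
    de = energy - e_c
    semi_axes = [math.sqrt(2.0 * m * de) / HBAR for m in masses]
    ellipsoid_volume = 4.0 / 3.0 * math.pi * semi_axes[0] * semi_axes[1] * semi_axes[2]
    return ellipsoid_volume / (4.0 * math.pi ** 3)

def dos(energy, m_eff, e_c=0.0):
    """Parabolic-band density of states written with a single effective mass."""
    return (2.0 * m_eff / HBAR ** 2) ** 1.5 * math.sqrt(energy - e_c) / (2.0 * math.pi ** 2)

masses = (0.19, 0.19, 0.98)   # illustrative anisotropic masses (assumed)
m_eff = (masses[0] * masses[1] * masses[2]) ** (1.0 / 3.0)

energy = 0.05
step = 1e-6
# Numerical derivative of the enclosed number of states with respect to energy.
numeric = (states_below(energy + step, masses) - states_below(energy - step, masses)) / (2.0 * step)
print(numeric, dos(energy, m_eff))
```

The derivative of the number of enclosed states matches the single-mass expression when the geometric mean of the three masses is used.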

The same argument can be applied to the top of the valence band (using a new axis system, k′′) leading to the similar result:

$$g_v(E) = \frac{1}{2\pi^2}\left(\frac{2m_h^*}{\hbar^2}\right)^{3/2}(E_v - E)^{1/2} \tag{3-22}$$

where Ev is the energy at the top of the valence band and m∗h is the corresponding effective mass.

At this point it is useful to consider an alternative representation of the wave function, known as the Wannier function. Wannier functions, Wn(r − R), are wave packets of Bloch functions (Ψnk(r)) that are localized at a particular lattice vector R. These are defined in the following way:

$$W_n(\mathbf{r} - \mathbf{R}) = \frac{1}{\sqrt{N}}\sum_{\mathbf{k}} e^{-i\mathbf{k}\cdot\mathbf{R}}\,\Psi_{n\mathbf{k}}(\mathbf{r})$$

where N is the number of unit cells in the crystal. The Wannier functions are orthogonal since:

$$\int W_n^*(\mathbf{r} - \mathbf{R})\,W_m(\mathbf{r} - \mathbf{R}')\,d^3r = \frac{1}{N}\sum_{\mathbf{k},\mathbf{k}'} e^{i(\mathbf{k}\cdot\mathbf{R} - \mathbf{k}'\cdot\mathbf{R}')}\int \Psi_{n\mathbf{k}}^*(\mathbf{r})\,\Psi_{m\mathbf{k}'}(\mathbf{r})\,d^3r = \delta_{nm}\,\delta_{\mathbf{R}\mathbf{R}'}$$

A single Bloch state can be represented in the form:

$$\Psi_{n\mathbf{k}}(\mathbf{r}) = \frac{1}{\sqrt{N}}\sum_{\mathbf{R}} e^{i\mathbf{k}\cdot\mathbf{R}}\,W_n(\mathbf{r} - \mathbf{R}) \tag{3-24}$$

Wannier functions are useful to represent tightly bound, localized states, since the spatial extent is limited. The functions themselves are not unique, since the phase of the Bloch states Ψnk(r) is arbitrary. However, it is possible to define a maximally localized Wannier function, which gives an intuitive picture of the bonding in the solid.
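
The orthogonality of Wannier functions centered on different lattice sites rests on a lattice sum over the allowed wave vectors. The following is a minimal sketch of that sum, assuming a 1D ring of N unit cells with the allowed wave numbers given by the periodic boundary condition and orthonormal Bloch states; the function name and numerical values are hypothetical.

```python
import cmath
import math

def phase_sum(n_cells, a, r_index, r_prime_index):
    """(1/N) * sum over allowed k of exp(i*k*(R - R')) for a 1D ring of n_cells cells."""
    total = 0.0
    for m in range(n_cells):
        k = 2.0 * math.pi * m / (n_cells * a)   # allowed wave numbers in one Brillouin zone
        total += cmath.exp(1j * k * (r_index - r_prime_index) * a)
    return total / n_cells

n_cells = 8   # number of unit cells (assumed)
a = 1.0       # lattice parameter (arbitrary units, assumed)

same_site = phase_sum(n_cells, a, 3, 3)
different_site = phase_sum(n_cells, a, 3, 5)
print(abs(same_site), abs(different_site))
```

The sum is one when the two sites coincide and vanishes otherwise, which is the Kronecker delta in the lattice vectors that appears in the orthogonality relation.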

Wannier functions are important because these form the basis of alternative approaches to computing the band structures of solids. For example, in the tight binding approximation (TBA) it is assumed that the Wannier function is an atomic orbital, enabling the wave function to be constructed directly from Equation 3-24. Instead of using a single atomic wave function, one can employ a linear combination of them, resulting in the linear combination of atomic orbitals (LCAO) approach.

The band structure of silicon is illustrated in Figure 3-4. Although the three-dimensional band structure is considerably more complex than the simple picture described, many of the principles described are appropriate. Silicon is an indirect band-gap semiconductor, which means that the bottom of the conduction band occurs at a different point in k-space from the top of the valence band. The valence band maxima occur at the center of the Brillouin zone. The conduction band minima occur at approximately 4/5 of the distance from the zone center to its edge along the kx, ky, and kz axes.

Considering first the conduction bands, there are six symmetry equivalent minima in the locations shown in Figure 3-4. Physically, the form of the energy density of states is important in determining the transport properties of the semiconductor. All six of the conduction band minima are equivalent and consequently the contributions to the density of states can be added. The constant energy surfaces near the band minima are close to being ellipsoidal, so Equation 3-21 gives a good description of the energy density of states. The transport properties of the band can be characterized by a single effective mass without a significant loss of accuracy in the model.

There are two coincident valence band maxima located at point Γ. An additional valence band, with a slightly lower maximum energy (produced by spin-orbit coupling, see Ref. 1) is also located at this point. Each of these bands has a different effective mass associated with it. It is common to represent the effect of the three valence bands with an average density of states so that Equation 3-22 is assumed. Strictly speaking this assumption is less accurate for holes than it is for electrons, as a result of the different energies associated with the band maxima.

This discussion motivates the adoption of the so-called one-band model, in which a single valence and conduction band is considered in the transport model. The one-band model can be applied to many practical semiconducting materials.