Auto and Cross Correlation

June 2, 2021

Back to: Fundamentals of Signal Processing

Auto-Correlation

The auto-correlation function R_xx(m) for a real-valued sequence x(n) is defined as:

(1) $\begin{equation*} R_{xx}(n,m)=E[x(n+m)x(n)] \end{equation*}$

If the data sequence x(n) is wide sense stationary, then R_xx(n,n+m) simplifies to:

(2) $\begin{equation*} R_{xx}(m)=E[x(m)x(0)] \end{equation*}$

The statistical properties do not depend on absolute time n but on time difference m.

R_xx(m) displays how statistically related a data sequence is to an m-sample delayed version of itself. For example, if you look at sample x(3), how strong is its statistical relationship with x(4)? How about x(5) or x(6)?

For many real-world systems, R_xx(m) generally decreases as m increases, meaning the data sequence x becomes less statistically related to itself as the delay increases.

Two properties of R_xx(m) are its center spike and symmetry.

Center Spike

R_xx(m) has a spike at the center m=0 location because:

(3) $\begin{equation*} R_{xx}(0)=E[x(0)x^{\star}(0)] \end{equation*}$

(4) $\begin{equation*} =E|x(0)|^{2} \end{equation*}$

(5) $\begin{equation*} =\text{variance of }x(0) \end{equation*}$

Symmetry

R_xx(m) is symmetric about the center m=0 location because:

(6) $\begin{equation} R_{xx}(-m)=E[x(-m)x(0)] \end{equation}$	by definition of R_xx(m)
(7) $\begin{equation} =E[x(-m+m)x(0+m)] \end{equation}$	by wide sense stationary property
(8) $\begin{equation} =E[x(m)x(0)] \end{equation}$	by commutative property
(9) $\begin{equation} =R_{xx}(m) \end{equation}$	by definition of R_xx(m)

Cross-Correlation

The cross-correlation function R_yx(m) is similar to the autocorrelation Rxx(m), but two sequences x and y are compared rather than solely x.

The cross-correlation function R_yx(m) for a real-valued sequence x(n) is defined as:

(10) $\begin{equation*} R_{yx}(n,m)=E[x(n+m)y(n)] \end{equation*}$

If the data sequence x(n) is wide sense stationary, then R_yx(n,n+m) simplifies to:

(11) $\begin{equation*} R_{yx}(m)=E[x(m)y(0)] \end{equation*}$

The statistical properties do not depend on absolute time n, only on the time difference m.

R_yx(m) displays how statistically related a data sequence x is to an m-sample delayed version of a data sequence y. For example, if you look at sample x(3), how strong is its statistical relationship with y(4)? How about y(6)?

Auto and Cross Correlation

Auto-Correlation

Center Spike

Symmetry

Cross-Correlation

References