In this lab, you'll create sinusoids at a variety of sampling rates, both above and below Nyquist, and compare the resulting stem plots. Then you'll downsample a sinusoid, and interpolate it back to the original sampling rate, using a variety of different interpolation kernels.
In order to make sure everything works, you might want to go to the command line, and run
pip install -r requirements.txt
This will install the modules that are used on the autograder, including numpy, h5py, and the gradescope utilities.
import submitted
import importlib
import numpy as np
import matplotlib.pyplot as plt
import h5py
In this first part of the MP, you will create a sinusoid, and sample it at different sampling rates. The four parameters will be
frequency
$f=$ frequency of the sinusoid (Hertz)phasor
$z=re^j\phi=$ phasor encoding its amplitude and phaseduration
$d=$ duration (seconds)samplerate
$F_s=$ sampling rate (samples/second)The returned signal should be $$x[n] = \Re\{ z e^{j2\pi nf/F_s}\}$$
import importlib
importlib.reload(submitted)
help(submitted.sinusoid)
Help on function sinusoid in module submitted: sinusoid(frequency, phasor, duration, samplerate) timeaxis, signal = sinusoid(frequency, phasor, duration, samplerate) Generate a sinusoid. frequency (real scalar) - frequency of the sinusoid, in Hertz phasor (complex scalar) - magnitude times e^{j phase} duration (real scalar) - duration, in seconds samplerate (real scalar) - sampling rate, in samples/second timeaxis (array) - sample times, from 0 to duration, including endpoints signal (array) - the generated sinusoid, length = int(duration*samplerate+1)
First, let's generate a sinusoid with plenty of samples, so we make sure it plots correctly. We'll use stem
to plot it, so we can see each sample.
importlib.reload(submitted)
timeaxis, signal = submitted.sinusoid(4, 1, 1, 32)
plt.figure(figsize=(14, 5))
plt.stem(timeaxis, signal)
plt.xlabel('Time (seconds)')
plt.title('4Hz cosine, sampled at 32 samples/second')
Text(0.5, 1.0, '4Hz cosine, sampled at 32 samples/second')
Now let's see what happens when we gradually lower the sampling rate.
fig, ax = plt.subplots(4,1, figsize=(14,10))
samplerates = [16, 10, 8, 6]
for k in range(4):
timeaxis, signal = submitted.sinusoid(4, 1, 1, samplerates[k])
ax[k].stem(timeaxis, signal)
ax[k].set_title('4Hz sinusoid with a sampling rate of %d Hz'%(samplerates[k]))
Notice that, as long as you have at least two samples per period ($F_s\ge 2f$), you can reconstruct the original cosine (e.g., there are four sign-reversals in one second, so you can see that it's a 4Hz sinusoid).
When $F_s < 2f$, you can no longer see the real underlying cosine. It has been "aliased" to a new "alias frequency," $f_a=F_s-f$. That's because, for any integer $n$,
$$\cos\left(\frac{2\pi f n}{F_s}\right) = \cos\left(\frac{2\pi (F_s-f) n}{F_s}\right)$$The highest frequency at which the cosine can be is called the "Nyquist rate", and it is $$F_N = \frac{1}{2}F_s$$
Let's adjust the sampling frequency so that 4Hz is just barely above or just barely below Nyquist.
fig, ax = plt.subplots(5,1, figsize=(14,10))
samplerates = [9, 8, 7, 6, 5]
for k in range(5):
timeaxis, signal = submitted.sinusoid(4, 1, 1, samplerates[k])
ax[k].stem(timeaxis, signal)
alias = min(4, samplerates[k]-4)
ax[k].set_title('4Hz, at $F_s$=%d Hz, looks like %dHz'%(samplerates[k], alias))
ax[k].set_ylim(-1,1)
fig.tight_layout()
Notice that, when $f > F_s/2$, it looks as though the sinusoid has a frequency of $F_s-f$. This is called its "aliased frequency." When the sampling rate gets even lower, aliasing can be even worse. The aliased frequency in general is
$$f_a = \min\left(f \mbox{mod} F_s, (F_s-f)\mbox{mod} F_s\right)$$where $\mbox{mod}$ means modulo (written in python as f % Fs
). Let's see some examples.
fig, ax = plt.subplots(5,1, figsize=(14,10))
samplerates = [4.5, 4, 3.5, 3, 2.5]
for k in range(5):
timeaxis, signal = submitted.sinusoid(4, 1, 2, samplerates[k])
ax[k].stem(timeaxis, signal)
alias = min(4 % samplerates[k], (samplerates[k]-4) % samplerates[k])
ax[k].set_title('4Hz, at $F_s$=%g Hz, looks like %gHz'%(samplerates[k], alias))
ax[k].set_ylim(-1,1)
fig.tight_layout()
Exactly the same thing happens to sine waves as to cosines, except that:
In order to plot sine waves, we'll use a phasor of $-j=e^{-j\pi/2}$, to give it a phase of $-\pi/2$.
fig, ax = plt.subplots(5,1, figsize=(14,10))
samplerates = [10,9,8,7,6]
for k in range(5):
timeaxis, signal = submitted.sinusoid(4, -1j, 1, samplerates[k])
ax[k].stem(timeaxis, signal)
alias = min(4 % samplerates[k], (samplerates[k]-4) % samplerates[k])
ax[k].set_title('4Hz, at $F_s$=%g Hz, looks like %gHz'%(samplerates[k], alias))
ax[k].set_ylim(-1,1)
fig.tight_layout()
These two equations: $$\cos\left(\frac{2\pi f n}{F_s}\right) = \cos\left(\frac{2\pi (F_s-f) n}{F_s}\right)$$ $$\sin\left(\frac{2\pi f n}{F_s}\right) = -\sin\left(\frac{2\pi (F_s-f) n}{F_s}\right)$$
...can be combined to give this equation: $$\Re\left\{z \exp\left(j\frac{2\pi f n}{F_s}\right)\right\}=\Re\left\{z^* \exp\left(j\frac{2\pi (F_s-f) n}{F_s}\right)\right\}$$
In other words,
For example, let's look at the signal $$x(t) = \cos\left(2\pi ft-\frac{\pi}{4}\right)$$
This is a cosine delayed by $\pi/4$ radians. It's positive for about $3/8$ of a period, then negative for $1/2$ period, and so on.
fig, ax = plt.subplots(5,1, figsize=(14,10))
samplerates = [30,11,10,6,5]
for k in range(5):
phasor = np.exp(-1j*np.pi/4)
timeaxis, signal = submitted.sinusoid(4, phasor, 1, samplerates[k])
ax[k].stem(timeaxis, signal)
alias = min(4 % samplerates[k], (samplerates[k]-4) % samplerates[k])
ax[k].set_title('4Hz, at $F_s$=%g Hz, looks like %gHz'%(samplerates[k], alias))
ax[k].set_ylim(-1,1)
fig.tight_layout()
Sometimes it's useful to explicitly calculate the aliased frequency and aliased phasor (HINT: you probably want to use the np.mod
method):
importlib.reload(submitted)
help(submitted.compute_aliasing)
Help on function compute_aliasing in module submitted: compute_aliasing(frequencies, phasors, samplerates) aliased_freqs, aliased_phasors = compute_aliasing(frequencies, phasors, samplerates) Find the frequency and phasor of sinusoid aliases. All arguments should have same length. frequencies (real array) - frequencies of the sinusoids, in Hertz phasors (complex array) - magnitudes times e^{j phases} samplerates (real array) - sampling rates, in samples/second aliased_freqs (real array) - frequencies at which sinusoids seems to occur, in Hertz aliased_phasors (complex array) - phasors with which sinusoids seems to occur
importlib.reload(submitted)
samplerates = np.array([30,11,10,6,5])
freqs = np.repeat(4, 5)
phasors = np.repeat(np.exp(-1j*np.pi/4), 5)
aliased_freqs, aliased_phasors = submitted.compute_aliasing(freqs, phasors, samplerates)
print(aliased_freqs)
print(aliased_phasors)
[4 4 4 2 1] [0.70710678-0.70710678j 0.70710678-0.70710678j 0.70710678-0.70710678j 0.70710678+0.70710678j 0.70710678+0.70710678j]
If we plot a sinusoid at the aliased frequency and aliased phasor, it should look exactly the same as the original sinusoid.
importlib.reload(submitted)
time1, original = submitted.sinusoid(freqs[-1],phasors[-1],2,samplerates[-1])
time2, alias = submitted.sinusoid(aliased_freqs[-1],aliased_phasors[-1],2,samplerates[-1])
time3, oops = submitted.sinusoid(aliased_freqs[-1],aliased_phasors[0],2,samplerates[-1])
fig, ax = plt.subplots(3,1, figsize=(14,10))
ax[0].stem(time1, original)
ax[0].set_title('Sinusoid at %dHz, %dHz samplerate'%(freqs[-1],samplerates[-1]))
ax[1].stem(time2, alias)
ax[1].set_title('Sinusoid at %dHz, %dHz samplerate'%(aliased_freqs[-1],samplerates[-1]))
ax[2].stem(time3, oops)
ax[2].set_title('Aliased freq, but original phasor')
fig.tight_layout()
Remember that, if you have a signal that's periodic with a period $T_0$, then you can find its Fourier series coefficients using the Fourier analysis formula: $$X_k = \frac{1}{T_0}\int_{0}^{T_0} x(t) e^{-j\frac{2\pi kt}{T_0}}dt$$
We can get a pretty good measurement of $X_k$ on a computer by setting the sampling rate high enough, replacing the integral by a sum, and multiplying the integrand by $F_s$ (to represent $dt=1/F_s$). After simplifying a little, we get: $$X_k = \frac{1}{N_0} \sum_{n=0}^{N_0-1} x[n]e^{-j\frac{2\pi kn}{N_0}}$$ where $$N_0 = T_0F_s$$ During the rest of this MP, it will be useful to have a good Fourier analysis function, so let's write it.
importlib.reload(submitted)
help(submitted.fourier_analysis)
Help on function fourier_analysis in module submitted: fourier_analysis(signal, number_of_coefficients) coefficients = fourier_analysis(signal, number_of_coefficients) Find the Fourier series coefficients using the discrete-time Fourier analysis formula. signal (array of length N_0) = one period of the signal number_of_coefficients (scalar) = number of coefficients to compute, starting with X_0 coefficients (array of length=number_of_coefficients) = X_0 through X_{number_of_coefficients-1}
Remember that, if a signal is periodic with period $N_0$, then it's also periodic with any period that's a multiple of $N_0$. We can use that fact to make the plot easier to read, e.g., by using four periods of a cosine as if they were just one period; that way the coefficient $X_1$ will be plotted as if it were $X_4$.
Let's choose a cosine with a really long period, say, 50 samples, and then compute the Fourier series coefficients over a space of 200 samples.
n = np.arange(200)
signal = np.cos(2*np.pi*n/50)
k = np.arange(41)/4
coefficients = submitted.fourier_analysis(signal, 41)
fig, ax = plt.subplots(2,1,figsize=(14,6))
ax[0].stem(n, signal)
ax[0].set_ylabel('$x[n]$')
ax[0].set_xlabel('$n$')
ax[1].stem(k, np.abs(coefficients))
ax[1].set_ylabel('$|X_k|$')
ax[1].set_xlabel('$k$')
Text(0.5, 0, '$k$')
Let's define the triangle function as follows: $$h[n]=\begin{cases}1-\frac{|t|}{T}&-T+1\le n\le T-1\\0&\mbox{otherwise}\end{cases}$$
This function will be useful when we try to interpolate signals, in part 5. Go ahead and write a function to compute it. Notice that you only need to include the nonzero samples, not the zero-valued samples.
importlib.reload(submitted)
help(submitted.triangle)
Help on function triangle in module submitted: triangle(T) timeaxis, h = triangle(T) Return a triangle function of length 2*T-1. T (scalar) - length of each side of the triangle, in samples timeaxis (array, length 2*T-1) - sample indices, from -(T-1) through (T-1) h (array, length 2*T-1) - the triangle function, 1 - abs(timeaxis)/T
importlib.reload(submitted)
t_timeaxis, triangle = submitted.triangle(25)
plt.figure(figsize=(14,4))
plt.plot(t_timeaxis, triangle)
[<matplotlib.lines.Line2D at 0x7f852e7f9a60>]
We are now going to simulate a discrete-to-continuous time converter by downsampling our 50-sample cosine by a factor of 25, and then upsampling it by the same factor of 25. The upsampling operation will use the same four types of interpolation that are used in typical D/C converters: rectangle, triangle, spline, and sinc.
First, let's downsample:
lowrate_signal = signal[::25]
lowrate_n = n[::25]
plt.figure(figsize=(14,4))
plt.stem(lowrate_n, lowrate_signal)
plt.title('Cosine, sampled with only two samples per period')
Text(0.5, 1.0, 'Cosine, sampled with only two samples per period')
Now let's pretend we're interpolating it into continuous time. The general formula for interpolation is
$$x(t) = \sum_{n=-\infty}^\infty x[n] h\left(t-nT\right)$$where $T$ is the sampling period.
In order to simulate D-to-C conversion, we will use $t$ to represent time in the highrate signal (e.g., $0\le t<200$) instead of time in the continuous-time signal. We'll use $n$ to represent time in the lowrate signal (e.g., in the example, $0\le n< 8$), while $T$ is the ratio between the two sampling rates (e.g., $T=25$).
importlib.reload(submitted)
help(submitted.interpolate)
Help on function interpolate in module submitted: interpolate(lowrate_signal, T, kernel_timeaxis, kernel) highrate_signal = interpolate(lowrate_signal, T, kernel_timeaxis, kernel) Use lowrate-to-highrate conversion to simulate discrete-to-continuous conversion. lowrate_signal (length-N array) - the lowrate signal T (scalar) - ratio of highrate/lowrate, i.e., number of output samples per input sample kernel_timeaxis (array) - sample times of the kernel, at the highrate kernel (array) - the interpolation kernel. length(kernel)==length(kernel_timeaxis). highrate_signal (length-N*T array) - the highrate signal Note: in order to keep the output to only N*T samples, use modulo arithmetic for the interpolation, e.g., highrate_signal[np.mod(kernel_timeaxis+n*T, N*T)] += kernel * lowrate_signal[n]
Notice those last few lines from the docstring. The triangle function has a length of $2T-1$. The lowrate signal has a length of $N$. If you put in a triangle function in place of every lowrate sample, you'd get a total length of $NT+(T-1)$, but that's not what we want -- what we want is a signal of length $NT$. In order to make that happen, we'll use modulo indexing, i.e., the interpolation formula should be implemented as
for n in range(N):
You might find the function np.mod
to be useful.
importlib.reload(submitted)
highrate_signal = submitted.interpolate(lowrate_signal, 25, t_timeaxis, triangle)
fig,ax = plt.subplots(3,1,figsize=(14,10))
ax[0].stem(n,signal)
ax[0].set_title('Original Signal')
ax[1].stem(lowrate_n, lowrate_signal)
ax[1].set_title('Lowrate "Discrete Time" Signal')
ax[2].stem(n, highrate_signal)
ax[2].set_title('Highrate "Continuous Time" Signal using Triangle Interpolator')
Text(0.5, 1.0, 'Highrate "Continuous Time" Signal using Triangle Interpolator')
Well, it doesn't look very much like a cosine! In fact, it's just a linear interpolation between the lowrate samples.
In order to better understand what's going on, let's look at its spectrum. In the following plot you should see that most of the spectral energy is at $X_1$, but there is significant nonzero energy also at $X_3$, $X_5$, and so on:
highrate_coefficients = submitted.fourier_analysis(highrate_signal, 41)
fig, ax = plt.subplots(2,1,figsize=(14,6))
ax[0].stem(k, np.abs(coefficients))
ax[0].set_title('Fourier Series Coefficients of the Original Sinusoid')
ax[1].stem(k, np.abs(highrate_coefficients))
ax[1].set_title('Fourier Series Coefficients of the Triangle-Interpolated Sinusoid')
Text(0.5, 1.0, 'Fourier Series Coefficients of the Triangle-Interpolated Sinusoid')
Rectangle interpolation is much, much worse than triangle interpolation, but it's much easier to implement in hardware, so it's actually much more common in the real world. Let's try it, and see how it turns out:
importlib.reload(submitted)
help(submitted.rectangle)
Help on function rectangle in module submitted: rectangle(T) timeaxis, h = rectangle(T) Return a rectangle function of length T. T (scalar) - length, in samples timeaxis (length-T array) - sample indices, from 0 to T-1, corresponding to h h (length-T array) - the rectangle function
importlib.reload(submitted)
r_timeaxis, rectangle = submitted.rectangle(25)
plt.figure(figsize=(14,4))
plt.plot(r_timeaxis, rectangle)
[<matplotlib.lines.Line2D at 0x7f8531937640>]
importlib.reload(submitted)
highrate_signal = submitted.interpolate(lowrate_signal, 25, r_timeaxis, rectangle)
fig,ax = plt.subplots(3,1,figsize=(14,10))
ax[0].stem(n,signal)
ax[0].set_title('Original Signal')
ax[1].stem(lowrate_n, lowrate_signal)
ax[1].set_title('Lowrate "Discrete Time" Signal')
ax[2].stem(n, highrate_signal)
ax[2].set_title('Highrate "Continuous Time" Signal using Rectangle Interpolator')
Text(0.5, 1.0, 'Highrate "Continuous Time" Signal using Rectangle Interpolator')
highrate_coefficients = submitted.fourier_analysis(highrate_signal, 41)
fig, ax = plt.subplots(2,1,figsize=(14,6))
ax[0].stem(k, np.abs(coefficients))
ax[0].set_title('Fourier Series Coefficients of the Original Sinusoid')
ax[1].stem(k, np.abs(highrate_coefficients))
ax[1].set_title('Fourier Series Coefficients of the Rectangle-Interpolated Sinusoid')
Text(0.5, 1.0, 'Fourier Series Coefficients of the Rectangle-Interpolated Sinusoid')
One way of thinking about the superiority of triangle interpolation over rectangle interpolation is to say that the triangle-interpolated $x(t)$ is continuous, while the rectangle-interpolated $x(t)$ is not continuous.
We can make an even better interpolation if we try to make one that is continuous, and that has continuous first derivatives.
Remember that $$x(t) = \sum x[n] h(t-nT)$$
All of those constraints can be satisfied by using a cubic spline interpolation kernel. For example, the following cubic spline function satisfies these properties:
$$h(t) = \begin{cases} 1-\frac{3}{2}\left(\frac{|t|}{T}\right)^2+\frac{1}{2}\left(\frac{|t|}{T}\right)^3 & |t|\le T\\ -\frac{3}{2}\left(\frac{|t|-2T}{T}\right)^2\left(\frac{|t|-T}{T}\right)&T\le |t|\le 2T\\ 0 &\mbox{otherwise}\end{cases}$$importlib.reload(submitted)
help(submitted.spline)
Help on function spline in module submitted: spline(T) timeaxis, h = spline(T) Return a continuous spline interpolator with continuous first deriviative. T (scalar) - the upsampling factor timeaxis (array, length 4*T-1) - sample indices, from -(2*T-1) through (2*T-1) h (array, length 4*T-1) - the cubic spline interpolation kernel
importlib.reload(submitted)
s_timeaxis, spline = submitted.spline(25)
plt.figure(figsize=(14,4))
plt.plot(s_timeaxis, spline)
[<matplotlib.lines.Line2D at 0x7f85321086d0>]
importlib.reload(submitted)
highrate_signal = submitted.interpolate(lowrate_signal, 25, s_timeaxis, spline)
fig,ax = plt.subplots(3,1,figsize=(14,10))
ax[0].stem(n,signal)
ax[0].set_title('Original Signal')
ax[1].stem(lowrate_n, lowrate_signal)
ax[1].set_title('Lowrate "Discrete Time" Signal')
ax[2].stem(n, highrate_signal)
ax[2].set_title('Highrate "Continuous Time" Signal using Spline Interpolator')
Text(0.5, 1.0, 'Highrate "Continuous Time" Signal using Spline Interpolator')
highrate_coefficients = submitted.fourier_analysis(highrate_signal, 41)
fig, ax = plt.subplots(2,1,figsize=(14,6))
ax[0].stem(k, np.abs(coefficients))
ax[0].set_title('Fourier Series Coefficients of the Original Sinusoid')
ax[1].stem(k, np.abs(highrate_coefficients))
ax[1].set_title('Fourier Series Coefficients of the Spline-Interpolated Sinusoid')
Text(0.5, 1.0, 'Fourier Series Coefficients of the Spline-Interpolated Sinusoid')
The spline-interpolated sinusoid looks a whole lot better than the triangle-interpolated sinusoid, doesn't it? The Fourier series coefficients $X_3$ and $X_5$ are almost zero, but not quite -- it's hard to see in the plot above, but they do include some energy. Such energy may not be visible in an image, but human ears can sometimes hear it.
One might ask, is there any interpolation kernel for which all the derivatives are continuous? In other words, not just $h'(t)=\frac{dh}{dt}$, but also $h''(t)=\frac{d^2h}{dt^2}$ is continuous, and so is $h'''(t)=\frac{d^3h}{dt^3}$, and so on? The answer is yes. The kernel with all derivatives continuous is called the sinc function, defined as:
$$h(t)=\begin{cases} 1 & t=0\\ \frac{\sin(\pi t/T)}{(\pi t/T)} & \mbox{otherwise} \end{cases}$$This is actually an infinite-length interpolation kernel, which is kind of inconvenient! In order to actually use the sinc interpolator in practice, we need to truncate it after some fixed duration, as described in the docstring:
importlib.reload(submitted)
help(submitted.sinc)
Help on function sinc in module submitted: sinc(T, D) timeaxis, h = sinc(T, D) Return D samples from the center of h(t)=sin(pi*t/T) / (pi*t/T). T (scalar) - the upsampling factor D (scalar) - the duration of the returned kernel; always an odd number. timeaxis (array, length D) - sample indices, from -(D-1)/2 through (D-1)/2 h (array, length 4*T-1) - the sinc interpolation kernel
importlib.reload(submitted)
sinc_timeaxis, sinc = submitted.sinc(25,199)
plt.figure(figsize=(14,4))
plt.plot(sinc_timeaxis, sinc)
[<matplotlib.lines.Line2D at 0x7f8532463430>]
importlib.reload(submitted)
highrate_signal = submitted.interpolate(lowrate_signal, 25, sinc_timeaxis, sinc)
fig,ax = plt.subplots(3,1,figsize=(14,10))
ax[0].stem(n,signal)
ax[0].set_title('Original Signal')
ax[1].stem(lowrate_n, lowrate_signal)
ax[1].set_title('Lowrate "Discrete Time" Signal')
ax[2].stem(n, highrate_signal)
ax[2].set_title('Highrate "Continuous Time" Signal using Sinc Interpolator')
Text(0.5, 1.0, 'Highrate "Continuous Time" Signal using Sinc Interpolator')
highrate_coefficients = submitted.fourier_analysis(highrate_signal, 41)
fig, ax = plt.subplots(2,1,figsize=(14,6))
ax[0].stem(k, np.abs(coefficients))
ax[0].set_title('Fourier Series Coefficients of the Original Sinusoid')
ax[1].stem(k, np.abs(highrate_coefficients))
ax[1].set_title('Fourier Series Coefficients of the Spline-Interpolated Sinusoid')
Text(0.5, 1.0, 'Fourier Series Coefficients of the Spline-Interpolated Sinusoid')
If you reached this point in the notebook, then probably your code is working well, but before you run the autograder on the server, you should first run it on your own machine.
You can do that by going to a terminal, and running the following command line:
python grade.py
In order to help you debug, you have the file solutions.hdf5
. You can use it to debug like this:
import h5py
with h5py.File('solutions.hdf5','r') as f:
print(list(f.keys()))
['T', 'aliased_freqs', 'aliased_phasors', 'coefficients', 'duration', 'frequencies', 'frequency', 'highrate_signal', 'lowrate_signal', 'number_of_coefficients', 'phasor', 'phasors', 'r_timeaxis', 'rectangle', 's_timeaxis', 'samplerate', 'samplerates', 'signal', 'sinc', 'sinc_timeaxis', 'spline', 't_timeaxis', 'timeaxis', 'triangle']
As you can see, this file contains a lot of objects, created during a sample run of the solution code with random parameters. Let's plot the song that it was expecting:
importlib.reload(submitted)
with h5py.File('solutions.hdf5','r') as f:
highrate_signal=submitted.interpolate(f['lowrate_signal'],f['T'][0], f['t_timeaxis'], f['triangle'])
NT = len(highrate_signal)
fig, ax = plt.subplots(2,1,figsize=(14,10))
ax[0].plot(np.arange(NT),highrate_signal,np.arange(NT),f['highrate_signal'])
ax[0].set_title('My version and reference version of triangle-interpolation')
ax[1].plot(np.arange(NT),highrate_signal-f['highrate_signal'])
ax[1].set_title('Difference between the two')
If the lower plot has any nonzero values, those nonzero values are places where your solution differs from the reference solution. If they differ by more than 0.01, then your solution will probably be marked incorrect, so you should figure out why those things have happened.
You can earn up to 10% extra credit on this MP by finishing the file called extra.py
, and submitting it to the autograder.
This file has just one function, called extra.rate_conversion
, with the following signature:
import extra
help(extra.rate_conversion)
Help on function rate_conversion in module extra: rate_conversion(highrate_signal, highrate, lowrate) lowrate_signal, coefficients = rate_conversion(highrate_signal, highrate, lowrate) highrate_signal (real array, length N_0*T) - one period of the high-rate signal highrate (scalar) - the high sampling rate, expressed in samples/second lowrate (scalar) - the low sampling rate, expressed in samples/second lowrate_signal (real array, length N_0) - one period of the low-rate signal coefficients (complex array) - Fourier coefficients X_{-int((N0-1)/2)} through X_{int((N0-1)/2)}. This should perform a Fourier series analysis of the high-rate signal. Then resynthesize the low-rate signal using Fourier synthesis, with only the terms that are still valid at the low-rate, i.e., X_{-int((N0-1)/2)} through X_{int((N0-1)/2)}. You'll want to use a complex-valued lowrate_signal during Fourier synthesis, then return its real part.
Let's rewrite that docstring in terms of equations, rather than words.
The goal is to convert a signal from a high sampling rate to a low sampling rate, by doing Fourier analysis at the high rate, and then performing Fourier synthesis at the low rate. At the low rate, we'll keep only the terms that are below Nyquist at the low rate.
Fourier analysis at the high rate is done using:
$$X_k = \frac{1}{N_0T} \sum_{n=0}^{N_0T-1} x_h[n] e^{-j2\pi kn/N_0T}$$where $x_h[n]$ is the high-rate signal, $N_0$ is the length of one period at the low rate (equal to the length of lowrate_signal
), and $N_0T$ is the length of one period at the high rate (equal to the length of highrate_signal
).
Fourier synthesis at the low rate is then done using: $$x_l[n] = \sum_{k=-\mbox{int}((N_0-1)/2)}^{\mbox{int}((N_0-1)/2)} X_ke^{j2\pi kn/N_0}$$
Notice that, if $N_0$ is odd, this summation includes $N_0$ harmonics, all of which have frequencies below $F_s/2$. If $N_0$ is even, this summation includes only $N_0-1$ harmonics; the harmonic right at $kF_0=F_s/2$ is omitted.
Here's what happens if we load one period of the violin from MP1, and apply the extra credit assignment to it.
import soundfile as sf
x, highrate = sf.read('violin.wav')
highrate_signal = x[50000:50170]
lowrate = highrate/4
print('highrate is',highrate,'lowrate is', lowrate)
importlib.reload(extra)
lowrate_signal, coefficients = extra.rate_conversion(highrate_signal, highrate, lowrate)
highrate is 44100 lowrate is 11025.0
fig, ax = plt.subplots(3,1,figsize=(14,7))
ax[0].stem(highrate_signal)
ax[0].set_title('One period of a Violin at %d samples/second'%(highrate))
ax[1].stem(lowrate_signal)
ax[1].set_title('The same period, at %d samples/second'%(lowrate))
hw = int((len(lowrate_signal)-1)/2)
ax[2].stem(np.arange(-hw,hw+1), np.abs(coefficients))
ax[2].set_title('Fourier series coefficients')
fig.tight_layout()
You can compare your solutions to the reference solutions by opening extra_solutions.hdf5
, as shown here:
importlib.reload(extra)
with h5py.File('extra_solutions.hdf5','r') as f:
lowrate_signal, coefficients = extra.rate_conversion(f['highrate_signal'],f['highrate'][0], f['lowrate'][0])
fig, ax = plt.subplots(2,1,figsize=(14,10))
N0 = len(lowrate_signal)
ax[0].plot(np.arange(N0),lowrate_signal,np.arange(N0),f['lowrate_signal'])
ax[0].set_title('My version and reference version of rate_conversion')
ax[1].plot(np.arange(N0),lowrate_signal-f['lowrate_signal'])
ax[1].set_title('Difference between reference and hypothesis')