homework 6, version 0


Submission by: Jazzy Doe (jazz@mit.edu)



Homework 6: Epidemic modeling III

18.S191, fall 2020

This notebook contains built-in, live answer checks! In some exercises you will see a coloured box, which runs a test case on your code, and provides feedback based on the result. Simply edit the code, run it, and the check runs again.

For MIT students: there will also be some additional (secret) test cases that will be run as part of the grading process, and we will look at your notebook and write comments.

Feel free to ask questions!

# edit the code below to set your name and kerberos ID (i.e. email without @mit.edu)

student = (name = "Jazzy Doe", kerberos_id = "jazz")

# you might need to wait until all other cells in this notebook have completed running.
# scroll around the page to see what's up

Let's create a package environment:

begin
	using Pkg
	Pkg.activate(mktempdir())
	Pkg.add([
		Pkg.PackageSpec(name="PlutoUI", version="0.6.7-0.6"),
		Pkg.PackageSpec(name="Plots", version="1.6-1"),
	])

	using Plots
	gr()
	using PlutoUI
end
  Activating new project at `/tmp/jl_DdRlKZ`
   Resolving package versions...
  [91a5bcdd] + Plots v1.40.14
  [7f904dfe] + PlutoUI v0.6.11
  (full Pkg output truncated: the remaining lines list the transitive dependencies added to the temporary environment)

We began this module with data on the COVID-19 epidemic, but then looked at mathematical models. How can we make the connection between data and models?

Models have parameters, such as the rate of recovery from infection. Where do the parameter values come from? Ideally we would like to extract them from data. The goal of this homework is to do this by fitting a model to data.

For simplicity we will use data that was generated from the spatial model in Homework 5, rather than real-world data, and we will fit the simplest SIR model. But the same ideas apply more generally.

There are many ways to fit a function to data, but all involve some form of optimization, usually the minimization of a particular function called a loss function; this is the basis of the vast field of machine learning.

The loss function is a function of the model parameters; it measures how far the model output is from the data, for the given values of the parameters.

We emphasise that this material is pedagogical; there is no suggestion that these specific techniques should be used for actual calculations. Rather, it is the underlying ideas that are important.


Exercise 1: Calculus without calculus


Before we jump into simulating the SIR equations, let's experiment with a simple 1D function. In calculus, we learn techniques for differentiating and integrating symbolic equations, e.g. $\frac{d}{dx} x^n = n x^{n-1}$. But in real applications, it is often impossible to apply these techniques, either because the problem is too complicated to solve symbolically, or because our problem has no symbolic expression, like when working with experimental results.

Instead, we use ✨ computers ✨ to approximate derivatives and integrals. Instead of applying rules to symbolic expressions, we use much simpler strategies that only use the output values of our function.

As a first example, we will approximate the derivative of a function. Our method is inspired by the analytical definition of the derivative!

$$f'(a) := \lim_{h \to 0} \frac{f(a+h) - f(a)}{h}.$$

The finite difference method simply fixes a small value for $h$, say $h = 10^{-3}$, and then approximates the derivative as:

$$f'(a) \approx \frac{f(a+h) - f(a)}{h}.$$
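
To see the formula in action before implementing it, here is a minimal sketch (our own illustration, not part of the exercise scaffolding) approximating the derivative of sin at 0, where the exact answer is cos(0) = 1:

# approximate sin'(0) with the finite-difference formula
h = 1e-3
approx = (sin(0 + h) - sin(0)) / h   # ≈ 0.9999998, close to the exact 1.0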


Exercise 1.1 - tangent line

👉 Write a function finite_difference_slope that takes a function f and numbers a and h. It returns the slope $f'(a)$, approximated using the finite difference formula above.

finite_difference_slope (generic function with 2 methods)
function finite_difference_slope(f::Function, a, h=1e-3)
	return missing
end
missing
finite_difference_slope(sqrt, 4.0, 5.0)

Here we go!

Replace missing with your answer.


👉 Write a function tangent_line that takes the same arguments f, a and h, but returns a function. This function ($\mathbb{R} \to \mathbb{R}$) is the tangent line with slope $f'(a)$ (computed using finite_difference_slope) that passes through $(a, f(a))$.


Hint

Remember that functions are objects! For example, here is a function that returns the square root function:

function the_square_root_function()
	f = x -> sqrt(x)
	return f
end
tangent_line (generic function with 1 method)
function tangent_line(f, a, h)
	return missing
end

Here we go!

Replace missing with your answer.

# this is our test function
wavy(x) = .1x^3 - 1.6x^2 + 7x - 3;

The slider below controls $h$ using a log scale. In the (mathematical) definition of the derivative, we take $\lim_{h \to 0}$. This corresponds to moving the slider to the left.

Notice that, as you decrease $h$, the tangent line gets more accurate, but what happens if you make $h$ too small?
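
What goes wrong for a tiny $h$ is floating-point cancellation: f(a+h) and f(a) agree in almost all of their digits, so the subtraction wipes out the precision. A small sketch of our own, using sqrt, whose exact slope at 4 is 0.25:

# finite-difference slope of sqrt at 4, for decreasing h
slope(h) = (sqrt(4 + h) - sqrt(4)) / h

slope(1e-3)    # ≈ 0.24998 (good)
slope(1e-8)    # ≈ 0.25000 (better)
slope(1e-15)   # far from 0.25: rounding error dominates the difference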

0.31622776601683794

Exercise 1.2 - antiderivative

In the finite differences method, we approximated the derivative of a function:

$$f'(a) \approx \frac{f(a+h) - f(a)}{h}$$

We can do something very similar to approximate the 'antiderivative' of a function. Finding the antiderivative means that we use the slope $f'$ to compute $f$ numerically!

This antiderivative problem is illustrated below. The only information that we have is the slope at any point $a \in \mathbb{R}$, and we have one initial value, $f(1)$.

# in this exercise, only the derivative is given
wavy_deriv(x) = .3x^2 - 3.2x + 7;

Using only this information, we want to reconstruct $f$.

By rearranging the equation above, we get the Euler method:

$$f(a+h) \approx h f'(a) + f(a)$$

Using this formula, we only need to know the value $f(a)$ and the slope $f'(a)$ of a function at $a$ to get the value at $a+h$. Doing this repeatedly can give us the value at $a+2h$, at $a+3h$, etc., all from one initial value $f(a)$.
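
As a concrete check of the formula (our own toy numbers, not part of the exercise): take $f'(x) = 2x$, whose antiderivative through $f(1) = 1$ is $f(x) = x^2$, and step with $h = 0.5$:

# one Euler step by hand for f'(x) = 2x, starting from f(1) = 1
fprime(x) = 2x
a, h, fa = 1.0, 0.5, 1.0
f_next = h * fprime(a) + fa   # 0.5 * 2.0 + 1.0 = 2.0; the exact f(1.5) is 2.25

The gap between 2.0 and 2.25 is the discretisation error; it shrinks as $h$ does.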

👉 Write a function euler_integrate_step that applies this formula to a known derivative fprime at $a$, with step size $h$ and the initial value $f(a)$. It returns the next value, $f(a+h)$.

euler_integrate_step (generic function with 1 method)
function euler_integrate_step(fprime::Function, fa::Number,
		a::Number, h::Number)
	return missing
end

Here we go!

Replace missing with your answer.


👉 Write a function euler_integrate that takes a known derivative fprime, the initial value $f(a)$ and a range $T$ with a == first(T) and h == step(T). It applies the function euler_integrate_step repeatedly, once per entry in $T$, to produce the sequence of values $f(a+h)$, $f(a+2h)$, etc.

euler_integrate (generic function with 1 method)
function euler_integrate(fprime::Function, fa::Number,
		T::AbstractRange)
	a0 = T[1]
	h = step(T)
	return missing
end

Let's try it out on $f'(x) = 3x^2$ and $T$ ranging from $0$ to $10$.

We already know the analytical solution $f(x) = x^3$, so the result should be an array going from (approximately) 0.0 to 1000.0.

missing
euler_test = let
	fprime(x) = 3x^2
	T = 0 : 0.1 : 10
	euler_integrate(fprime, 0, T)
end

Here we go!

Replace missing with your answer.


You see that our numerical antiderivative is not very accurate, but we can get a smaller error by choosing a smaller step size. Try it out!

There are also alternative integration methods that are more accurate with the same step size. Some methods also use the second derivative, other methods use multiple steps at once, etc.! This is the study of Numerical Methods.
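
As a taste of those alternatives, here is a sketch of our own (not part of the homework) of the midpoint method, which samples the slope halfway across the step and is noticeably more accurate than Euler at the same $h$:

# midpoint rule: evaluate the slope at a + h/2 instead of at a;
# its error per step is O(h^3), versus O(h^2) for the Euler step
function midpoint_integrate_step(fprime::Function, fa::Number, a::Number, h::Number)
	return fa + h * fprime(a + h/2)
end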


Exercise 2: Simulating the SIR differential equations

Recall from the lectures that the ordinary differential equations (ODEs) for the SIR model are as follows:

$$\begin{aligned}
\dot{s} &= -\beta s i \\
\dot{i} &= +\beta s i - \gamma i \\
\dot{r} &= +\gamma i
\end{aligned}$$

where $\dot{s} := \frac{ds}{dt}$ is the derivative of $s$ with respect to time. Recall that $s$ denotes the proportion (fraction) of the population that is susceptible, a number between 0 and 1.

We will use the simplest possible method to simulate these, namely the Euler method. The Euler method is not always a good method to solve ODEs accurately, but for our purposes it is good enough.

In the previous exercise, we introduced the Euler method for a 1D function, which you can see as an ODE that only depends on time. For the SIR equations, we have an ODE that only depends on the previous value, not on time, and we have 3 equations instead of 1.

The solution is quite simple: we apply the Euler method to each of the differential equations within a single time step to get new values for each of $s$, $i$ and $r$ at the end of the time step, in terms of the values at the start of the time step. The Euler-discretised equations are:

$$\begin{aligned}
s(t+h) &= s(t) - h\,\beta s(t) i(t) \\
i(t+h) &= i(t) + h\,(\beta s(t) i(t) - \gamma i(t)) \\
r(t+h) &= r(t) + h\,\gamma i(t)
\end{aligned}$$
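
To make the update concrete, here is a single step computed with toy numbers of our own ($\beta = 0.3$, $\gamma = 0.15$, $h = 0.1$, starting from $s = 0.99$, $i = 0.01$, $r = 0$):

# one Euler update of the SIR state, applying the three formulas above
β, γ, h = 0.3, 0.15, 0.1
s, i, r = 0.99, 0.01, 0.0

s_new = s - h * β * s * i            # 0.99 - 0.1 * 0.00297 ≈ 0.989703
i_new = i + h * (β * s * i - γ * i)  # 0.01 + 0.1 * 0.00147 ≈ 0.010147
r_new = r + h * γ * i                # 0.0  + 0.1 * 0.0015  = 0.00015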

👉 Implement a function euler_SIR_step(β, γ, sir_0, h) that performs a single Euler step for these equations with the given parameter values and initial values, with a step size h.

sir_0 is a 3-element vector, and you should return a new 3-element vector with the values after the timestep.

euler_SIR_step (generic function with 1 method)
function euler_SIR_step(β, γ, sir_0::Vector, h::Number)
	s, i, r = sir_0

	return [
		missing,
		missing,
		missing,
	]
end
euler_SIR_step(0.1, 0.05,
	[0.99, 0.01, 0.00],
	0.1)

👉 Implement a function euler_SIR(β, γ, sir_0, T) that applies the previously defined function over a time range T.

You should return a vector of vectors: a 3-element vector for each point in time.

euler_SIR (generic function with 1 method)
function euler_SIR(β, γ, sir_0::Vector, T::AbstractRange)
	# T is a range; you get the step size and number of steps like so:
	h = step(T)
	num_steps = length(T)

	return missing
end
0.0:0.1:60.0
sir_T = 0 : 0.1 : 60.0
missing
sir_results = euler_SIR(0.3, 0.15,
	[0.99, 0.01, 0.00],
	sir_T)

Let's plot s, i and r as a function of time.

Error message

MethodError: no method matching getindex(::Missing, ::Int64)

This cell will keep erroring until euler_SIR above returns real results instead of missing, since plot_sir! indexes into each 3-element result vector.

plot_sir!(plot(), sir_T, sir_results)
plot_sir! (generic function with 1 method)

👉 Do you see an epidemic outbreak (i.e. a rapid growth in number of infected individuals, followed by a decline)? What happens after a long time? Does everybody get infected?


blabla

default_SIR_parameters_observation = md"""
blabla
"""

👉 Make an interactive visualization in which you vary β and γ. What relation should β and γ have for an epidemic outbreak to occur?



Exercise 3: Numerical gradient

For fitting we need optimization, and for optimization we will use derivatives (rates of change). In Exercise 1, we wrote a function finite_difference_slope(f, a) to approximate $f'(a)$. In this exercise we will write a function to compute partial derivatives.


Exercise 3.1

👉 Write functions ∂x(f, a, b) and ∂y(f, a, b) that calculate the partial derivatives $\frac{\partial f}{\partial x}$ and $\frac{\partial f}{\partial y}$ at $(a, b)$ of a function $f: \mathbb{R}^2 \to \mathbb{R}$ (i.e. a function that takes two real numbers and returns one real).

Recall that $\frac{\partial f}{\partial x}$ is the derivative of the single-variable function $g(x) := f(x, b)$ obtained by fixing the value of $y$ to $b$.

You should use anonymous functions for this. These have the form x -> x^2, meaning "the function that sends $x$ to $x^2$".
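
For instance (our own illustration of the technique, not the exercise solution), fixing the second argument of a two-argument function yields a one-argument function that any 1D tool can consume:

# fix y = b to turn f(x, y) into a single-variable slice along y = b
f = (x, y) -> 7x^2 + y
b = 7
g = x -> f(x, b)

g(3)   # 7 * 3^2 + 7 = 70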

∂x (generic function with 1 method)
function ∂x(f::Function, a, b)
	return missing
end

missing
∂x(
	(x, y) -> 7x^2 + y,
	3, 7
)

Here we go!

Replace missing with your answer.

∂y (generic function with 1 method)
function ∂y(f::Function, a, b)
	return missing
end

missing
∂y(
	(x, y) -> 7x^2 + y,
	3, 7
)

Here we go!

Replace missing with your answer.


Exercise 3.2

👉 Write a function gradient(f, a, b) that calculates the gradient of a function $f$ at the point $(a, b)$, given by the vector $\nabla f(a, b) := \left( \frac{\partial f}{\partial x}(a, b), \frac{\partial f}{\partial y}(a, b) \right)$.

gradient (generic function with 1 method)
function gradient(f::Function, a, b)
	return missing
end

missing
gradient(
	(x, y) -> 7x^2 + y,
	3, 7
)

Here we go!

Replace missing with your answer.


Exercise 4: Minimisation using gradient descent

In this exercise we will use gradient descent to find local minima of (smooth enough) functions.

To do so we will think of a function as a hill. To find a minimum we should "roll down the hill".

Exercise 4.1

We want to minimize a 1D function, i.e. a function $f: \mathbb{R} \to \mathbb{R}$. To do so we notice that the derivative tells us the direction in which the function increases: positive slope means that the minimum is to the left, negative slope means it is to the right. So our gradient descent method is to take steps in the opposite direction, of a small size $\eta \cdot f'(x_0)$.
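
Written out with the derivative computed exactly (our own illustration; the exercise approximates it with finite_difference_slope instead), two descent steps on $f(x) = x^2$ look like this:

# two gradient descent steps on f(x) = x^2, whose slope is 2x
η = 0.1
x = 5.0
x = x - η * 2x   # 5.0 - 0.1 * 10.0 = 4.0
x = x - η * 2x   # 4.0 - 0.1 * 8.0  = 3.2, sliding toward the minimum at 0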

👉 Write a function gradient_descent_1d_step(f, x0) that performs a single gradient descent step, from the point x0 and using your function finite_difference_slope to approximate the derivative. The result should be the next guess for x.

gradient_descent_1d_step (generic function with 1 method)
function gradient_descent_1d_step(f, x0; η=0.01)
	return missing
end

missing
let
	f = x -> x^2
	# the minimum is at 0, so we should take a small step to the left
	gradient_descent_1d_step(f, 5)
end

Here we go!

Replace missing with your answer.


x0 = -1

gradient_1d_viz (generic function with 1 method)

👉 Write a function gradient_descent_1d(f, x0) that repeatedly applies the previous function (N_steps times), starting from the point x0, like in the visualisation above. The result should be the final guess for x.

gradient_descent_1d (generic function with 1 method)
function gradient_descent_1d(f, x0; η=0.01, N_steps=1000)
	return missing
end

missing
let
	f = x -> (x - 5)^2 - 3
	# minimum should be at x = 5
	gradient_descent_1d(f, 0.0)
end

Here we go!

Replace missing with your answer.

let
	result = gradient_descent_1d(10) do x
		(x - 5pi) ^ 2 + 10
	end
	if result isa Missing
		still_missing()
	elseif !(result isa Real)
		keep_working(md"You need to return a number.")
	else
		error = abs(result - 5pi)
		if error > 5.0
			almost(md"It's not accurate enough yet. Maybe you need to increase the number of steps?")
		elseif error > 0.02
			keep_working()
		else
			correct()
		end
	end
end

Right now we take a fixed number of steps, even if the minimum is found quickly. What would be a better way to decide when to end the function?


blabla

better_stopping_idea = md"""
blabla
"""

Exercise 4.2

Multivariable calculus tells us that the gradient $\nabla f(a, b)$ at a point $(a, b)$ is the direction in which the function increases the fastest. So again we should take a small step in the opposite direction. Note that the gradient is a vector which tells us which direction to move in the plane $(a, b)$. We multiply this vector by the scalar $\eta$ to control the step size.
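
For example (our own sketch with the gradient computed by hand; the exercise should use your gradient function), one descent step on $f(x, y) = x^2 + y^2$, whose gradient is $(2x, 2y)$:

# one 2D gradient descent step from (3, 4) with η = 0.1
η = 0.1
x0, y0 = 3.0, 4.0
gx, gy = 2x0, 2y0                   # gradient at (x0, y0): (6.0, 8.0)
x1, y1 = x0 - η * gx, y0 - η * gy   # (2.4, 3.2), moving toward the minimum at (0, 0)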

👉 Write functions gradient_descent_2d_step(f, x0, y0) and gradient_descent_2d(f, x0, y0) that do the same for functions f(x,y) of two variables.

gradient_descent_2d_step (generic function with 1 method)
function gradient_descent_2d_step(f, x0, y0; η=0.01)
	return missing
end

gradient_descent_2d (generic function with 1 method)
function gradient_descent_2d(f, x0, y0; η=0.01)
	return missing
end
missing
gradient_descent_2d(himmelbau, 0, 0)
@bind N_gradient_2d Slider(0:20)

x0 = 0


y0 = 0

himmelbau (generic function with 1 method)
himmelbau(x, y) = (x^2 + y - 11)^2 + (x + y^2 - 7)^2

We also prepared a 3D visualisation if you like! It's a bit slow...

false
run_3d_visualisation = false
gradient_2d_viz_3d (generic function with 1 method)
gradient_2d_viz_2d (generic function with 1 method)

👉 Can you find different minima?



Exercise 5: Learning parameter values

In this exercise we will apply gradient descent to fit a simple function $y = f_{\alpha, \beta}(x)$ to some data given as pairs $(x_i, y_i)$. Here $\alpha$ and $\beta$ are parameters that appear in the form of the function $f$. We want to find the parameters that provide the best fit, i.e. the version $f_{\alpha, \beta}$ of the function that is closest to the data when we vary $\alpha$ and $\beta$.

To do so we need to define what "best" means. We will define a measure of the distance between the function and the data, given by a loss function, which itself depends on the values of $\alpha$ and $\beta$. Then we will minimize the loss function over $\alpha$ and $\beta$ to find those values that minimize this distance, and hence are "best" in this precise sense.

The iterative procedure by which we gradually adjust the parameter values to improve the loss function is often called machine learning or just learning, since the computer is "discovering" information in a gradual way, which is supposed to remind us of how humans learn. [Hint: This is not how humans learn.]

Exercise 5.1 - 🎲 frequencies

We generate a small dataset by throwing 10 dice, and counting the sum. We repeat this experiment many times, giving us a frequency distribution in a familiar shape.

import Statistics
dice_frequencies (generic function with 1 method)
dice_x, dice_y = dice_frequencies(10, 20_000)

Let's try to fit a Gaussian (normal) distribution. Its PDF with mean $\mu$ and standard deviation $\sigma$ is

$$f_{\mu, \sigma}(x) := \frac{1}{\sigma \sqrt{2\pi}} \exp \left[ - \frac{(x - \mu)^2}{2 \sigma^2} \right]$$

👉 (Not graded) Manually fit a Gaussian distribution to our data by adjusting μ and σ until you find a good fit.


μ = 24.0


σ = 12


Show manual fit:

gauss (generic function with 1 method)
function gauss(x, μ, σ)
	(1 / (sqrt(2π) * σ)) * exp(-(x - μ)^2 / σ^2 / 2)
end

What we just did was adjust the function parameters until we found the best possible fit. Let's automate this process! To do so, we need to quantify how good or bad a fit is.

👉 Define a loss function to measure the "distance" between the actual data and the function. It will depend on the values of $\mu$ and $\sigma$ that you choose:

$$\mathcal{L}(\mu, \sigma) := \sum_i \left[ f_{\mu, \sigma}(x_i) - y_i \right]^2$$
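
In Julia, a loss of this shape is nearly a one-liner. Here is the general pattern (a sketch with hypothetical names; xs, ys and model stand in for the notebook's data and fitted function):

# sum of squared residuals between a model curve and data points
sum_of_squares(model, xs, ys) = sum(abs2, model.(xs) .- ys)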

loss_dice (generic function with 1 method)
function loss_dice(μ, σ)
	return missing
end

missing
loss_dice(guess_μ + 3, guess_σ) >
	loss_dice(guess_μ, guess_σ)

👉 Use your gradient_descent_2d function to find a local minimum of $\mathcal{L}$, starting with initial values $\mu = 30$ and $\sigma = 1$. Call the found parameters found_μ and found_σ.

found_μ, found_σ = let
	# your code here
	missing, missing
end

Go back to the graph to see your optimized Gaussian curve!

If your fit is close, then probability theory tells us that the found parameter $\mu$ should be close to the weighted mean of our data, and $\sigma$ should approximate the sample standard deviation. We have already computed these values; let's check how close they are:
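
For reference, weighted statistics of a frequency distribution can be computed along these lines (our own sketch; it assumes the weights are relative frequencies that sum to 1):

# weighted mean and standard deviation of a frequency distribution
weighted_mean(xs, ws) = sum(xs .* ws)
weighted_std(xs, ws) = sqrt(sum(ws .* (xs .- weighted_mean(xs, ws)) .^ 2))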

missing
abs(stats_μ - found_μ)
missing
abs(stats_σ - found_σ)

Here we go!

Replace missing with your answer.

35.01925
5.398599766374598

Exercise 6: Putting it all together — fitting an SIR model to data

In this exercise we will fit the (non-spatial) SIR ODE model from Exercise 2 to some data generated from the spatial model in Problem Set 4. If we are able to find a good fit, that would suggest that the spatial aspect "does not matter" too much for the dynamics of these models. If the fit is not so good, perhaps there is an important effect of space. (As usual in statistics, and indeed in modelling in general, we should be very cautious of making claims of this nature.)

This fitting procedure will be different from that in Exercise 5, however: we no longer have an explicit form for the function that we are fitting – rather, it is the output of an ODE! So what should we do?

We will try to find the parameters $\beta$ and $\gamma$ for which the output of the ODEs, when we simulate them with those parameters, best matches the data!

Exercise 6.1

Below is the result from Homework 4, Exercise 3.2. These are the average S, I, R fractions from running 20 simulations. Click on it!

1:1000

👉 (Not graded) Manually fit the SIR curves to our data by adjusting β and γ until you find a good fit.


β = 0.05


γ = 0.005


Show manual fit:


👉 To do this automatically, we will again need to define a loss function $\mathcal{L}(\beta, \gamma)$. This will compare the solution of the SIR equations with parameters $\beta$ and $\gamma$ with your data.

This time, instead of comparing two vectors of numbers, we need to compare two vectors of vectors, the S, I, R values.
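
One way to compare two such series (a sketch with hypothetical names; solution and data each stand for a vector of 3-element [s, i, r] vectors, one per time point):

# sum of squared differences between two time series of SIR states
sir_distance(solution, data) = sum(
	sum(abs2, a .- b) for (a, b) in zip(solution, data)
)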

loss_sir (generic function with 1 method)
function loss_sir(β, γ)
	return missing
end

missing
loss_sir(guess_β, guess_γ)

👉 Use this loss function to find the optimal parameters β and γ.

found_β, found_γ = let
	# your code here
	missing, missing
end

Here we go!

Replace missing with your answer.


Before you submit

Remember to fill in your name and Kerberos ID at the top of this notebook.


Function library

Just some helper functions used in the notebook.

hint (generic function with 1 method)
almost (generic function with 1 method)
still_missing (generic function with 2 methods)
keep_working (generic function with 2 methods)
correct (generic function with 2 methods)
not_defined (generic function with 1 method)
TODO
as_mime (generic function with 1 method)
Error message

cannot assign a value to variable PlutoUI.as_svg from module workspace#3

"""
Return the argument, but force it to be shown as SVG.

This is an optimization for Plots.jl GR plots: it makes them less jittery and keeps the page DOM small.
"""
as_svg = as_mime(MIME"image/svg+xml"())