Compare commits

..

12 Commits

Author SHA1 Message Date
Benedikt Ehinger
7d9860365e Merge pull request #22 from ranocha/patch-1
Possible solution of the Julia optimization task
2023-10-20 09:52:12 +02:00
behinger (s-ccs 001)
37f4033fc4 fix path to docs/make 2023-10-13 13:33:36 +00:00
Hendrik Ranocha
66af620ebc remove outdated version 2023-10-13 11:24:51 +02:00
Hendrik Ranocha
66113a8ac3 link to possible solution 2023-10-13 11:23:27 +02:00
Hendrik Ranocha
f8019c4c03 Possible solution of the Julia optimization task 2023-10-13 11:21:21 +02:00
behinger (s-ccs 001)
9bc8f11223 added revise to install instructions 2023-10-13 07:13:10 +00:00
Benedikt Ehinger
e07f9cc155 Merge pull request #20 from ranocha/hr/optimizing_julia
add draft of material for Friday
2023-10-13 08:43:55 +02:00
Hendrik Ranocha
c5a4be65a0 add links in jl file 2023-10-13 08:27:08 +02:00
Hendrik Ranocha
f509315eba add links to other files 2023-10-13 08:19:24 +02:00
Marco Oesting
9ec20e370a Correct typo in project description 2023-10-12 14:27:33 +02:00
behinger (s-ccs 001)
055e4d1f93 add to sidebar 2023-10-12 08:27:09 +00:00
Hendrik Ranocha
108974595f add draft of material for Friday 2023-10-11 16:16:40 +02:00
9 changed files with 2188 additions and 3 deletions

View File

@@ -90,7 +90,7 @@ website:
text: "📝 3 - Parallelization"
- section: "Friday"
contents:
- href: material/5_fri/highlightsOptim/missing.qmd
- href: material/5_fri/optimizing_julia/optimizing_julia.md
text: "📝 1 - Highlights + Optimization"
navbar:

View File

@@ -11,6 +11,15 @@ AppStore -> JuliaUp, or `winget install julia -s msstore` in CMD
### Mac & Linux
`curl -fsSL https://install.julialang.org | sh` in any shell
# Revise.jl
There is a Julia package, `Revise.jl`, that everyone should install. To do so, open a Julia REPL (i.e. the command line) and execute the following lines:
```julia
using Pkg
Pkg.add("Revise")
```
And that's it. Revise automatically watches the active packages and updates their functions when it detects code changes, similar to `autoreload 2` in IPython.
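Optionally, a common setup (not required for the course) is to load Revise automatically in every session via the Julia startup file:

```julia
# ~/.julia/config/startup.jl
try
    using Revise
catch e
    @warn "Error initializing Revise" exception=(e, catch_backtrace())
end
```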
# VSCode
To install VSCode (the recommended IDE), go to [this link](https://code.visualstudio.com/download) and download + install the correct package.

View File

@@ -50,7 +50,7 @@ makedocs(
### How to generate
- `julia --project=docs/ docs/make.jl`, or
- `]activate docs/; include("make.jl")`, or
- `]activate docs/; include("docs/make.jl")`, or
- `LiveServer.jl` + `deploydocs()`
### How to write

File diff suppressed because it is too large

View File

@@ -0,0 +1,8 @@
[deps]
BenchmarkTools = "6e4b80f9-dd63-53aa-95a3-0cdb28fa8baf"
ForwardDiff = "f6369f11-7733-5829-9624-2563aa707210"
HDF5 = "f67ccb44-e63f-5c2f-98bd-6dc0ccc4ba2f"
LaTeXStrings = "b964fa9f-0449-5b57-a5c2-d3ea65f4040f"
Literate = "98b081ad-f1c9-55d3-8b20-4c87d4299306"
Plots = "91a5bcdd-55d7-5caf-9e0b-520d859cae80"
SimpleNonlinearSolve = "727e6d20-b764-4bd8-a329-72de5adea6c7"

View File

@@ -0,0 +1,254 @@
# # Optimizing Julia code
# This session provides an introduction to optimizing Julia code.
# The examples are developed with Julia v1.9.3. You can download
# all files from the summer school website:
#
# - [`optimizing_julia.jl`](optimizing_julia.jl)
# - [`Project.toml`](Project.toml)
# - [`Manifest.toml`](Manifest.toml)
#
# This website renders the content of
# [`optimizing_julia.jl`](optimizing_julia.jl).
# First, we install all required packages
import Pkg
Pkg.activate(@__DIR__)
Pkg.instantiate()
# The markdown file is created from the source code using
# [Literate.jl](https://github.com/fredrikekre/Literate.jl).
# You can create the markdown file via
using Literate
Literate.markdown(joinpath(@__DIR__, "optimizing_julia.jl");
flavor = Literate.CommonMarkFlavor())
# ## Basics
#
# The Julia manual contains basic information about performance.
# In particular, you should be familiar with
#
# - https://docs.julialang.org/en/v1/manual/performance-tips/
# - https://docs.julialang.org/en/v1/manual/style-guide/
#
# if you care about performance.
# ## Example: A linear advection simulation
#
# This is a classical partial differential equation (PDE) simulation setup.
# If you are not familiar with it, just ignore what's going on - but we need
# an example. This one is similar to several problematic cases I have seen
# in practice.
# First, we generate initial data and store it in a file.
using HDF5
x = range(-1.0, 1.0, length = 1000)
dx = step(x)
h5open("initial_data.h5", "w") do io
u0 = sinpi.(x)
write_dataset(io, "x", collect(x))
write_dataset(io, "u0", u0)
end
# Next, we write our simulation code as a function - as we have learned from
# the performance tips!
function simulate()
x, u0 = h5open("initial_data.h5", "r") do io
read_dataset(io, "x"), read_dataset(io, "u0")
end
t = 0
u = u0
while t < t_end
u = u + dt * rhs(u)
t = t + dt
end
return t, x, u, u0
end
function rhs(u)
du = similar(u)
for i in eachindex(u)
im1 = i == firstindex(u) ? lastindex(u) : (i - 1)
du[i] = -(u[i] - u[im1]) / dx
end
return du
end
# Now, we can define our parameters, run a simulation,
# and plot the results.
using Plots, LaTeXStrings
t_end = 2.5
dt = 0.9 * dx
t, x, u, u0 = simulate()
plot(x, u0; label = L"u_0", xguide = L"x")
plot!(x, u; label = L"u")
# Maybe you can already spot problems in the code above.
# Anyway, before you begin optimizing some code, you should
# test it and make sure it's correct. Let's assume this is the
# case here. You should write tests making sure that the code
# continues to work as expected while we optimize it. We will
# not do this here right now.
# ## Profiling
#
# First, we need to measure the performance of our code to
# see whether it's already reasonable. For this, you can use
# `@time` - or better BenchmarkTools.jl.
using BenchmarkTools
@benchmark simulate()
# This shows quite a lot of allocations - typically a bad sign
# if you are working with numerical simulations.
# There are also profiling tools available in Julia.
# Some of them are already loaded in the Visual Studio Code
# extension for Julia. If you prefer working in the REPL,
# you can install ProfileView.jl (`@profview simulate()`) and
# PProf.jl (`pprof()` after creating a profile via `@profview`).
@profview simulate()
# Task: Optimize the code!
#
# You can find one improved version in the file
# [`optimizing_julia_solution.jl`](optimizing_julia_solution.jl).
# ## Introduction to generic Julia code and AD
#
# One of the strengths of Julia is that you can use quite a few
# modern tools like AD. However, you need to learn Julia a bit
# to use everything efficiently.
# Here, we concentrate on ForwardDiff.jl.
using ForwardDiff
using Statistics
function foo0(x)
vector = zeros(typeof(x), 100)
foo0!(vector, x)
end
function foo0!(vector, scalar)
for i in eachindex(vector)
vector[i] = atan(i * scalar) / π
end
for _ in 1:1000
for i in eachindex(vector)
vector[i] = cos(vector[i])
end
end
return mean(vector)
end
let x = 2 * π
@show foo0(x)
@show ForwardDiff.derivative(foo0, x)
@benchmark foo0($x)
end
# The code above is basically a fixed point iteration.
# By doing some analysis, one could figure out that it
# should converge to the solution `x` of `x == cos(x)`,
# the "Dottie number". See https://en.wikipedia.org/wiki/Dottie_number
using SimpleNonlinearSolve
function dottie_number()
f(u, p) = cos(u) - u
bounds = (0.0, 2.0)
prob = IntervalNonlinearProblem(f, bounds)
sol = solve(prob, ITP())
return sol.u
end
dottie_number()
# Next, we introduce a custom struct. Can you spot the problems?
struct MyData1
scalar
vector
end
function foo1(x)
data = MyData1(x, zeros(typeof(x), 100))
foo1!(data)
end
function foo1!(data)
(; scalar, vector) = data
for i in eachindex(vector)
vector[i] = atan(i * scalar) / π
end
for _ in 1:1000
for i in eachindex(vector)
vector[i] = cos(vector[i])
end
end
return mean(vector)
end
let x = 2 * π
@show foo1(x)
@show ForwardDiff.derivative(foo1, x)
@benchmark foo1($x)
end
# We can fix type instabilities by introducing types explicitly.
# But we lose the possibility to use AD this way!
struct MyData2
scalar::Float64
vector::Vector{Float64}
end
function foo2(x)
data = MyData2(x, zeros(typeof(x), 100))
foo1!(data)
end
let x = 2 * π
@show foo2(x)
@benchmark foo2($x)
end
let x = 2 * π
ForwardDiff.derivative(foo2, x)
end
# Thus, we need parametric types!
struct MyData3{T}
scalar::T
vector::Vector{T}
end
function foo3(x)
data = MyData3(x, zeros(typeof(x), 100))
foo1!(data)
end
let x = 2 * π
@show foo3(x)
@show ForwardDiff.derivative(foo3, x)
@benchmark foo3($x)
end

View File

@@ -0,0 +1,279 @@
# Optimizing Julia code
This session provides an introduction to optimizing Julia code.
The examples are developed with Julia v1.9.3. You can download
all files from the summer school website:
- [`optimizing_julia.jl`](optimizing_julia.jl)
- [`Project.toml`](Project.toml)
- [`Manifest.toml`](Manifest.toml)
This website renders the content of
[`optimizing_julia.jl`](optimizing_julia.jl).
First, we install all required packages
````julia
import Pkg
Pkg.activate(@__DIR__)
Pkg.instantiate()
````
The markdown file is created from the source code using
[Literate.jl](https://github.com/fredrikekre/Literate.jl).
You can create the markdown file via
````julia
using Literate
Literate.markdown(joinpath(@__DIR__, "optimizing_julia.jl");
flavor = Literate.CommonMarkFlavor())
````
## Basics
The Julia manual contains basic information about performance.
In particular, you should be familiar with
- https://docs.julialang.org/en/v1/manual/performance-tips/
- https://docs.julialang.org/en/v1/manual/style-guide/
if you care about performance.
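As a minimal illustration of the most important tip (avoid untyped global variables; pass data as function arguments instead), consider this sketch:

````julia
x_global = rand(1000)       # untyped global: its type could change at any time

function sum_global()       # reads the untyped global -> poor inference, slow
    s = 0.0
    for v in x_global
        s += v
    end
    return s
end

function sum_arg(x)         # takes the data as an argument -> fully inferable
    s = 0.0
    for v in x
        s += v
    end
    return s
end

sum_global() ≈ sum_arg(x_global)   # same result; benchmark both to see the gap
````

Benchmarking both versions (e.g. with `@btime`) typically shows an order-of-magnitude difference.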
## Example: A linear advection simulation
This is a classical partial differential equation (PDE) simulation setup.
If you are not familiar with it, just ignore what's going on - but we need
an example. This one is similar to several problematic cases I have seen
in practice.
First, we generate initial data and store it in a file.
````julia
using HDF5
x = range(-1.0, 1.0, length = 1000)
dx = step(x)
h5open("initial_data.h5", "w") do io
u0 = sinpi.(x)
write_dataset(io, "x", collect(x))
write_dataset(io, "u0", u0)
end
````
Next, we write our simulation code as a function - as we have learned from
the performance tips!
````julia
function simulate()
x, u0 = h5open("initial_data.h5", "r") do io
read_dataset(io, "x"), read_dataset(io, "u0")
end
t = 0
u = u0
while t < t_end
u = u + dt * rhs(u)
t = t + dt
end
return t, x, u, u0
end
function rhs(u)
du = similar(u)
for i in eachindex(u)
im1 = i == firstindex(u) ? lastindex(u) : (i - 1)
du[i] = -(u[i] - u[im1]) / dx
end
return du
end
````
Now, we can define our parameters, run a simulation,
and plot the results.
````julia
using Plots, LaTeXStrings
t_end = 2.5
dt = 0.9 * dx
t, x, u, u0 = simulate()
plot(x, u0; label = L"u_0", xguide = L"x")
plot!(x, u; label = L"u")
````
Maybe you can already spot problems in the code above.
Anyway, before you begin optimizing some code, you should
test it and make sure it's correct. Let's assume this is the
case here. You should write tests making sure that the code
continues to work as expected while we optimize it. We will
not do this here right now.
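A minimal sketch of such a test using the `Test` standard library; here the scheme from `rhs` is copied into a standalone function (taking `dx` as an argument) purely for testability:

````julia
using Test

# periodic first-order upwind difference, as in `rhs` above
function rhs_testable(u, dx)
    du = similar(u)
    for i in eachindex(u)
        im1 = i == firstindex(u) ? lastindex(u) : (i - 1)
        du[i] = -(u[i] - u[im1]) / dx
    end
    return du
end

@testset "rhs properties" begin
    # a constant state is steady: the derivative is zero everywhere
    @test all(iszero, rhs_testable(fill(1.0, 10), 0.1))
    # the scheme is linear: rhs(2u) == 2 rhs(u)
    u = rand(10)
    @test rhs_testable(2 .* u, 0.1) ≈ 2 .* rhs_testable(u, 0.1)
end
````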
## Profiling
First, we need to measure the performance of our code to
see whether it's already reasonable. For this, you can use
`@time` - or better BenchmarkTools.jl.
````julia
using BenchmarkTools
@benchmark simulate()
````
This shows quite a lot of allocations - typically a bad sign
if you are working with numerical simulations.
There are also profiling tools available in Julia.
Some of them are already loaded in the Visual Studio Code
extension for Julia. If you prefer working in the REPL,
you can install ProfileView.jl (`@profview simulate()`) and
PProf.jl (`pprof()` after creating a profile via `@profview`).
````julia
@profview simulate()
````
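If neither the VS Code extension nor ProfileView.jl is available, the `Profile` standard library works in any session (a sketch with a stand-in workload; use `simulate()` in practice):

````julia
using Profile

workload() = sum(sum(rand(10_000)) for _ in 1:5_000)

workload()                     # compile first so we profile the actual work
Profile.clear()
@profile workload()
Profile.print(maxdepth = 8)    # flat text report in the REPL
````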
Task: Optimize the code!
You can find one improved version in the file
[`optimizing_julia_solution.jl`](optimizing_julia_solution.jl).
## Introduction to generic Julia code and AD
One of the strengths of Julia is that you can use quite a few
modern tools like AD. However, you need to learn Julia a bit
to use everything efficiently.
Here, we concentrate on ForwardDiff.jl.
````julia
using ForwardDiff
using Statistics
function foo0(x)
vector = zeros(typeof(x), 100)
foo0!(vector, x)
end
function foo0!(vector, scalar)
for i in eachindex(vector)
vector[i] = atan(i * scalar) / π
end
for _ in 1:1000
for i in eachindex(vector)
vector[i] = cos(vector[i])
end
end
return mean(vector)
end
let x = 2 * π
@show foo0(x)
@show ForwardDiff.derivative(foo0, x)
@benchmark foo0($x)
end
````
The code above is basically a fixed point iteration.
By doing some analysis, one could figure out that it
should converge to the solution `x` of `x == cos(x)`,
the "Dottie number". See https://en.wikipedia.org/wiki/Dottie_number
````julia
using SimpleNonlinearSolve
function dottie_number()
f(u, p) = cos(u) - u
bounds = (0.0, 2.0)
prob = IntervalNonlinearProblem(f, bounds)
sol = solve(prob, ITP())
return sol.u
end
dottie_number()
````
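As a quick cross-check (a sketch, not part of the original material), the naive fixed-point iteration converges to the same value, since `|sin(x)| < 1` near the fixed point:

````julia
function dottie_naive(; iterations = 1000)
    x = 1.0
    for _ in 1:iterations
        x = cos(x)             # fixed-point iteration x_{k+1} = cos(x_k)
    end
    return x
end

dottie_naive()                 # ≈ 0.7390851332151607
````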
Next, we introduce a custom struct. Can you spot the problems?
````julia
struct MyData1
scalar
vector
end
function foo1(x)
data = MyData1(x, zeros(typeof(x), 100))
foo1!(data)
end
function foo1!(data)
(; scalar, vector) = data
for i in eachindex(vector)
vector[i] = atan(i * scalar) / π
end
for _ in 1:1000
for i in eachindex(vector)
vector[i] = cos(vector[i])
end
end
return mean(vector)
end
let x = 2 * π
@show foo1(x)
@show ForwardDiff.derivative(foo1, x)
@benchmark foo1($x)
end
````
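You can confirm the problem without benchmarking: untyped fields default to `Any`, so accesses to them cannot be inferred. A quick check (using a copy of the struct, since structs cannot be redefined in a running session):

````julia
struct MyData1Demo             # same layout as MyData1 above
    scalar
    vector
end

fieldtype(MyData1Demo, :scalar) === Any   # true: the compiler knows nothing here
````

`@code_warntype foo1(1.0)` highlights the resulting `Any`s in red in the REPL.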
We can fix type instabilities by introducing types explicitly.
But we lose the possibility to use AD this way!
````julia
struct MyData2
scalar::Float64
vector::Vector{Float64}
end
function foo2(x)
data = MyData2(x, zeros(typeof(x), 100))
foo1!(data)
end
let x = 2 * π
@show foo2(x)
@benchmark foo2($x)
end
let x = 2 * π
ForwardDiff.derivative(foo2, x)
end
````
Thus, we need parametric types!
````julia
struct MyData3{T}
scalar::T
vector::Vector{T}
end
function foo3(x)
data = MyData3(x, zeros(typeof(x), 100))
foo1!(data)
end
let x = 2 * π
@show foo3(x)
@show ForwardDiff.derivative(foo3, x)
@benchmark foo3($x)
end
````
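With the parametric version, the compiler infers a fully concrete type from the constructor arguments while the code stays generic (checked here on a copy of the struct):

````julia
struct MyData3Demo{T}
    scalar::T
    vector::Vector{T}
end

d = MyData3Demo(1.0, zeros(100))
typeof(d) === MyData3Demo{Float64}   # true: concrete, yet `T` could also be a Dual
````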
---
*This page was generated using [Literate.jl](https://github.com/fredrikekre/Literate.jl).*

View File

@@ -0,0 +1,86 @@
# # Optimizing Julia code: possible solution
# First, we install all required packages
import Pkg
Pkg.activate(@__DIR__)
Pkg.instantiate()
# The markdown file is created from the source code using
# [Literate.jl](https://github.com/fredrikekre/Literate.jl).
# You can create the markdown file via
using Literate
Literate.markdown(joinpath(@__DIR__, "optimizing_julia_solution.jl");
flavor = Literate.CommonMarkFlavor())
# First, we generate initial data and store it in a file.
using HDF5
x = range(-1.0, 1.0, length = 1000)
dx = step(x)
h5open("initial_data.h5", "w") do io
u0 = sinpi.(x)
write_dataset(io, "x", collect(x))
write_dataset(io, "u0", u0)
end
function simulate(; kwargs...)
x, u0 = h5open("initial_data.h5", "r") do io
read_dataset(io, "x"), read_dataset(io, "u0")
end
u = copy(u0)
t = simulate!(u; kwargs...)
return t, x, u, u0
end
function simulate!(u; t_end, dt, dx)
du = similar(u)
t = zero(t_end)
while t < t_end
rhs!(du, u, dx)
# u = u + dt * du
# u .= u .+ dt .* du
@. u = u + dt * du
t = t + dt
end
return t
end
function rhs!(du, u, dx)
inv_dx = inv(dx)
let i = firstindex(u)
im1 = lastindex(u)
# du[i] = -(u[i] - u[im1]) / dx
du[i] = -inv_dx * (u[i] - u[im1])
end
for i in (firstindex(u) + 1):lastindex(u)
im1 = i - 1
# du[i] = -(u[i] - u[im1]) / dx
du[i] = -inv_dx * (u[i] - u[im1])
end
return nothing
end
# Now, we can define our parameters, run a simulation,
# and plot the results.
using Plots, LaTeXStrings
t_end = 2.5
dt = 0.9 * dx
t, x, u, u0 = simulate(; t_end, dt, dx)
plot(x, u0; label = L"u_0", xguide = L"x")
plot!(x, u; label = L"u")
using BenchmarkTools
@benchmark simulate(; t_end, dt, dx)
let u = h5open("initial_data.h5", "r") do io
read_dataset(io, "u0")
end
@benchmark simulate!(u; t_end, dt, dx)
end
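# The commented-out variants in `simulate!` above differ in allocation
# behavior: `u = u + dt * du` creates temporary arrays every time step,
# while the fused broadcast `@. u = u + dt * du` writes into `u` in place.
# A minimal sketch of the difference (hypothetical helper names, not part
# of the original material):
update_alloc(u, du, dt) = u + dt * du                  # allocates temporaries
update_inplace!(u, du, dt) = (@. u = u + dt * du; u)   # writes in place
let u = rand(1000), du = rand(1000), dt = 0.1
    expected = update_alloc(u, du, dt)
    update_inplace!(u, du, dt)
    u ≈ expected                                       # same result
end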

View File

@@ -16,7 +16,8 @@ Before starting, you might look at a data example. To this end, download [Tempe
``` julia
using JLD2 @load "TemperatureData.jld2" \## potentially correct your path
using JLD2
@load "TemperatureData.jld2" ## potentially correct your path
```
This will give you data frames with information on weather stations in the Netherlands, summer temperature means (in $0.1^\circ\mathrm{C}$) and summer temperature maxima (in $0.1^\circ\mathrm{C}$). It is reasonable to assume that the temperature data are temporally independent. Furthermore, assuming a normal distribution at each station is quite common for this type of data.
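The per-station normal assumption can be sketched directly: for a normal model, the maximum-likelihood estimates are the sample mean and the (uncorrected) sample standard deviation. The data below are synthetic stand-ins, not the actual temperature data:

```julia
using Statistics

temps = 180 .+ 15 .* randn(1000)        # synthetic summer means, in 0.1 °C
μ̂ = mean(temps)                         # ML estimate of the location
σ̂ = std(temps, corrected = false)       # ML estimate of the scale
(μ̂, σ̂)                                  # should be close to (180, 15)
```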