Update Exercises 2 and 3 (Multiple Regression).

2023-10-09 17:48:50 +02:00
parent b265c7afe0
commit a9b065ab5f
1 changed files with 19 additions and 9 deletions
--- a/material/3_wed/regression/MultipleRegressionBasics.qmd
+++ b/material/3_wed/regression/MultipleRegressionBasics.qmd
@@ -100,9 +100,10 @@ normal distribution with mean 0.0 and standard deviation 2.0, and sample
 ## Task 1
 1.  Generate $n=20$ covariates $\mathbf{x}$ randomly.
-2.  Given these covariates and true parameters $\beta_0=-3$, $\beta_1=2$
+2.  Given these covariates and the true parameters $\beta_0=-3$,
-    and $\sigma=0.5$, simulate responses from a linear model and
+    $\beta_1=2$ and $\sigma=0.5$, simulate responses from a linear model
-    estimate the coefficients $\beta_0$ and $\beta_1$.
+    (with normally distributed errors) and estimate the coefficients
    $\beta_0$ and $\beta_1$.
 3.  Play with different choices of the parameters above (including the
    sample size $n$) to see the effects on the parameter estimates and
    the $p$-values.
@@ -189,10 +190,18 @@ regression model, but we provide explicit formulas now:
    p\text{-value} = \mathbb{P}(|T| > t_i), \quad \text{where } T \sim t_{n-p}
    $$
-**Task 2**: Implement functions that estimate the $\beta$-parameters,
+::: {.callout-caution collapse="false"}
-the corresponding standard errors and the $t$-statistics. Test your
+
-functions with the \`\`\`tree''' data set and try to reproduce the
+## Task 2
 1. Implement functions that estimate the $\beta$-parameters,
 the corresponding standard errors and the $t$-statistics. 
 2. Test your functions with the `tree' data set and try to reproduce the
 output above.
 :::
 Which model is the best? For linear models, one often uses the $R^2$ characteristic.
 Roughly speaking, it gives the percentage (between 0 and 1) of the variance that can be explained by the linear model.
 ``` julia
 r2(linmod1)
@@ -305,10 +314,11 @@ model = glm(@formula(participation ~ age^2),
            SwissLabor, Binomial(), ProbitLink())
 ```
-::: callout-task
+::: {.callout-caution collapse="false"}
-**Task 3**:
+
 ##Task 3:
 1.  Reproduce the results of our data analysis of the `tree` data set
    using a generalized linear model with normal distribution family.
-2.  Generate
+2.  Generate $n=20$ random covariates $\mathbf{x}$ and Poisson-distributed counting      data with parameters $\beta_0 + \beta_1 x_i$. Re-estimate the parameters by a generalized linear model. 
 :::