Details

The solution to the mixed model equations is a maximum likelihood estimate when the distribution of the errors is normal. Maximum likelihood estimates are based on the probability model for the observed responses. In the probability model the distribution of the responses is expressed as a function of one or more parameters. PROC MIXED in SAS used restricted maximum likelihood (REML) approach by default. REML equation can be described with following (Henderson, 1959;Laird et.al. 1982; Jennrich 1986; Lindstrom & Bates, 1988; Gurka et.al 2006).

Metida.jl using optimization with Optim.jl package (Newton's Method) by default. Because variance have only positive values and ρ is limited as -1 ≤ ρ ≤ 1 in Metida.jl "link" function is used. Exponential values is optimizing in variance part and ρ is linked with sigmoid function. All steps perform with differentiable functions with forward automatic differentiation using ForwardDiff.jl package. Also MetidaNLopt.jl and MetidaCu.jl available for optimization with NLopt.jl and solving on CUDA GPU. Sweep algorithm using for variance-covariance matrix inversing in REML calculation.

Model

In matrix notation a mixed effect model can be represented as:

\[y = X\beta + Zu + \epsilon\]

where:

\[\epsilon \sim N(0, R) \\ u \sim N(0, G) \\ y \sim N(X\beta, V) \]

where V depends on covariance sructure and parameters $\theta$:

\[V = CovStruct(\theta) \]

The unknown parameters include the regression parameters in $\beta$ and covariance parameters in $\theta$.

Estimation of these model parameters relies on the use of a Newton-Ralphson (by default) algorithm. When we use either algorithm for finding REML solutions, we need to compute $V^{-1}$ and its derivatives with respect to $\theta$, which are computationally difficult for large $n$, therefor SWEEP (see https://github.com/joshday/SweepOperator.jl) algorithm used to meke oprtimization less computationaly expensive.

V

\[V_{i} = Z_{i}GZ_i'+R_{i}\]

Henderson's «mixed model equations»

\[\begin{pmatrix}X'R^{-1}X&X'R^{-1}Z\\Z'R^{-1}X&Z'R^{-1}Z+G_{-1}\end{pmatrix} \begin{pmatrix}\widehat{\beta} \\ \widehat{u} \end{pmatrix}= \begin{pmatrix}X'R^{-1}y\\Z'R^{-1}y\end{pmatrix}\]

REML

\[logREML(\theta,\beta) = -\frac{N-p}{2} - \frac{1}{2}\sum_{i=1}^nlog|V_{\theta, i}|- -\frac{1}{2}log|\sum_{i=1}^nX_i'V_{\theta, i}^{-1}X_i|-\frac{1}{2}\sum_{i=1}^n(y_i - X_{i}\beta)'V_{\theta, i}^{-1}(y_i - X_{i}\beta)\]

Actually $L(\theta) = -2logREML = L_1(\theta) + L_2(\theta) + L_3(\theta) + c$ used for optimization, where:

\[L_1(\theta) = \frac{1}{2}\sum_{i=1}^nlog|V_{i}| \\ L_2(\theta) = \frac{1}{2}log|\sum_{i=1}^nX_i'V_i^{-1}X_i| \\ L_3(\theta) = \frac{1}{2}\sum_{i=1}^n(y_i - X_{i}\beta)'V_i^{-1}(y_i - X_{i}\beta)\]

\[\nabla\mathcal{L}(\theta) = \nabla L_1(\theta) + \nabla L_2(\theta) + \nabla L_3(\theta)\]

\[\mathcal{H}\mathcal{L}(\theta) = \mathcal{H}L_1(\theta) + \mathcal{H}L_2(\theta) + \mathcal{H} L_3(\theta)\]

Weights

If weights defined:

\[ W_{i} = diag(wts_{i}) \\ V_{i} = Z_{i} G Z_i'+ W^{- \frac{1}{2}}_i R_{i} W^{- \frac{1}{2}}_i\]

where $W$ - diagonal matrix of weights.

If wts is matrix then:

\[ W_{i} = wts_{i} \\ V_{i} = Z_{i} G Z_i'+ R_{i} \circ W_{i}\]

where $\circ$ - element-wise multiplication.

Multiple random and repeated effects

If model include multiple effects ( with n random and m repeated effects) final V will be:

\[V_{i} = Z_{i, 1} G_{1} Z_{i, 1}' + ... + Z_{i, n} G_{1} Z_{i, n}'+ W^{- \frac{1}{2}}_i ( R_{i, 1} + ... + R_{i, m}) W^{- \frac{1}{2}}_i\]

Initial step

Initial (first) step before optimization may be done:

\[\theta_{n+1} = \theta_{n} - \nabla\mathcal{L}(\theta_{n}) * \mathcal{H}^{'}(\theta_{n}) , where \\ \mathcal{H}^{'}(\theta_{n}) = - \mathcal{H}L_1(\theta_{n}) + \mathcal{H} L_3(\theta) , if score \\ \mathcal{H}^{'}(\theta_{n}) = \mathcal{H} L_3(\theta_{n}) , if ai \]

\[ f^{-1}(x) = log(x)\]

where $k = 0.1$

Details

Model

V

Henderson's «mixed model equations»

REML

Weights

Multiple random and repeated effects

Initial step

Beta (β)

F

Variance covariance matrix of β

Sweep

Variance parameters link function

Variance (var) part

Exponential function (:exp)

Square function (:sq)

Identity function (:identity)

Covariance (rho) part

Sigmoid function (:sigm)

Arctangent function (:atan)

"Square" sigmoid function (:sqsigm)

Positive sigmoid function (:psigm)

Additional parameters (theta) part