probly.train.evidential.torch.rpn_ng_kl

probly.train.evidential.torch.rpn_ng_kl(mu: Tensor, kappa: Tensor, alpha: Tensor, beta: Tensor, mu0: Tensor, kappa0: Tensor, alpha0: Tensor, beta0: Tensor) Tensor[source]

KL divergence between two Normal-Gamma distributions.

Computes the KL divergence between a predicted Normal-Gamma distribution and a prior Normal-Gamma distribution, as used in Regression Prior Networks to regularize out-of-distribution predictions.

Reference:

Malinin et al., “Regression Prior Networks”, NeurIPS 2020. https://arxiv.org/abs/2006.11590

Parameters:
  • mu – Predicted mean parameter, shape (B,).

  • kappa – Predicted scaling parameter, shape (B,).

  • alpha – Predicted shape parameter, shape (B,).

  • beta – Predicted scale parameter, shape (B,).

  • mu0 – Prior mean parameter, shape (B,).

  • kappa0 – Prior scaling parameter, shape (B,).

  • alpha0 – Prior shape parameter, shape (B,).

  • beta0 – Prior scale parameter, shape (B,).

Returns:

Scalar KL divergence between predicted and prior Normal-Gamma distributions, averaged over the batch.