2 Test with simulated datasets

2.1 Checking the accuracy of numerical quadrature

x <- rt.scaled(50, mean=2, sd = 2, df=2)
log_marlik_mid(x,0,10,0,10,100)

## [1] -171.4239

log_marlik_mc(x,0,10,0,10,100000)

## [1] -171.4012

Another test

x <- rt.scaled(100, mean=2, sd = 2, df=Inf)
log_marlik_mid(x,0,10,0,10,100)

## [1] -219.7842

log_marlik_mc(x,0,10,0,10,100000)

## [1] -219.8334

## looking at the convergence

for(i in seq(10,90,by=10))
{ cat("n = ",i,",")
    cat(" Estimated Log Marginal Likelihood =", 
        log_marlik_mid(x,0,10,0,10,i),"\n")
}

## n =  10 , Estimated Log Marginal Likelihood = -221.0623 
## n =  20 , Estimated Log Marginal Likelihood = -219.781 
## n =  30 , Estimated Log Marginal Likelihood = -219.8175 
## n =  40 , Estimated Log Marginal Likelihood = -219.783 
## n =  50 , Estimated Log Marginal Likelihood = -219.7842 
## n =  60 , Estimated Log Marginal Likelihood = -219.7842 
## n =  70 , Estimated Log Marginal Likelihood = -219.7842 
## n =  80 , Estimated Log Marginal Likelihood = -219.7842 
## n =  90 , Estimated Log Marginal Likelihood = -219.7842

2.2 Comparing log marginal likelihoods of different models

2.2.1 Comparing Priors

x <- rnorm(100)

When the mean of the prior is reasonable

log_marlik_mid(x,mu_0=0,sigma_mu=0.1,w_0=0,sigma_w=1,100)

## [1] -154.532

log_marlik_mid(x,mu_0=0,sigma_mu=0.01,w_0=0,sigma_w=1,100)

## [1] -155.5615

log_marlik_mid(x,mu_0=0,sigma_mu=1,w_0=0,sigma_w=1,100)

## [1] -155.8382

log_marlik_mid(x,mu_0=0,sigma_mu=10,w_0=0,sigma_w=1,100)

## [1] -158.1214

log_marlik_mid(x,mu_0=0,sigma_mu=100,w_0=0,sigma_w=1,100)

## [1] -160.4237

log_marlik_mid(x,mu_0=0,sigma_mu=1000,w_0=0,sigma_w=1,100)

## [1] -162.7263

When the mean of the prior is unreasonable

log_marlik_mid(x,mu_0=-5,sigma_mu=0.1,w_0=0,sigma_w=1,100)

## [1] -313.8361

log_marlik_mid(x,mu_0=-5,sigma_mu=1,w_0=0,sigma_w=1,100)

## [1] -167.369

log_marlik_mid(x,mu_0=-5,sigma_mu=10,w_0=0,sigma_w=1,100)

## [1] -158.2381

log_marlik_mid(x,mu_0=-5,sigma_mu=100,w_0=0,sigma_w=1,100)

## [1] -160.4249

log_marlik_mid(x,mu_0=-5,sigma_mu=1000,w_0=0,sigma_w=1,100)

## [1] -162.7263

2.2.2 Comparing Models for Data

Data from Normal

x <- rt.scaled(100, mean=2, sd = 2, df=Inf)
log_marlik_mid(x,0,10,0,10,100, df = Inf)

## [1] -216.0858

log_marlik_mid(x,0,10,0,10,100, df = 2)

## [1] -221.8367

log_marlik_mid(x,0,10,0,10,100, df = 1)

## [1] -230.7908

log_marlik_mid(x,0,10,0,10,100, df = 0.5)

## [1] -249.0301

log_marlik_mid(x,0,0.1,0,10,100, df = Inf) # if prior is too narrow

## [1] -247.8747

log_marlik_mid(x,0,100,0,100,100, df = Inf) # if prior is too diffuse

## [1] -220.667

log_marlik_mid(x,0,1000,0,1000,100, df = Inf) # if prior is too diffuse

## [1] -225.2719

Data from t

x <- rt.scaled(100, mean=2, sd = 2, df=2)
log_marlik_mid(x,0,10,0,10,100, df = Inf)

## [1] -321.6533

log_marlik_mid(x,0,10,0,10,100, df = 2)

## [1] -280.5134

log_marlik_mid(x,0,0.1,0,10,100, df =2) # if prior is too narrow

## [1] -295.8246

log_marlik_mid(x,0,100,0,100,100, df = 2) # if prior is too diffuse

## [1] -285.0959

log_marlik_mid(x,0,1000,0,1000,100, df = 2) # if prior is too diffuse

## [1] -289.7009

We see that although the prior impacts marginal likelihood, the error in mis-specification in data model can be still detected.

2.2.3 A Case when numerical quadrature fails

x <- rt.scaled(100, mean=50, sd = 2, df=Inf)
log_marlik_mid(x,0,100,0,100,100, df = Inf)

## [1] -535.1336

log_marlik_mc(x,0,100,0,100,10000, df = Inf)

## [1] -369.4429

What has gone wrong? The inverse-logistic transformation maps most points between (0,1) to the region around 0.5. But the likelihood function has its mode around 50!

STAT 812: Computational Statistics

Midpoint Rule Approximating Marginal Likelihood of Gaussian

Longhai Li

March 2020