Analytical Approaches for Assessment of Publication Bias in Meta-analysis
In the last tutorial we talked about one of the most common visual tools for assessing whether publication bias might be present: funnel plots (Jennions et al., 2013; Nakagawa et al., 2022a; Rothstein et al., 2005). However, how do we know for sure whether publication bias is actually going to be a problem? Do we have less subjective tools that can both quantitatively test and correct for the possibility of publication bias? The short answer is, we do! The caveat is that these still do not provide definitive proof of publication bias, and we must keep that in mind. Any results from publication bias tests should be interpreted with caution and viewed as sensitivity analyses (Koricheva et al., 2013; Noble et al., 2017), because we will never truly know how many missing effect sizes actually exist.
In this tutorial, we’ll walk through three complementary analytical tools that together provide a more defensible picture of whether publication bias is shaping our conclusions:
Multilevel Egger-type regression – including the sampling error as a moderator in a multilevel meta-regression to test for small-study effects and obtain an “adjusted” meta-analytic mean (Nakagawa et al., 2022a; Stanley and Doucouliagos, 2014).
Time-lag bias tests – including publication year as a moderator to test whether effect sizes are shrinking over time, one form of the “decline effect” (Nakagawa et al., 2022a).
Bias-robust weighting with cluster-robust variance estimation – a recent two-step framework from Yang et al. (2024) that simultaneously tackles selective reporting and non-independence, which Yang et al. (2024) show can jointly inflate effect sizes by over 100% and deflate standard errors by over 120% when ignored.
No single test is definitive, and each rests on assumptions that can fail in ecological data. The point is to triangulate: if several complementary approaches tell the same story, we can have more confidence in our conclusions.
Statistically Testing and Correcting for Publication Bias
Introduction
We’re now going to expand upon the meta-analysis by Arnold et al. (2021) to explore more rigorously whether publication bias is present and, if so, try to re-interpret effects as if such bias did not exist.
Download the Data
```r
# Packages
pacman::p_load(tidyverse, metafor, orchaRd)

# Download the data
arnold_data <- read.csv("https://raw.githubusercontent.com/pieterarnold/fitness-rmr-meta/main/MR_Fitness_Data_revised.csv")

# Exclude NA's in sample size and r
arnold_data <- arnold_data[complete.cases(arnold_data$n.rep) &
                           complete.cases(arnold_data$r), ]

# Calculate the effect size, ZCOR (Fisher's Zr and its sampling variance)
arnold_data <- metafor::escalc(measure = "ZCOR", ri = r, ni = n.rep,
                               data = arnold_data,
                               var.names = c("Zr", "Zr_v"))

# Let's subset to endotherms for demonstration purposes
arnold_data_endo <- arnold_data %>%
  mutate(endos = ifelse(Class %in% c("Mammalia", "Aves"), "endo", "ecto")) %>%
  filter(endos == "endo" & Zr <= 3)  # one extreme outlier removed, as in the paper

# Add in an observation-level (residual) identifier
arnold_data_endo$obs <- 1:nrow(arnold_data_endo)
```
(1) Meta-regression Approaches to Publication Bias
Nakagawa et al. (2022a; building off of work by Stanley and Doucouliagos, 2012; Stanley and Doucouliagos, 2014) outline a method for testing and accounting for publication bias using multilevel meta-regression approaches. Here, we include the sampling standard error (SE), or the sampling variance (V), directly as a moderator in our meta-regression model, and can then explicitly test whether SE or V significantly explains variation in our effect sizes. To be more explicit, we can adjust our multilevel meta-regression model as follows:

\[
\begin{aligned}
y_{i} &= \mu + \beta_{se}se + s_{j[i]} + spp_{k[i]} + e_{i} + m_{i} \\
m_{i} &\sim N(0, v_{i}) \\
s_{j} &\sim N(0, \tau^2) \\
spp_{k} &\sim N(0, \sigma_{k}^2) \\
e_{i} &\sim N(0, \sigma_{e}^2)
\end{aligned}
\]
Of course, we could also include a suite of other moderators in our analysis to also account for more heterogeneity like we have shown in past tutorials:
\[
y_{i} = \mu + \sum_{i = 1}^{N_{m}}\beta_{m}x_{m} + \beta_{se}se + s_{j[i]} + spp_{k[i]} + e_{i} + m_{i} \\
\]

Here, \(\beta_{m}\) is the slope (or contrast) for the \(m^{th}\) moderator (where m = 1, …, \(N_{moderator}\) or \(N_{m}\)). Of course, \(\beta_{se}se\) could be folded into this summation term, but for clarity we separate it out to emphasise that SE or V is included as a moderator in addition to the other moderators. Stanley and Doucouliagos (2012) and Stanley and Doucouliagos (2014) have shown that the intercept of a model estimating \(\beta_{se}\) (in this case, \(\mu\)) provides the best estimate of an adjusted meta-analytic mean that accounts for the possibility of publication bias (Nakagawa et al., 2022a; see the correction to the original paper in Nakagawa et al., 2022b). However, they have also shown that when \(\mu\) in the SE model is significant, SE is best replaced with the sampling variance (i.e., \(SE^2\)), because the intercept estimated using SE is downwardly biased.
As such, Nakagawa et al. (2022a) recommend a two-step approach: first fit a model with SE as the moderator; if the intercept of that model is significant, refit the model using V (i.e., \(SE^2\)), as this will give a better estimate of the adjusted mean effect size.
Why does including sampling variance in the model adjust the overall meta-analytic mean?
Think about how the intercept, \(\mu\), changes its meaning when a continuous moderator, such as \(se\), is included in the model. When estimating \(\beta_{se}\) explicitly how do we now interpret the intercept or meta-analytic mean?
The reason why the intercept (\(\mu\)) is an adjusted meta-analytic mean is that when we include a continuous moderator the intercept itself takes on a different meaning. For example, if we fit the multilevel meta-regression model using SE, then the intercept estimated in that model is the overall meta-analytic mean when the SE is zero. Or, in other words, when there is no sampling variability or uncertainty around the estimate. We can see that this is true by setting SE = 0 in the equation:
\[
y_{i} = \mu + \beta_{se} \times 0 = \mu
\]
Fitting a Multilevel Meta-Regression Model to Test and Correct for Publication Bias
Let’s now apply the methods proposed above to Arnold et al. (2021)’s data. To simplify, we are just going to remove other moderators and only fit SE and/or V. This just simplifies the interpretation of the intercept, but we could always include the fitness trait type moderator in these models as well.
```r
# Including sampling standard error (sqrt of the sampling variance) as a moderator
metareg_se <- rma.mv(yi = Zr, V = Zr_v,
                     mods = ~ sqrt(Zr_v),
                     test = "t", dfs = "contain",
                     data = arnold_data_endo,
                     random = list(~1 | Ref, ~1 | obs))
summary(metareg_se)
```
We’ll also fit the model with sampling variance for good measure but, given that the intrcpt (intercept) of the SE model is not significant, we can interpret the first model.
```r
# Including sampling variance as a moderator
metareg_v <- rma.mv(yi = Zr, V = Zr_v,
                    mods = ~ Zr_v,
                    test = "t", dfs = "contain",
                    data = arnold_data_endo,
                    random = list(~1 | Ref, ~1 | obs))
summary(metareg_v)
```
Is there evidence for publication bias? If so, what is the adjusted meta-analytic mean estimate?
Yes, there is evidence for publication bias because the slope estimate for sqrt(Zr_v) (the sampling standard error) is significant. Given that the intrcpt is not significant, we can interpret the model using SE instead of V. We can see from this model that the adjusted Zr when there is no uncertainty is -0.067, with a 95% confidence interval that overlaps zero (95% CI = -0.212 to 0.079). In other words, if there were no uncertainty around the estimates, or we had a very high-powered set of studies, we would expect the correlation to be, on average, -0.067.
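If you want the adjusted mean directly on the correlation scale, one option (a small sketch using metafor’s predict() method on the metareg_se model above) is to predict at SE = 0 and back-transform from \(Z_r\) to \(r\):

```r
# Predicted mean effect when sqrt(Zr_v) = 0, i.e., a hypothetical study with no
# sampling error, back-transformed from Zr to the correlation scale
# (transf.ztor is equivalent to tanh)
predict(metareg_se, newmods = 0, transf = transf.ztor)
```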
A note on effect size choice
The above approach works cleanly for Fisher’s \(Z_r\) because the sampling variance of \(Z_r\) depends only on sample size, not on the effect size itself. For standardised mean differences (SMD / Hedges’ \(g\)) and log response ratios (lnRR), the sampling variance is mathematically coupled to the point estimate, so using \(\sqrt{v}\) as a moderator can induce a spurious slope even when there is no publication bias at all (Nakagawa et al., 2022a). Nakagawa et al. (2022a) therefore recommend using a transformation of the effective sample size as the moderator instead of \(\sqrt{v}\) when working with SMD or lnRR. For SMD and lnRR a good default is:

\[
\text{moderator} = \sqrt{\frac{\tilde{n}_{1} + \tilde{n}_{2}}{\tilde{n}_{1}\tilde{n}_{2}}},
\]
where \(\tilde{n}_{1}\) and \(\tilde{n}_{2}\) are the per-group sample sizes. This is proportional to the “design-based” sampling standard error but does not depend on the observed effect size. In metafor this simply means replacing sqrt(Zr_v) in the mods = argument with the precomputed moderator column.
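As a sketch, suppose an SMD (or lnRR) dataset stored its per-group sample sizes in columns called n1 and n2 (hypothetical names, as are dat, yi, vi, study and obs below); the moderator could then be computed and used like so:

```r
# Hypothetical sketch for an SMD or lnRR dataset: compute the effective
# sample-size based moderator and use it in place of sqrt(vi)
dat$ess_mod <- with(dat, sqrt((n1 + n2) / (n1 * n2)))

mod_ess <- rma.mv(yi = yi, V = vi,
                  mods = ~ ess_mod,
                  test = "t", dfs = "contain",
                  data = dat,
                  random = list(~1 | study, ~1 | obs))
summary(mod_ess)
```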
(2) Testing for Time-lag Bias
Another useful diagnostic asks whether effect sizes are shrinking over time – a pattern often called a time-lag bias or decline effect (Jennions et al., 2013; Nakagawa et al., 2022a). The intuition is that, for early studies addressing a research question, large and significant results are disproportionately likely to be published first (because they are “exciting”, or simply easier to write up and accept). As the field matures, smaller and null results catch up, and the apparent effect size decays. We can test this directly by including publication year as a moderator in the multilevel model. To make the intercept interpretable as the mean effect at the average publication year, we mean-centre year first; however, publication year could be centred on any year of interest.
```r
# Mean-centre publication year so the intercept is the effect at the mean year
arnold_data_endo$year_c <- arnold_data_endo$Year - mean(arnold_data_endo$Year)

metareg_year <- rma.mv(yi = Zr, V = Zr_v,
                       mods = ~ year_c,
                       test = "t", dfs = "contain",
                       data = arnold_data_endo,
                       random = list(~1 | Ref, ~1 | obs))
summary(metareg_year)
```
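Before interpreting the model output, it can help to eyeball the raw pattern. Here is a quick visual sketch (not part of the original analysis) plotting effect sizes against publication year, with point size proportional to precision; the formal test is still the meta-regression above:

```r
# Quick visual check of a possible decline effect: Zr against publication year,
# with larger points for more precise (lower sampling variance) effect sizes.
# The unweighted lm smoother is only a visual aid.
ggplot(arnold_data_endo, aes(x = Year, y = Zr)) +
  geom_point(aes(size = 1 / sqrt(Zr_v)), alpha = 0.5) +
  geom_smooth(method = "lm", se = TRUE) +
  labs(size = "Precision (1/SE)") +
  theme_bw()
```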
We can also fit a combined model that tests for small-study effects and time-lag bias simultaneously, which is the model advocated by Nakagawa et al. (2022a):
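```r
# Combined model: small-study (sqrt(Zr_v)) and time-lag (year_c) moderators together
metareg_full <- rma.mv(yi = Zr, V = Zr_v,
                       mods = ~ sqrt(Zr_v) + year_c,
                       test = "t", dfs = "contain",
                       data = arnold_data_endo,
                       random = list(~1 | Ref, ~1 | obs))
summary(metareg_full)
```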
Look at the slope for year_c in the combined model. Is there evidence that effect sizes in the Arnold dataset have changed systematically over time? How would you report this alongside the small-study effect test?
A significant negative slope on year_c would indicate a decline effect: more recent studies report smaller \(Z_r\) values than older ones. A positive slope would suggest the opposite – that earlier studies underestimated the effect. A slope that is indistinguishable from zero means there is no detectable time-lag bias, and the intercept (at the mean year) is a reasonable summary of the long-term average effect. Because both \(\sqrt{v}\) and year_c are included simultaneously, each slope tests for its own bias after adjusting for the other, which is useful because time-lag and small-study effects can be partially confounded (early studies were often smaller).
(3) Bias-robust Estimation with Dependent Effect Sizes
The approaches above all rely on the usual multilevel random-effects (MLMA) weights, which for effect size \(i\) are proportional to \(1/(v_i + \tau^2)\), where \(v_i\) is the sampling variance and \(\tau^2\) is the estimated between-study heterogeneity. Yang et al. (2024) point out two problems with this when selective reporting is present alongside non-independent effect sizes:
Adding \(\tau^2\) to the weights erodes the protection against small-study effects. In the classical fixed-effect model the weights are \(1/v_i\), so imprecise (small-\(n\)) studies are penalised heavily and have little influence on the pooled mean. In the random-effects model, as \(\tau^2\) grows large relative to \(v_i\), the weights become increasingly similar across studies – small studies start to receive almost as much weight as large ones (see the numerical sketch after this list). If the literature is also selectively reporting large effects from small studies (the classic file-drawer pattern), the MLMA model gives those biased effects far more influence than a fixed-effect model would. Yang et al. (2024) show this is why conventional MLMA estimates tend to inherit selective-reporting bias.
Standard parametric SEs assume a correctly specified dependence structure. Clustered effect sizes (multiple effects per study, phylogenetic clustering, shared controls) inflate type-I error rates even when random effects are included, because the working covariance structure is rarely specified perfectly.
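To see why adding \(\tau^2\) to the weights matters, here is a tiny numerical sketch (ours, not from Yang et al., 2024): as \(\tau^2\) grows, the weight advantage of a precise study over an imprecise one collapses from 40-fold towards parity.

```r
# Toy illustration: relative random-effects weights for a precise (v = 0.01)
# and an imprecise (v = 0.40) effect size as tau^2 grows
v <- c(precise = 0.01, imprecise = 0.40)
for (tau2 in c(0, 0.05, 0.5, 2)) {
  w <- 1 / (v + tau2)
  cat("tau^2 =", tau2,
      "-> weight ratio (precise / imprecise):",
      round(w["precise"] / w["imprecise"], 1), "\n")
}
```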
Yang et al. (2024) re-analysed 448 ecological and evolutionary meta-analyses and showed that the joint impact of these two problems can be substantial: on average, effect sizes are overestimated and standard errors are underestimated when both are ignored. They propose a simple two-step procedure:
Step 1 – bias-robust point estimation. Fit a fixed-effect multivariate model (no study- or observation-level random effects) using a within-study variance–covariance matrix built with metafor::vcalc(). Because this model uses \(1/v_i\)-type weights rather than \(1/(v_i + \tau^2)\), imprecise studies are penalised as they would be in a classical FE meta-analysis, making the pooled mean far less sensitive to selective reporting. This is sometimes called the FE + VCV or unrestricted weighted least squares (UWLS) estimator.
Step 2 – cluster-robust variance estimation (CRVE). Wrap the Step 1 model in metafor::robust() with clubSandwich = TRUE (or equivalently call clubSandwich::coef_test()) and set cluster = to the study ID. This produces CR2 small-sample–corrected sandwich standard errors and Satterthwaite-style degrees of freedom, which remain valid even though the Step 1 model deliberately ignores between-study heterogeneity. Note that CR2 is only applicable when the underlying model has no random effects, which is exactly why Step 1 is fixed-effect.
Let’s apply this to the Arnold endotherm data. We’ll compare the naive multilevel estimate, the \(\sqrt{v}\)-adjusted Egger estimate, and the bias-robust FE + VCV estimate with CRVE.
```r
# (a) Naive multilevel meta-analytic mean (MLMA; inverse-variance + tau^2 weighted)
mod_naive <- rma.mv(yi = Zr, V = Zr_v,
                    test = "t", dfs = "contain",
                    data = arnold_data_endo,
                    random = list(~1 | Ref, ~1 | obs))

# (b) Step 1 of Yang et al. 2024: bias-robust FE + VCV model.
# We build a block-diagonal within-study VCV assuming a constant sampling
# correlation of rho = 0.5 between effect sizes from the same study, then
# fit a fixed-effect multivariate model (no random = argument).
VCV <- metafor::vcalc(vi = Zr_v,
                      cluster = Ref,
                      obs = obs,
                      rho = 0.5,
                      data = arnold_data_endo)

mod_MLFE <- rma.mv(yi = Zr, V = VCV,
                   method = "REML",
                   test = "t", dfs = "contain",
                   data = arnold_data_endo)
summary(mod_MLFE)

# (c) Step 2: cluster-robust inference (CR2) on the FE + VCV model,
# clustering on study (Ref). This uses clubSandwich under the hood.
mod_MLFE_RVE <- metafor::robust(mod_MLFE,
                                cluster = arnold_data_endo$Ref,
                                adjust = TRUE,
                                clubSandwich = TRUE)
mod_MLFE_RVE
```
```
Multivariate Meta-Analysis Model (k = 209; method: REML)

Variance Components: none

Test for Heterogeneity:
Q(df = 208) = 1115.8706, p-val < .0001

Number of estimates:   209
Number of clusters:    62
Estimates per cluster: 1-27 (mean: 3.37, median: 2)

Model Results:

estimate      se¹    tval¹     df¹    pval¹   ci.lb¹   ci.ub¹
  0.0664  0.0220   3.0143   12.34   0.0105   0.0185   0.1142  *

---
Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1

1) results based on cluster-robust inference (var-cov estimator: CR2,
   approx t-test and confidence interval, df: Satterthwaite approx)
```
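As noted in Step 2 above, the same CR2 inference can be obtained directly from the clubSandwich package. A minimal sketch (assuming clubSandwich is installed; the output layout differs slightly from metafor::robust()):

```r
# CR2 cluster-robust test on the FE + VCV model, clustering on study (Ref),
# with Satterthwaite degrees of freedom (should match the results above)
clubSandwich::coef_test(mod_MLFE, vcov = "CR2",
                        cluster = arnold_data_endo$Ref,
                        test = "Satterthwaite")
```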
We can now assemble a side-by-side comparison of the three approaches:
```r
# Naive MLMA estimate
naive_row <- data.frame(Method = "Naive MLMA (Ref + obs random effects)",
                        Estimate = mod_naive$b[1],
                        CI_lower = mod_naive$ci.lb[1],
                        CI_upper = mod_naive$ci.ub[1])

# sqrt(v)-adjusted Egger-type estimate (from earlier in this tutorial)
egger_row <- data.frame(Method = "sqrt(v)-adjusted (Nakagawa 2022)",
                        Estimate = metareg_se$b[1],
                        CI_lower = metareg_se$ci.lb[1],
                        CI_upper = metareg_se$ci.ub[1])

# Bias-robust FE + VCV with CR2 cluster-robust inference (Yang 2024)
robust_row <- data.frame(Method = "FE + VCV + CR2 (Yang 2024)",
                         Estimate = mod_MLFE_RVE$b[1],
                         CI_lower = mod_MLFE_RVE$ci.lb[1],
                         CI_upper = mod_MLFE_RVE$ci.ub[1])

comparison <- rbind(naive_row, egger_row, robust_row)
flextable::flextable(comparison)
```
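To make the comparison easier to digest, we can also sketch the three estimates and their confidence intervals in a simple plot (assuming the comparison data frame built above):

```r
# Point estimates and 95% CIs for the three approaches on the Zr scale
ggplot(comparison, aes(x = Estimate, y = Method)) +
  geom_vline(xintercept = 0, linetype = "dashed") +
  geom_pointrange(aes(xmin = CI_lower, xmax = CI_upper)) +
  labs(x = "Pooled effect size (Zr)", y = NULL) +
  theme_bw()
```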
Compare the three estimates in the table above. Do they tell the same story, or do they disagree? Which would you report as your headline estimate, and how would you use the others?
There is no single “right” answer here, and that is the point. The naive REML estimate is what most readers will expect to see, but it is the most vulnerable to both selective reporting and under-estimated standard errors. The \(\sqrt{v}\)-adjusted estimate asks “what would the effect be in a hypothetical study with no sampling error?” and is the standard ecological publication-bias sensitivity check. The bias-robust + CRVE estimate asks “what does the pooled mean look like if we do not let the smallest-variance studies dominate, and if we refuse to trust the parametric standard errors?”. A defensible strategy is to report the naive estimate as the primary result and present the other two as sensitivity analyses – if all three agree qualitatively, the conclusion is robust. If they disagree strongly, the disagreement itself is the result, and you should be cautious about any strong claims from the naive model.
Conclusions
Rather than relying on a single publication-bias test, the current best-practice workflow for ecological meta-analyses is to triangulate across several complementary tools:
Start with a contour-enhanced funnel plot of both raw effects and meta-analytic residuals to build intuition about what might be missing and where.
Fit a multilevel Egger-type model including \(\sqrt{v}\) (or the effective-sample-size moderator for SMD/lnRR) to test for small-study effects and obtain an adjusted mean (Nakagawa et al., 2022a).
Add publication year as a moderator to test for time-lag bias and decline effects (Nakagawa et al., 2022a).
Finally, refit the model using bias-robust weights and cluster-robust variance estimation to check whether your pooled mean is being driven by selective reporting or by assumed-away non-independence (Yang et al., 2024).
If these analyses tell a consistent story, you can report your meta-analytic mean with reasonable confidence. If they disagree, the disagreement is itself an important result and should be reported transparently. None of these tools provides definitive proof that publication bias is (or is not) present – they are sensitivity analyses designed to make your conclusions more defensible in the face of an inherently incomplete literature (Koricheva et al., 2013; Noble et al., 2017).
Jennions, M. D., Lortie, C. J., Rosenberg, M. S. and Rothstein, H. R. (2013). Publication and related biases. In: Handbook of Meta-Analysis in Ecology and Evolution (eds J. Koricheva, J. Gurevitch & K. Mengersen). Princeton University Press, Princeton and Oxford. pp 207–236.
Koricheva, J., Gurevitch, J. and Mengersen, K. (2013). Handbook of meta-analysis in ecology and evolution. Princeton University Press, Princeton, New Jersey.
Rothstein, H. R., Sutton, A. J. and Borenstein, M. (2005). Publication bias in meta-analysis: Prevention, assessment and adjustments. Wiley, Chichester.