Part 5: Meta-analysis Following on from the meta-analysis lab… Your trusty assistant calls you in a panic to tell you that he accidently messed up when he sent you the data that we looked at in the...

1 answer below »
Please see attached files.


Part 5: Meta-analysis Following on from the meta-analysis lab… Your trusty assistant calls you in a panic to tell you that he accidently messed up when he sent you the data that we looked at in the meta-analysis labs! He confesses that he accidently sent you his office mate’s data on meta-analysis of European drinking habits instead! The real data is in a file called “systematic review data for assignment.csv" that he has now sent to you. Your assistant is worried because he knows you’ve done lots of work on this analysis already, but you reassure him and say that it shouldn’t be too much extra work to adapt the code you already have to look at the proper data set you now have. Read in the new proper data and have a good look it. How many studies/papers did your assistant find? What is the maximum number of species considered in any one study? How many studies included that maximum number of species? How many total results/records/species has your assistant recorded? How many different authors? Which author has the most studies included in the dataset? Which environment had the most studies? Which method was used in most studies? Within this dataset, which author had studied the fewest species? Were the same number of reps used in each study? Were the same number of reps used for each species within each study? You’ve recently read a paper on standardising data for metanalysis. You didn’t do this when you looked at the old dataset because you were thinking that the data was already quite standardised, since it is all relative growth rates, rather than absolute growth amounts. But as you’ve thought about it more, you think that perhaps it makes sense to standardise all the relative growth rates by the mean base relative growth rate for each species. This should avoid the possible issue that species will exhibit larger effects due to conspecific soil, just because they grow more generally. You know this shouldn’t change the results much anyway, but decide to do it for the sake of avoiding possible reviewer criticism. First make a forest plot of your data as it comes in the data sheet. Next standardise the data by dividing all the columns from ‘meanbase’ to ‘ub’ (ie six columns) by the meanbase growth rate (ie make everything relative to the mean base relative growth rate for that species). Then make the forest plot again on the standardised data. What is the effect of the standardisation? From now on, use the standardised effect size for everything! For how many records/species was a significant difference found? For what proportion of records/species was a significant difference found? What is the mean unweighted (but standardised) effect size across the studies? What is the mean standardised effect size across the studies, when we weight by the number of reps used for that species in the study? Make some forest plots using colouring and ordering to get a visual impression of whether effect size seems to vary with environment, plant types, method, author and/or study. Just from the forest plot, which environment looks like it has the smallest effect size overall (ie closest to zero)? Estimate the effect sizes for each environment by calculating the mean standardised effect for each environment, weighted by the number of replicates. What is the estimated effect size for species in arid environments? Now estimate the effect sizes for each environment by calculating the mean standardised effect for each environment, weighted by the inverse standard error. Does the relative ranking of the effect sizes for the four environments change? Do the results of these estimates match what you saw in the forest plot about which environment looks like it has the smallest effect size overall? Now imagine the very likely situation where none of the non-significant papers had actually been published ie create a new data set that only includes the significant studies. Make a forest plot of this reduced data set and check that it makes sense. Now what is the mean unweighted (but standardised) effect size across the studies? Why is this estimate bigger than the previous estimate based on the non-significant studies as well? Estimate the effect sizes for each environment by calculating the mean standardised effect for each environment, weighted by the number of replicates. What is the estimated effect size for species in arid environments? For which environment has estimated effect size changed the most by excluding the non-significant results? Now try looking at the effects of environment, plant type, and method using the metareg function, using the data set with the non-significant results excluded. Consider only additive effects. You might want to use Forest plots too. Write down your conclusions from this analysis. (There is a list of possible conclusions to select from in the online quiz, so probably best to look at the options as soon as you can.) Now try looking at the effects of environment, plant type, method and author using the metareg function, using the data set with the non-significant results excluded. Consider only additive effects, except also include an interaction between environment and plant type. You might want to use Forest plots too. Write down your conclusions from this analysis. (There is a list of possible conclusions to select from in the online quiz, so probably best to look at the options as soon as you can.)
Answered Same DayAug 28, 2022

Answer To: Part 5: Meta-analysis Following on from the meta-analysis lab… Your trusty assistant calls you in a...

Mohd answered on Aug 28 2022
60 Votes
-
-
-
2022-08-28
Importing required packages
library(dplyr)
library(markdown)
library(knitr)
library(magrittr)
library(ggplot2)
library(skimr)
library(readr)
systematic <- read_csv("systematic.csv")
View(systematic)
How many studies/papers did your assistant find?
number_studies<-unique(systematic$study)
length(number_studies)
## [1] 60
What is the maximum number of species considered in any one study?
systematic%>%
group_by(study)%>%
summarise(number_species=length(unique(record)))%>%
arrange(desc(number_species))
## # A tibble: 60 x 2
## study number_species
##
## 1 1 5
## 2 3
5
## 3 7 5
## 4 10 5
## 5 16 5
## 6 30 5
## 7 36 5
## 8 40 5
## 9 44 5
## 10 48 5
## # ... with 50 more rows
How many studies included that maximum number of species?
species_by_study<-systematic%>%
group_by(study)%>%
summarise(number_species=length(unique(record)))%>%
arrange(desc(number_species))
species_by_study%>%
filter(number_species==5)
## # A tibble: 11 x 2
## study number_species
##
## 1 1 5
## 2 3 5
## 3 7 5
## 4 10 5
## 5 16 5
## 6 30 5
## 7 36 5
## 8 40 5
## 9 44 5
## 10 48 5
## 11 60 5
How many total results/records/species has your assistant recorded?
length(unique(systematic$record))
## [1] 181
How many different authors?
number_authors<-unique(systematic$author)
length(number_authors)
## [1] 23
Which author has the most studies included in the dataset?
systematic%>%
group_by(author)%>%
summarise(total_study=length(unique(study)))%>%
arrange(desc(total_study))
## # A tibble: 23 x 2
## author total_study
##
## 1 Y 6
## 2 P 5
## 3 D 4
## 4 J 4
## 5 L 4
## 6 F 3
## 7 H 3
## 8 I 3
## 9 T 3
## 10 W 3
## # ... with 13 more rows
Which environment had the most studies?
systematic%>%
group_by(environment)%>%
summarise(total_study=length(unique(study)))%>%
arrange(desc(total_study))
## # A tibble: 4 x 2
## environment total_study
##
## 1 grassland 19
## 2 heath 15
## 3 arid 14
## 4 forest 12
Which method was used in most studies?
systematic%>%
count(method,study,sort = TRUE)
## # A tibble: 60 x 3
## method study n
##
## 1 a 1 5
## 2 a 3 5
## 3 a 7 5
## 4 a 10 5
## 5 a 40 5
## 6 a 44 5
## 7 b 16 5
## 8 b 30 5
## 9 b 60 5
## 10 d 36 5
## # ... with 50 more rows
Within this dataset, which author had studied the fewest species?
systematic%>%
group_by(author)%>%
summarise(number_species=length(unique(record)))%>%
arrange(number_species)
## # A tibble: 23 x 2
## author number_species
##
## 1 G 2
## 2 B 3
## 3 X 3
## 4 C 4
## 5 K 4
## 6 S 4
## 7 I 5
## 8 Q 5
## 9 U 5
## 10 F 6
## # ... with 13 more rows
Were the same number of reps used in each study?
systematic%>%
count(study,nreps)
## # A tibble: 60 x 3
## study nreps n
##
## 1 1 9 5
## 2 2 8 2
## 3 3 19 5
## 4 4 12 2
## 5 5 17 1
## 6 6 16 4
## 7 7 10 5
## 8 8 21 1
## 9 9 22 3
## 10 10 5 5
## # ... with 50 more rows
Were the same number of reps used for each species within each study?
Forest plot before standardization
ggplot(data=systematic, aes(y=record, x=effsz,
xmin=lb,
xmax=ub)) +
geom_point() +
geom_errorbarh(height=0.5)
Standardization
systematic_df<-systematic
systematic%>%
filter(sig=="TRUE")%>%
arrange(desc(effsz))
## # A tibble: 106 x 16
## record study author environment method spn planttype nreps p meanbase
##
## 1 15 5 B heath d 1 monocot 17 0 1.27
## 2 55 17 I heath d 1 monocot 15 0 1.61
## 3 79 26 S grassland d 3 dicot 14 0.00001 1.59
## 4 64 20 U heath d 2 monocot 11 0 1.67
## 5 163 54 P heath a 2 monocot 8 0.00149 1.49
## 6 77 26 S grassland d 1 monocot 14 0 1.44
## 7 65 20 U heath d 3 dicot 11 0.00007 1.47
## 8 121 40 H grassland a 5 monocot 11 0.00028 1.62
## 9 67 22 K heath b 1 monocot 11 0.00097 1.53
## 10 120 40 H grassland a 4 monocot 11 0.00001 1.63
## # ... with 96 more rows, and 6 more variables: meancon , effsz ,
## # stderr , lb , ub , sig
growth<-systematic_df$meancon-systematic_df$meanbase
systematic_df$meancon<-systematic_df$meancon/growth
systematic_df$effsz<-systematic_df$effsz/growth
systematic_df$stderr<-systematic_df$stderr/growth
systematic_df$lb<-systematic_df$lb/growth
systematic_df$ub<-systematic_df$ub/growth
systematic_df$meanbase<-systematic_df$meanbase/growth
ggplot(data=systematic_df, aes(y=record, x=effsz,
xmin=lb,
xmax=ub)) +
geom_point() +
geom_errorbarh(height=0.1)
systematic_df%>%
filter(sig=="FALSE")%>%
arrange(desc(effsz))
## # A tibble: 75 x 16
## record study author environment method spn planttype nreps p meanbase
##
## 1 3 1 O grassland a 3 dicot 9 0.781 42.8
## 2 169 56 R forest c 3 monocot 18 0.461 -24.4
## 3 122 41 G grassland c 1 monocot 11 0.348 11.0...
SOLUTION.PDF

Answer To This Question Is Available To Download

Related Questions & Answers

More Questions »

Submit New Assignment

Copy and Paste Your Assignment Here