This is a follow up to the Graphing - Adult Size vs. Newborn Size problem.
We’ve graphed the relationship between adult size and new born size in mammals and now it’s time to analyze the relationship statistically.
Optional: If you want, plot a histogram of the residuals of the regression to make sure that they are roughly normally distributed (you can do this with just a single line of code)
This is a follow up to the Scientific Python 7 exercise.
Dr. Granger is interested in studying the factors controlling the size and carbon storage of shrubs. This research is part of a larger area of research trying to understand carbon storage by plants. She has conducted a small preliminary experiment looking at the effect of three different treatments on shrub volume at four different locations. She wants to conduct a preliminary analysis of these data to include in a grant proposal and she would like you to conduct the analysis for her (she might be a world renowned expert in carbon storage in plants, but she sure doesn’t know much about computers). She has placed a data file on the web for you to download. She wants you to run an ANOVA to determine if the different experimental treatments lead to differences in shrub carbon.
.head()
method.1.8 + 2 *
log(volume)
where volume
is the volume of the shrub. You’ll need to use
the numpy
version of the log()
function. Call the function to get a
column of shrub carbons and then print out that column.data['carbon'] = get_shrub_carbons(lengths, widths, heights)
. Print out the
entire dataframe.anova_lm()
. You can import anova_lm()
using from statsmodels.stats.anova
import anova_lm
.This is a follow up to the Scientific Python 1 and Graphing 1 problems.
We have previously compared the average masses of extant and extinct species on different continents to try to understand whether size has an influence on extinction in mammals. We’ve done this by looking at the means and by comparing the histograms for extinct and extant species, but we haven’t done any statistics yet to actually test if the average sizes are different.
Perform either a two-sample t-test or an ANOVA (with only two categories they’ll accomplish the same thing; if you’re not very comfortable with statistics in general I’d recommend using the two-sample t-test) to compare the average sizes of the extinct and extant species in each continent.
Save the results as a csv file named size_comparison_results.csv
, where the
first column is the name of the continent, the second column is the average mass
for the extant species, the third column is the average mass for the extinct
species, and the fourth column is the p-value for whether or not they are
different.