Go to TogaWare.com Home Page. Data Science Desktop Survival Guide
by Graham Williams
Duck Duck Go



CLICK HERE TO VISIT THE UPDATED SURVIVAL GUIDE

Analysis of Variance ANOVA

ds %>%
  group_by(x) %>%
  summarise(
    n    = n(),
    sm   = mean(y),
    rssq = sum((y - sm)^2)
  ) %>%
  summarise(
    om   = sum(n * sm) / sum(n),
    ssq  = sum(n * (sm - om)^2),
    rssq = sum(rssq),
    df1  = n_distinct(x) - 1,
    mse  = ssq / df1,
    df2  = sum(n) - n_distinct(x),
    rmse = rssq / df2,
    F    = mse / rmse,
    P    = pf(F, df1, df2, lower.tail=FALSE)
  )


Support further development by purchasing the PDF version of the book.
Other online resources include the GNU/Linux Desktop Survival Guide.
Books available on Amazon include Data Mining with Rattle and Essentials of Data Science.
Popular open source software includes rattle and wajig.
Hosted by Togaware, a pioneer of free and open source software since 1984.
Copyright © 2000-2020 Togaware Pty Ltd. . Creative Commons ShareAlike V4.