These data are the results of a greenshouse factorial experiment performed to evaluate the effect of defoliation on the production of cotton at different growth stages. The experiment is a \(5\times 5\) factorial with 5 replications in a complete randomized design. The experimental unit was a pot with 2 cotton plants. The response variable is the number of bolls produced at the end of the crop cycle. The observed number of cotton bolls is a count variable with underdispersion (sample variance less than the sample mean).
A data.frame
with 125 records and 4 variables,
described below.
defol
A numeric factor with 5 levels that represents the
(artifitial) levels of defoliation (percent in leaf area removed
with scissors) applied for all leaves of the plants.
phenol
A categorical ordered factor with 5 levels that
represents the (phenological) growth stages of the cotton plants
in which the levels of defoliation was applied.
rept
Integer variable that indexes each experimenal unit
in each treatment cell.
bolls
The number of bolls produced (count variable)
evaluated at harvest of cotton.
Silva, A. M., Degrande, P. E., Suekane, R., Fernandes, M. G., Zeviani, W. M. (2012). Impacto de diferentes níveis de desfolha artificial nos estádios fenológicos do algodoeiro. Revista de Ciências Agrárias, 35(1), 163–172. http://www.scielo.mec.pt/pdf/rca/v35n1/v35n1a16.pdf.
Zeviani, W. M., Ribeiro, P. J., Bonat, W. H., Shimakura, S. E., Muniz, J. A. (2014). The Gamma-count distribution in the analysis of experimental underdispersed data. Journal of Applied Statistics, 41(12), 1–11. http://doi.org/10.1080/02664763.2014.922168, http://leg.ufpr.br/doku.php/publications:papercompanions:zeviani-jas2014.
library(lattice) library(latticeExtra)#>data(cotton_defol) str(cotton_defol)#> 'data.frame': 125 obs. of 4 variables: #> $ phenol: Factor w/ 5 levels "vegetative","flower bud",..: 1 1 1 1 1 1 1 1 1 1 ... #> $ defol : num 0 0 0 0 0 0.25 0.25 0.25 0.25 0.25 ... #> $ rept : int 1 2 3 4 5 1 2 3 4 5 ... #> $ bolls : num 10 9 8 8 10 11 9 10 10 10 ...# x11(width = 7, height = 2.8) xyplot(bolls ~ defol | phenol, data = cotton_defol, layout = c(NA, 1), type = c("p", "smooth"), xlab = "Artificial defoliation level", ylab = "Number of bolls produced", xlim = extendrange(c(0:1), f = 0.15), jitter.x = TRUE)# Sample mean and variance in each treatment cell. mv <- aggregate(bolls ~ phenol + defol, data = cotton_defol, FUN = function(x) { c(mean = mean(x), var = var(x)) }) str(mv)#> 'data.frame': 25 obs. of 3 variables: #> $ phenol: Factor w/ 5 levels "vegetative","flower bud",..: 1 2 3 4 5 1 2 3 4 5 ... #> $ defol : num 0 0 0 0 0 0.25 0.25 0.25 0.25 0.25 ... #> $ bolls : num [1:25, 1:2] 9 8.4 9.4 8.6 10 10 9.4 5.8 7 10 ... #> ..- attr(*, "dimnames")=List of 2 #> .. ..$ : NULL #> .. ..$ : chr "mean" "var"xlim <- ylim <- extendrange(c(mv$bolls), f = 0.05) # Evidence of underdispersion. xyplot(bolls[, "var"] ~ bolls[, "mean"], data = mv, grid = TRUE, aspect = "iso", type = c("p", "r"), xlim = xlim, ylim = ylim, ylab = "Sample variance", xlab = "Sample mean") + layer(panel.abline(a = 0, b = 1, lty = 2))