Methods Inf Med 2005; 44(03): 423-430
DOI: 10.1055/s-0038-1633988
Original Article
Schattauer GmbH

Design and Analysis of Two-color Microarray Experiments Using Linear Models

F. Bretz
1   B&SR, Novartis Pharma AG, Basel, Switzerland
,
J. Landgrebe
2   Abteilung Biochemie II, Universität Göttingen, Göttingen, Germany
,
E. Brunner
3   Abteilung Medizinische Statistik, Universität Göttingen, Göttingen, Germany
› Author Affiliations
Further Information

Publication History

Publication Date:
06 February 2018 (online)

Summary

Objectives: A variety of linear models have recently been proposed for the design and analysis of micro-array experiments. This article gives an introduction to the most common models and describes their respective characteristics.

Methods: We focus on the application of linear models to logarithmized and normalized microarray data from two-color arrays. Linear models can be applied at different stages of evaluating microarray experiments, such as experimental design, background correction, normalization and hypothesis testing. Both one-stage and two-stage linear models including technical and possibly biological replicates are described. Issues related to selecting robust and efficient microarray designs are also discussed.

Results: Linear models provide flexible and powerful tools, which are easily implemented and interpreted. The methods are illustrated with an experiment performed in our laboratory, which demonstrates the value of using linear models for the evaluation of current microarray experiments.

Conclusions: Linear models provide a flexible approach to properly account for variability, both across and within genes. This allows the experimenter to adequately model the sources of variability, which are assumed to be of major influence on the final measurements. In addition, design considerations essential for any well-planned microarray experiments are best incorporated using linear models. Results from such experimental design investigations show that the widely used common reference design is often substantially less efficient than alternative designs and its use is therefore not recommended.

 
  • References

  • 1 Speed T. Statistical Analysis of Gene Expression Microarray Data. Boca Raton: CRC 2003
  • 2 Parmigiani G, Garett ES, Irizarry RA, Zeger SL. The Analysis of Gene Expression Data. New York: Springer;
  • 3 Simon RM, Korn EL, McShane LM, Radmacher MD, Wright GW, Zhao Y. Design and Analysis of DNA Microarray Investigations. New York: Springer;
  • 4 Rocke DM, Durbin B. A Model for Measurement Error for Gene Expression Arrays. Journal of Computational Biology 2001; 8 (06) 557-69.
  • 5 Kerr M, Martin M, Churchill G. Analysis of variance in microarray data. Journal of Computational Biology 2000; 7: 819-37.
  • 6 Kerr MK. Linear models for microarray data analysis: Hidden similarities and differences. Journal of Computational Biology 2003; 10 (06) 891-901.
  • 7 Lee MLT, Lu W, Whitmore GA, Beier D. Models for microarray gene expression data. Journal of Biopharmaceutical Statistics 2002; 12 (01) 1-19.
  • 8 Ittrich C. Normalization for Two-channel Microarray Data. Methods Inf Med 2005; 44: 418-22. (this issue)
  • 9 Kerr MK, Churchill GA. Experimental design for gene expression microarrays. Biostatistics 2001; 2: 183-201.
  • 10 Wolfinger RD, Gibson G, Wolfinger ED, Bennett L, Hamadeh H, Bushel P, Afshari C, Paules RS. Assessing gene significance from cDNA microarray expression data via mixed models. Journal of Computational Biology 2001; 8 (06) 625-37.
  • 11 Landgrebe J, Bretz F, Brunner E. Efficient design and analysis of factorial two color microarray Experiments. Computational and Statistical Data Analysis. 2004. (in press)
  • 12 Yang YH, Dudoit S, Luu P, Speed TP. Normalization for cDNA microarray data: A robust composite method addressing single and multiple slide systematic variation. Nucleic Acids Research 2002; 30: e15
  • 13 Landgrebe J, Bretz F, Brunner E. Efficient twosample designs for microarray experiments with biological replications. In Silico Biology 2004 4. 38 Available at http://www.bioinfo.de/isb/2004/04/0038/
  • 14 Kerr MK. Design considerations for efficient and effective microarray studies. Biometrics 2003; b 59: 822-8.
  • 15 Glonek G, Solomon P. Factorial and time course designs for cDNA microarray experiments. Biostatistics 2004; 5: 89-112.
  • 16 Vass K, Wit E. Background correction through ANOVA. Website. http://www.stats.gla.ac.uk/~microarray/research.html [last access: August 21, 2004]
  • 17 Jin W, Riley RM, Wolfinger RD, White KP, Passador- Gurgel G, Gibson G. The contributions of sex, genotype and age to transcriptional variance in Drosophila melanogaster. Nature Genetics 2001; 29: 389-95.
  • 18 Dobbin K, Simon R. Comparison of microarray designs for class comparison and class discovery. Bioinformatics 2002; 18: 1438-45.
  • 19 Dobbin K, Shih JH, Simon R. Statistical design of reverse dye microarrays. Bioinformatics 2003; 19: 803-10.
  • 20 Churchill G. Fundamentals of experimental design for cDNA microarrays. Nature Genetics 2002; 32: 490-5.
  • 21 Huber W, von Heydebreck A, Sultmann H, Poustka A, Vingron M. Variance stabilization applied to microarray data calibration and to the quantification of differential expression. Bioinformatics 2002; 18 (Suppl. 01) Suppl S96-S104.
  • 22 Durbin BP, Hardin JS, Hawkins DM, Rocke DM. A variance-stabilizing transformation for geneexpression microarray data. Bioinformatics 2002; 18 (Suppl. 01) Suppl S105-S110.
  • 23 Cui X, Kerr K, Churchill G. Data transformations for cDNA microarray data. Statistical Applications in Genetics and Molecular Biology 2002; 2 (01) Article 4
  • 24 Rocke DM, Durbin B. Approximate variancestabilizing transformations for gene-expression microarray data. Bioinformatics 2003; 19 (08) 966-72.
  • 25 Ravishanker N, Dey DK. A First Course in Linear Model Theory. Chapman & Hall / CRC; 2002
  • 26 Searle S. Linear Models. New York: Wiley; 1971
  • 27 Rao CR, Mitra SK. Generalized Inverse of Matrices and its Applications. New York: Wiley; 1971
  • 28 Pukelsheim F. Optimal Design of Experiments. New York: Wiley; 1993
  • 29 Bretz F, Landgrebe J, Brunner E. Multiplicity issues in microarray experiments. Methods Inf Med 2005; 44: 431-7. (this issue)
  • 30 Hommel G, Kropf S. Tests for Differentiation in Gene Expression Using a Data-Driven Order or Weights for Hypotheses. Biometrical Journal. 2005. (to appear)
  • 31 Benjamini Y, Hochberg Y. Controlling the false discovery rate: A practical and powerful approach to multiple testing. J Roy Stat Soc B 1995; 57: 289-300.