Small Sample Issues for Microarray-Based Classification

Dougherty, Edward R.

doi:https://doi.org/10.1002/cfg.62

International Journal of Genomics

Review Article | Open Access

Volume 2 | Article ID 984620 | https://doi.org/10.1002/cfg.62

Abstract

In order to study the molecular biological differences between normal and diseased tissues, it is desirable to perform classification among diseases and stages of disease using microarray-based gene-expression values. Owing to the limited number of microarrays typically used in these studies, serious issues arise with respect to the design, performance and analysis of classifiers based on microarray data. This paper reviews some fundamental issues facing small-sample classification: classification rules, constrained classifiers, error estimation and feature selection. It discusses both unconstrained and constrained classifier design from sample data, and the contributions to classifier error from constrained optimization and lack of optimality owing to design from sample data. The difficulty with estimating classifier error when confined to small samples is addressed, particularly estimating the error from training data. The impact of small samples on the ability to include more than a few variables as classifier features is explained.

Copyright

Copyright © 2001 Hindawi Publishing Corporation. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
