Latent Variables in High Dimensional Data Workshop
for the
11th International Conference on Molecular Systems Biology 2009
 
Lisa McFerrin and William Atchley
 
    This workshop introduces the concept and relevant statistical methodology for “Latent Variables and Latent Structure” as they relate to analyses of high dimensional molecular data (HDMD).  A latent variable model relates a set of observed or manifest variables to a set of underlying latent variables. Latent variables are not directly observed but are rather inferred through a mathematical model constructed from variables that are observed and directly measured.  
 
    HDMD typically contain thousands of data points arising from a substantially smaller number of sampling units.  Such data present many complexities.  They are typically highly interdependent, exhibit complex underlying components of variability, and meaningful replication is rare.  The workshop will focus on three multivariate statistical methods that can facilitate description and analyses of latent variables and latent structure inherent to HDMD.  These statistical methods are intended to reduce the dimensionality of HDMD such that biologically meaningful patterns of multidimensional covariability are exposed and relevant biological questions can be explored.  
 
    The workshop and tutorials will introduce principal components analysis, common factor analysis and discriminant analysis.  These three methods will be demonstrated using a set of R programs together with annotated output and documentation.  Additional information can be found on the Resources page.
North Carolina State Universityhttp://www.ncsu.edushapeimage_2_link_0
Atchley Labhttp://www.atchleylab.orgshapeimage_3_link_0
Principal Component
Analysis
Factor Analysis
Discriminant Function Analysis
ICMSBhttp://www.picb.ac.cn/icmsb2009/index.htmlshapeimage_4_link_0
Workshop Materials
             Slides (.ppt)                                       Tutorial (.docx)(.pdf)
NCSU geneticshttp://www.genomics.ncsu.edushapeimage_5_link_0
NCSU Bioinformaticshttp://www.bioinformatics.ncsu.edushapeimage_6_link_0
ResourcesResources.htmlshapeimage_9_link_0
HDMD R Package
 
                    Code:        package (.R)             examples (.R)
                    Data:         AA54(.csv)                bHLH(.tab)