Ongoing Study
Variable Selection for Nonparametric Quantile Regression
With Dr. Hao Helen Zhang, Dr. Howard D. Bondell and Dr. Hui Zou
cosso package now available at CRAN.
Key Reference:
Li,
Y., Liu, Y. and Zhu, J. (2007) Quantile Regression in
Reproducing Kernel Hilbert Spaces. J.
Am. Stat. Assoc.,
102,
255-268
Lin, Y. and Zhang, H. H. (2006) Component Selection and Smoothing in Smoothing Spline Analysis of Variance Models. Ann. Stat. 34, 2272-2297
Storlie, C., Bondell, H., Reich, B., and Zhang, H.H. (2010) Surface Estimation, Variable Selection, and the Nonparametric Oracle Property. Stat. Sinica, 21, 679-705
Forward Selection in High-Dimensional Feature Space
With Dr. Howard Bondell, Dr. Hao Helen Zhang and Dr. Leonard Stefanski
Key Reference:
Fan, J. and Lv, J. (2008) Sure Independence Screeing for Ultrahigh Dimensional Feature Space with Discussion. J. Roy. Stat. Soc. B 70, 849-911
Wang, H. (2009) Forward Regression for Ultra-High Dimensional Variable Screening. J. Am. Stat. Assoc. 104, 1512-1524
Least Squares Approximation
Key Reference:
Wang, H. and Leng C. (2007) Unified LASSO estimation by Least Squares Approximation . J. Am. Stat. Assoc. 102, 1039-1048
Previous Studies
Diagnosis of Multivariate Normal Mixture Model
Picture taken from LosAlamos National Laboratory. http://www.lanl.gov
Motivated by analyzing flow cytometry data, Dr. Lung-An Li, ISS, AS, adopted the multivariate normal mixture model to study this kind of data.
In this study, we are orginally dealt with a 300000 observations by 9 biomarkers dataset per mice. Due to the difficulties arisen from dimensionality and of validity normal assumption, we used only 4 of these 9 biomarkers and further partition the data into proper subsets.
The purpose of this study aims to provide rapid screening procedures of disease phenotypes via high throughput methods.Furthermore, biologists can identify mutant genes responsible for pheno-deviant mouse model.
As a well-established modeling technique, mixture model has been widely applied in various fields, the statistical chellenges in this study is to provide a visual inspection procedure to examines the model fitting. Unlike univariate or multivariate cases, various statistical procedures are reaily available to perform normality test. However, a tool for multivariate nomal mixture data is still unavailable.
Willingness-to-Pay
Self-Organized Map (SOM)
Chromatin immunoprecipitation (ChIP) Sequencing
In this study, under the guidence of Dr. Chen-Hsin Chen and Dr. Ueng-Cheng Yang, I was assigned to study those peak finding algorithms implemented in currently-available ChIP-Seq analysis programss, such as CisGenome(Ji and Wang), Peak Finder (The Wold Lab).

Picture taken from Illumina company. http://www.illumina.com

