A Robust Near Infrared Modeling by Least Angel Regression and Sampling Error Profile Analysis
  
View Full Text    Download reader
DOI:
KeyWord:least angle regression  regression coefficient  Monte Carlo sampling  sampling error profile analysis  variable selection  near infrared spectroscopy
  
AuthorInstitution
XIONG Qin,ZHANG Ruo qiu,LI Hui,CHEN Wan chao,DU Yi ping 1.华东理工大学化学与分子工程学院上海市功能性材料化学重点实验室;2.德宏师范高等专科学校理工系
Hits: 2328
Download times: 1158
Abstract:
      A novel variable selection method based on sampling error profile analysis frame and least angel regression(SEPA-LAR) was proposed in order to build a robust NIR model.Based on SEPA-LAR,more models were obtained by Monte Carlo sampling(MCS),and the LAR regression coefficients at each wavelength were statistically analyzed,which were sorted by the sum sequence of their absolute values.Wavelengths containing larger sums of the absolute values of regression coefficients were selected,and a model with the wavelengths was built.Samples in the independent validation dataset were applied in the evaluation of the model.NIR datasets of corn moisture,diesel density and cheese fat were used to evaluate the performance of SEPA-LAR.Errors of root mean squared error of prediction(RMSEP) estimated with the validation dataset are 0001 44%(moisture),0001 58 g/mL(density) and 113 g/100 g(fat content),respectively.The results showed that,compared with Monte Carlo uninformative variable elimination(MCUVE),moving window partial least squares regression(MWPLS) and competitive adaptive reweighted sampling(CARS),SEPA-LAR could select less wavelengths and has smaller prediction error.The calibration model built by SEPA-LAR has good predictive ability,stability and interpretability.
Close