Research on Application of Feature Selection Algorithm Based on Combination of Random Forest and Game Theory in Near Infrared Spectroscopy |
Keywords: NIR spectroscopy; random forest; feature selection; Shapley value; production area identification |
Author | Institution |
KONG Qing-qing,DING Xiang-qian,GONG Hui-li,LI Zhong-ren,TANG Xing-hong,YU Chun-xia |
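1. College of Information Science and Engineering, Ocean University of China; 2. Technology Center, China Tobacco Yunnan Industrial Co., Ltd. |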
Abstract: |
This paper proposes a feature selection algorithm that combines random forest with game theory, motivated by the observation that noise and redundant information in near-infrared spectra lower a model's recognition rate. The algorithm first measures feature importance with a random forest and selects the features relevant to classification. It then computes weights for the selected features using improved Shapley values, and uses mutual information to remove redundant information from the weighted feature set, yielding the optimal feature subset. To validate the algorithm, a model for identifying the production area of tobacco leaves was established. The experimental results indicate that the proposed algorithm identifies the production area of tobacco leaves well, with a recognition rate of 95.88%. |
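The weighting and redundancy-removal stages described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not define the "improved" Shapley value, so the standard Shapley value is used here, with the coalition value assumed to be the mutual information between a feature group and the class label. The sketch also assumes the candidate features have already been shortlisted by random forest importance and discretized. The function names, the `redundancy_threshold` and `min_weight` parameters, and the toy data are all hypothetical.

```python
from collections import Counter
from itertools import combinations
from math import factorial, log2


def mutual_information(xs, ys):
    """Mutual information (in bits) between two discrete sequences."""
    n = len(xs)
    px, py, pxy = Counter(xs), Counter(ys), Counter(zip(xs, ys))
    return sum(
        (c / n) * log2(c * n / (px[x] * py[y]))
        for (x, y), c in pxy.items()
    )


def shapley_weights(features, labels):
    """Exact Shapley value of each feature.

    Assumption: the value of a coalition S is I(joint of S; labels).
    The cost is exponential in the number of features, so this only
    suits a small shortlist (e.g. after a random forest pre-filter).
    """
    names = list(features)
    n = len(names)

    def value(subset):
        if not subset:
            return 0.0
        joint = list(zip(*(features[f] for f in subset)))
        return mutual_information(joint, labels)

    weights = {}
    for f in names:
        others = [g for g in names if g != f]
        total = 0.0
        for k in range(n):
            coef = factorial(k) * factorial(n - k - 1) / factorial(n)
            for s in combinations(others, k):
                total += coef * (value(s + (f,)) - value(s))
        weights[f] = total
    return weights


def select_features(features, labels, redundancy_threshold=0.9, min_weight=1e-12):
    """Greedy selection: visit features by descending Shapley weight and
    skip any feature that is uninformative or shares too much mutual
    information with an already selected feature."""
    weights = shapley_weights(features, labels)
    selected = []
    for f in sorted(weights, key=weights.get, reverse=True):
        if weights[f] <= min_weight:
            continue
        if all(
            mutual_information(features[f], features[g]) < redundancy_threshold
            for g in selected
        ):
            selected.append(f)
    return selected, weights


# Toy data (hypothetical): "a" determines the label, "b" duplicates "a",
# and "c" is independent of the label.
labels = [0, 0, 1, 1, 0, 0, 1, 1]
features = {
    "a": [0, 0, 1, 1, 0, 0, 1, 1],
    "b": [0, 0, 1, 1, 0, 0, 1, 1],
    "c": [0, 1, 0, 1, 0, 1, 0, 1],
}
selected, weights = select_features(features, labels)
print(selected, weights)
```

In this toy case the duplicated features "a" and "b" split the credit equally, the noise feature "c" receives zero weight, and the redundancy filter keeps only one of the two duplicates. Real spectra would need a discretization step and a tuned threshold, neither of which the abstract specifies.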