Tracing Geographical Origins of Teas Based on FT-NIR Spectroscopy: Introduction of Model Updating and Imbalanced Data Handling Approaches

Joint Authors

Fu, Xian-Shu
Hong, Xue-Zhen
Wang, Zheng-Liang
Zhang, Li
Yu, Xiao-Ping
Ye, Zi-Hong

Source

Journal of Analytical Methods in Chemistry

Issue

Vol. 2019, Issue 2019 (31 Dec. 2019), pp.1-8, 8 p.

Publisher

Hindawi Publishing Corporation

Publication Date

2019-01-03

Country of Publication

Egypt

No. of Pages

8

Main Subjects

Chemistry

Abstract EN

This work presents a reliable approach to trace teas’ geographical origins despite changes in teas caused by different harvest years.

A total of 1447 tea samples collected from various areas in 2014 (660 samples) and 2015 (787 samples) were detected by FT-NIR.

Seven classifiers trained on the 2014 dataset all succeeded to trace origins of samples collected in 2014; however, they all failed to predict origins for the 2015 samples due to different data distributions and imbalanced dataset.

Three outlier detection based undersampling approaches—one-class SVM (OC-SVM), isolation forest and elliptic envelope—were then proposed; as a result, the highest macro average recall (MAR) for the 2015 dataset was improved from 56.86% to 73.95% (by SVM).

A model updating approach was also applied, and the prediction MAR was significantly improved with increase in the updating rate.

The best MAR (90.31%) was first achieved by the OC-SVM combined SVM classifier at a 50% rate.

American Psychological Association (APA)

Hong, Xue-Zhen& Fu, Xian-Shu& Wang, Zheng-Liang& Zhang, Li& Yu, Xiao-Ping& Ye, Zi-Hong. 2019. Tracing Geographical Origins of Teas Based on FT-NIR Spectroscopy: Introduction of Model Updating and Imbalanced Data Handling Approaches. Journal of Analytical Methods in Chemistry،Vol. 2019, no. 2019, pp.1-8.
https://search.emarefa.net/detail/BIM-1169025

Modern Language Association (MLA)

Hong, Xue-Zhen…[et al.]. Tracing Geographical Origins of Teas Based on FT-NIR Spectroscopy: Introduction of Model Updating and Imbalanced Data Handling Approaches. Journal of Analytical Methods in Chemistry No. 2019 (2019), pp.1-8.
https://search.emarefa.net/detail/BIM-1169025

American Medical Association (AMA)

Hong, Xue-Zhen& Fu, Xian-Shu& Wang, Zheng-Liang& Zhang, Li& Yu, Xiao-Ping& Ye, Zi-Hong. Tracing Geographical Origins of Teas Based on FT-NIR Spectroscopy: Introduction of Model Updating and Imbalanced Data Handling Approaches. Journal of Analytical Methods in Chemistry. 2019. Vol. 2019, no. 2019, pp.1-8.
https://search.emarefa.net/detail/BIM-1169025

Data Type

Journal Articles

Language

English

Notes

Includes bibliographical references

Record ID

BIM-1169025