TY - JOUR
T1 - An ensemble-based incremental learning approach to data fusion
AU - Parikh, Devi
AU - Polikar, Robi
N1 - Funding Information:
Manuscript received November 29, 2005; revised May 9, 2006 and July 24, 2006. This material is based upon work supported by the National Science Foundation under Grant ECS-0239090. This paper was recommended by Associate Editor N. Chawla. D. Parikh was with the Electrical and Computer Engineering, Rowan University, Glassboro, NJ 08028 USA. She is now with the Electrical and Computer Engineering, Carnegie Mellon University, Pittsburgh, PA 15213 USA. R. Polikar is with the Electrical and Computer Engineering, Rowan University, Glassboro, NJ 08028 USA (e-mail: [email protected]). Color versions of one or more of the figures in this paper are available online at http://ieeexplore.ieee.org. Digital Object Identifier 10.1109/TSMCB.2006.883873
PY - 2007/4
Y1 - 2007/4
N2 - This paper introduces Learn++, an ensemble of classifiers based algorithm originally developed for incremental learning, and now adapted for information/data fusion applications. Recognizing the conceptual similarity between incremental learning and data fusion, Learn++ follows an alternative approach to data fusion, i.e., sequentially generating an ensemble of classifiers that specifically seek the most discriminating information from each data set. It was observed that Learn++ based data fusion consistently outperforms a similarly configured ensemble classifier trained on any of the individual data sources across several applications. Furthermore, even if the classifiers trained on individual data sources are fine tuned for the given problem, Learn++ can still achieve a statistically significant improvement by combining them, if the additional data sets carry complementary information. The algorithm can also identify - albeit indirectly - those data sets that do not carry such additional information. Finally, it was shown that the algorithm can consecutively learn both the supplementary novel information coming from additional data of the same source, and the complementary information coming from new data sources without requiring access to any of the previously seen data.
AB - This paper introduces Learn++, an ensemble of classifiers based algorithm originally developed for incremental learning, and now adapted for information/data fusion applications. Recognizing the conceptual similarity between incremental learning and data fusion, Learn++ follows an alternative approach to data fusion, i.e., sequentially generating an ensemble of classifiers that specifically seek the most discriminating information from each data set. It was observed that Learn++ based data fusion consistently outperforms a similarly configured ensemble classifier trained on any of the individual data sources across several applications. Furthermore, even if the classifiers trained on individual data sources are fine tuned for the given problem, Learn++ can still achieve a statistically significant improvement by combining them, if the additional data sets carry complementary information. The algorithm can also identify - albeit indirectly - those data sets that do not carry such additional information. Finally, it was shown that the algorithm can consecutively learn both the supplementary novel information coming from additional data of the same source, and the complementary information coming from new data sources without requiring access to any of the previously seen data.
UR - http://www.scopus.com/inward/record.url?scp=34047104426&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=34047104426&partnerID=8YFLogxK
U2 - 10.1109/TSMCB.2006.883873
DO - 10.1109/TSMCB.2006.883873
M3 - Article
C2 - 17416170
AN - SCOPUS:34047104426
SN - 1083-4419
VL - 37
SP - 437
EP - 450
JO - IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
JF - IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
IS - 2
ER -