A Comparative Survey: Benchmarking for Pool-based Active Learning

Xueying Zhan; Huan Liu; Qing Li; Antoni B. Chan

A Comparative Survey: Benchmarking for Pool-based Active Learning

Xueying Zhan, Huan Liu, Qing Li, Antoni B. Chan

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

Active learning (AL) is a subfield of machine learning (ML) in which a learning algorithm aims to achieve good accuracy with fewer training samples by interactively querying the oracles to label new data points. Pool-based AL is well-motivated in many ML tasks, where unlabeled data is abundant, but their labels are hard or costly to obtain. Although many pool-based AL methods have been developed, some important questions remain unanswered such as how to: 1) determine the current state-of-the-art technique; 2) evaluate the relative benefit of new methods for various properties of the dataset; 3) understand what specific problems merit greater attention; and 4) measure the progress of the field over time.In this paper, we survey and compare various AL strategies used in both recently proposed and classic highly-cited methods. We propose to benchmark pool-based AL methods with a variety of datasets and quantitative metric, and draw insights from the comparative empirical results.

Original language	English (US)
Title of host publication	Proceedings of the 30th International Joint Conference on Artificial Intelligence, IJCAI 2021
Editors	Zhi-Hua Zhou
Publisher	International Joint Conferences on Artificial Intelligence
Pages	4679-4686
Number of pages	8
ISBN (Electronic)	9780999241196
State	Published - 2021
Event	30th International Joint Conference on Artificial Intelligence, IJCAI 2021 - Virtual, Online, Canada Duration: Aug 19 2021 → Aug 27 2021

Publication series

Name	IJCAI International Joint Conference on Artificial Intelligence
ISSN (Print)	1045-0823

Conference

Conference	30th International Joint Conference on Artificial Intelligence, IJCAI 2021
Country/Territory	Canada
City	Virtual, Online
Period	8/19/21 → 8/27/21

ASJC Scopus subject areas

Artificial Intelligence

Cite this

Zhan, X., Liu, H., Li, Q., & Chan, A. B. (2021). A Comparative Survey: Benchmarking for Pool-based Active Learning. In Z.-H. Zhou (Ed.), Proceedings of the 30th International Joint Conference on Artificial Intelligence, IJCAI 2021 (pp. 4679-4686). (IJCAI International Joint Conference on Artificial Intelligence). International Joint Conferences on Artificial Intelligence.

A Comparative Survey: Benchmarking for Pool-based Active Learning. / Zhan, Xueying; Liu, Huan; Li, Qing et al.
Proceedings of the 30th International Joint Conference on Artificial Intelligence, IJCAI 2021. ed. / Zhi-Hua Zhou. International Joint Conferences on Artificial Intelligence, 2021. p. 4679-4686 (IJCAI International Joint Conference on Artificial Intelligence).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Zhan, X, Liu, H, Li, Q & Chan, AB 2021, A Comparative Survey: Benchmarking for Pool-based Active Learning. in Z-H Zhou (ed.), Proceedings of the 30th International Joint Conference on Artificial Intelligence, IJCAI 2021. IJCAI International Joint Conference on Artificial Intelligence, International Joint Conferences on Artificial Intelligence, pp. 4679-4686, 30th International Joint Conference on Artificial Intelligence, IJCAI 2021, Virtual, Online, Canada, 8/19/21.

Zhan, Xueying ; Liu, Huan ; Li, Qing et al. / A Comparative Survey : Benchmarking for Pool-based Active Learning. Proceedings of the 30th International Joint Conference on Artificial Intelligence, IJCAI 2021. editor / Zhi-Hua Zhou. International Joint Conferences on Artificial Intelligence, 2021. pp. 4679-4686 (IJCAI International Joint Conference on Artificial Intelligence).

@inproceedings{7bfa7c0280e647f3b1477022a170edcf,

title = "A Comparative Survey: Benchmarking for Pool-based Active Learning",

abstract = "Active learning (AL) is a subfield of machine learning (ML) in which a learning algorithm aims to achieve good accuracy with fewer training samples by interactively querying the oracles to label new data points. Pool-based AL is well-motivated in many ML tasks, where unlabeled data is abundant, but their labels are hard or costly to obtain. Although many pool-based AL methods have been developed, some important questions remain unanswered such as how to: 1) determine the current state-of-the-art technique; 2) evaluate the relative benefit of new methods for various properties of the dataset; 3) understand what specific problems merit greater attention; and 4) measure the progress of the field over time.In this paper, we survey and compare various AL strategies used in both recently proposed and classic highly-cited methods. We propose to benchmark pool-based AL methods with a variety of datasets and quantitative metric, and draw insights from the comparative empirical results.",

author = "Xueying Zhan and Huan Liu and Qing Li and Chan, {Antoni B.}",

note = "Funding Information: This work was supported by a grant from the Research Grants Council of the Hong Kong Special Administrative Region, China (Project No. CityU 11215820). Publisher Copyright: {\textcopyright} 2021 International Joint Conferences on Artificial Intelligence. All rights reserved.; 30th International Joint Conference on Artificial Intelligence, IJCAI 2021 ; Conference date: 19-08-2021 Through 27-08-2021",

year = "2021",

language = "English (US)",

series = "IJCAI International Joint Conference on Artificial Intelligence",

publisher = "International Joint Conferences on Artificial Intelligence",

pages = "4679--4686",

editor = "Zhi-Hua Zhou",

booktitle = "Proceedings of the 30th International Joint Conference on Artificial Intelligence, IJCAI 2021",

}

TY - GEN

T1 - A Comparative Survey

T2 - 30th International Joint Conference on Artificial Intelligence, IJCAI 2021

AU - Zhan, Xueying

AU - Liu, Huan

AU - Li, Qing

AU - Chan, Antoni B.

N1 - Funding Information: This work was supported by a grant from the Research Grants Council of the Hong Kong Special Administrative Region, China (Project No. CityU 11215820). Publisher Copyright: © 2021 International Joint Conferences on Artificial Intelligence. All rights reserved.

PY - 2021

Y1 - 2021

N2 - Active learning (AL) is a subfield of machine learning (ML) in which a learning algorithm aims to achieve good accuracy with fewer training samples by interactively querying the oracles to label new data points. Pool-based AL is well-motivated in many ML tasks, where unlabeled data is abundant, but their labels are hard or costly to obtain. Although many pool-based AL methods have been developed, some important questions remain unanswered such as how to: 1) determine the current state-of-the-art technique; 2) evaluate the relative benefit of new methods for various properties of the dataset; 3) understand what specific problems merit greater attention; and 4) measure the progress of the field over time.In this paper, we survey and compare various AL strategies used in both recently proposed and classic highly-cited methods. We propose to benchmark pool-based AL methods with a variety of datasets and quantitative metric, and draw insights from the comparative empirical results.

AB - Active learning (AL) is a subfield of machine learning (ML) in which a learning algorithm aims to achieve good accuracy with fewer training samples by interactively querying the oracles to label new data points. Pool-based AL is well-motivated in many ML tasks, where unlabeled data is abundant, but their labels are hard or costly to obtain. Although many pool-based AL methods have been developed, some important questions remain unanswered such as how to: 1) determine the current state-of-the-art technique; 2) evaluate the relative benefit of new methods for various properties of the dataset; 3) understand what specific problems merit greater attention; and 4) measure the progress of the field over time.In this paper, we survey and compare various AL strategies used in both recently proposed and classic highly-cited methods. We propose to benchmark pool-based AL methods with a variety of datasets and quantitative metric, and draw insights from the comparative empirical results.

UR - http://www.scopus.com/inward/record.url?scp=85125470486&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85125470486&partnerID=8YFLogxK

M3 - Conference contribution

AN - SCOPUS:85125470486

T3 - IJCAI International Joint Conference on Artificial Intelligence

SP - 4679

EP - 4686

BT - Proceedings of the 30th International Joint Conference on Artificial Intelligence, IJCAI 2021

A2 - Zhou, Zhi-Hua

PB - International Joint Conferences on Artificial Intelligence

Y2 - 19 August 2021 through 27 August 2021

ER -

A Comparative Survey: Benchmarking for Pool-based Active Learning

Abstract

Publication series

Conference

ASJC Scopus subject areas

Other files and links

Fingerprint

Cite this