TY - GEN
T1 - BANNER
T2 - 13th Pacific Symposium on Biocomputing, PSB 2008
AU - Leaman, Robert
AU - Gonzalez, Graciela
PY - 2008/12/1
Y1 - 2008/12/1
N2 - There has been an increasing amount of research on biomedical named entity recognition, the most basic text extraction problem, resulting in significant progress by different research teams around the world. This has created a need for a freely-available, open source system implementing the advances described in the literature. In this paper we present BANNER, an open-source, executable survey of advances in biomedical named entity recognition, intended to serve as a benchmark for the field. BANNER is implemented in Java as a machine-learning system based on conditional random fields and includes a wide survey of the best techniques recently described in the literature. It is designed to maximize domain independence by not employing brittle semantic features or rule-based processing steps, and achieves significantly better performance than existing baseline systems. It is therefore useful to developers as an extensible NER implementation, to researchers as a standard for comparing innovative techniques, and to biologists requiring the ability to find novel entities in large amounts of text. BANNER is available for download at .
AB - There has been an increasing amount of research on biomedical named entity recognition, the most basic text extraction problem, resulting in significant progress by different research teams around the world. This has created a need for a freely-available, open source system implementing the advances described in the literature. In this paper we present BANNER, an open-source, executable survey of advances in biomedical named entity recognition, intended to serve as a benchmark for the field. BANNER is implemented in Java as a machine-learning system based on conditional random fields and includes a wide survey of the best techniques recently described in the literature. It is designed to maximize domain independence by not employing brittle semantic features or rule-based processing steps, and achieves significantly better performance than existing baseline systems. It is therefore useful to developers as an extensible NER implementation, to researchers as a standard for comparing innovative techniques, and to biologists requiring the ability to find novel entities in large amounts of text. BANNER is available for download at .
UR - http://www.scopus.com/inward/record.url?scp=40549140499&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=40549140499&partnerID=8YFLogxK
M3 - Conference contribution
C2 - 18229723
AN - SCOPUS:40549140499
SN - 9812776087
SN - 9789812776082
T3 - Pacific Symposium on Biocomputing 2008, PSB 2008
SP - 652
EP - 663
BT - Pacific Symposium on Biocomputing 2008, PSB 2008
Y2 - 4 January 2008 through 8 January 2008
ER -