Abstract
In this paper, we present a fully automated extraction system, named IntEx, to identify gene and protein interactions in biomedical text. Our approach is based on first splitting complex sentences into simple clausal structures made up of syntactic roles. Then, tagging biological entities with the help of biomedical and linguistic ontologies. Finally, extracting complete interactions by analyzing the matching contents of syntactic roles and their linguistically significant combinations. Our extraction system handles complex sentences and extracts multiple and nested interactions specified in a sentence. Experimental evaluations with two other state of the art extraction systems indicate that the IntEx system achieves better performance without the labor intensive pattern engineering requirement.
Original language | English (US) |
---|---|
Pages | 54-61 |
Number of pages | 8 |
State | Published - 2005 |
Event | 2005 ACL-ISMB Workshop on Linking Biological Literature, Ontologies and Databases: Mining Biological Semantics, ACL-ISMB 2005 - Detroit, United States Duration: Jun 24 2005 → … |
Conference
Conference | 2005 ACL-ISMB Workshop on Linking Biological Literature, Ontologies and Databases: Mining Biological Semantics, ACL-ISMB 2005 |
---|---|
Country/Territory | United States |
City | Detroit |
Period | 6/24/05 → … |
ASJC Scopus subject areas
- Biochemistry, Genetics and Molecular Biology(all)
- Artificial Intelligence
- Information Systems