Full metadata record
DC FieldValueLanguage
dc.creatorKrallinger, M. (Martin)-
dc.creatorRabal, O. (Obdulia)-
dc.creatorLeitner, F. (Florian)-
dc.creatorVazquez, M. (Miguel)-
dc.creatorSalgado, D. (David)-
dc.creatorLu, Z. (Zhiyong)-
dc.creatorLeaman, R. (Robert)-
dc.creatorLu, Y. (Yanan)-
dc.creatorJi, D. (Donghong)-
dc.creatorLowe, D.M. (Daniel M.)-
dc.creatorSayle, R.A. (Roger A.)-
dc.creatorBatista-Navarro, R.T. (Riza Theresa)-
dc.creatorRak, R. (Rafal)-
dc.creatorHuber, T. (Torsten)-
dc.creatorRocktäschel, T. (Tim)-
dc.creatorMatos, S. (Sérgio)-
dc.creatorCampos, D. (David)-
dc.creatorTang, B. (Buzhou)-
dc.creatorXu, H. (Hua)-
dc.creatorMunkhdalai, T. (Tsendsuren)-
dc.creatorRyu, K.H. (Keun Ho)-
dc.creatorRamanan, S.V. (S.V.)-
dc.creatorNathan, S. (Senthil)-
dc.creatorZitnik, S. (Slavko)-
dc.creatorBajec, M. (Marko)-
dc.creatorWeber, L. (Lutz)-
dc.creatorIrmer, M. (Matthias)-
dc.creatorAkhondi, S.A. (Saber A.)-
dc.creatorKors, J.A. (Jan A.)-
dc.creatorXu, S. (Shuo)-
dc.creatorAn, X. (Xin)-
dc.creatorSikdar, U.K. (Utpal Kumar)-
dc.creatorEkbal, A. (Asif)-
dc.creatorYoshioka, M. (Masaharu)-
dc.creatorDieb, T.M. (Thaer M.)-
dc.creatorChoi, M. (Miji)-
dc.creatorVerspoor, K. (Karin)-
dc.creatorKhabsa, M. (Madian)-
dc.creatorGiles, C.L. (C. Lee)-
dc.creatorLiu, H. (Hongfang)-
dc.creatorRavikumar, K.E. (Komandur Elayavilli)-
dc.creatorLamurias, A. (Andre)-
dc.creatorCouto, F.M. (Francisco M.)-
dc.creatorDai, H.J (Hong-Jie)-
dc.creatorTzong-Han-Tsai, R. (Richard)-
dc.creatorAta, C. (Caglar)-
dc.creatorCan, T. (Tolga)-
dc.creatorUsié, A. (Anabel)-
dc.creatorAlves, R. (Rui)-
dc.creatorSegura-Bedmar, I. (Isabel)-
dc.creatorMartínez, P. (Paloma)-
dc.creatorOyarzabal, J. (Julen)-
dc.creatorValencia, A. (Alfonso)-
dc.date.accessioned2015-05-08T11:40:48Z-
dc.date.available2015-05-08T11:40:48Z-
dc.date.issued2015-
dc.identifier.citationKrallinger M, Rabal O, Leitner F, Vazquez M, Salgado D, Lu Z, et al. The CHEMDNER corpus of chemicals and drugs and its annotation principles. J Cheminform. 2015;7(1)es_ES
dc.identifier.issn1758-2946-
dc.identifier.urihttps://hdl.handle.net/10171/38261-
dc.description.abstractThe automatic extraction of chemical information from text requires the recognition of chemical entity mentions as one of its key steps. When developing supervised named entity recognition (NER) systems, the availability of a large, manually annotated text corpus is desirable. Furthermore, large corpora permit the robust evaluation and comparison of different approaches that detect chemicals in documents. We present the CHEMDNER corpus, a collection of 10,000 PubMed abstracts that contain a total of 84,355 chemical entity mentions labeled manually by expert chemistry literature curators, following annotation guidelines specifically defined for this task. The abstracts of the CHEMDNER corpus were selected to be representative for all major chemical disciplines. Each of the chemical entity mentions was manually labeled according to its structure-associated chemical entity mention (SACEM) class: abbreviation, family, formula, identifier, multiple, systematic and trivial. The difficulty and consistency of tagging chemicals in text was measured using an agreement study between annotators, obtaining a percentage agreement of 91. For a subset of the CHEMDNER corpus (the test set of 3,000 abstracts) we provide not only the Gold Standard manual annotations, but also mentions automatically detected by the 26 teams that participated in the BioCreative IV CHEMDNER chemical mention recognition task. In addition, we release the CHEMDNER silver standard corpus of automatically extracted mentions from 17,000 randomly selected PubMed abstracts. A version of the CHEMDNER corpus in the BioC format has been generated as well. We propose a standard for required minimum information about entity annotations for the construction of domain specific corpora on chemical and drug entities. The CHEMDNER corpus and annotation guidelines are available at: http://www.biocreative.org/resources/biocreative-iv/chemdner-corpus/es_ES
dc.language.isoenges_ES
dc.publisherChemistry Centrales_ES
dc.relationinfo:eu-repo/grantAgreement/EC/FP7/115002;222886-
dc.rightsinfo:eu-repo/semantics/openAccesses_ES
dc.subjectNamed entity recognitiones_ES
dc.subjectBioCreativees_ES
dc.subjectText mininges_ES
dc.subjectChemical entity recognitiones_ES
dc.subjectMachine learninges_ES
dc.subjectChemical indexinges_ES
dc.subjectChemNLPes_ES
dc.titleThe CHEMDNER corpus of chemicals and drugs and its annotation principleses_ES
dc.typeinfo:eu-repo/semantics/articlees_ES
dc.identifier.doihttp://dx.doi.org/10.1186/1758-2946-7-S1-S2es_ES

Files in This Item:
Thumbnail
File
1758-2946-7-S1-S2.pdf
Description
Size
2.35 MB
Format
Adobe PDF


Statistics and impact
0 citas en
0 citas en

Items in Dadun are protected by copyright, with all rights reserved, unless otherwise indicated.