Ruiz-Cavero, G. (G.)

Search Results

  • Unsupervised ensemble learning for genome sequencing
    (2022) Villalvilla-Ornat, P. (P.); Pages-Zamora, A. (A.); Ochoa-Álvarez, I. (Idoia); Ruiz-Cavero, G. (G.)
    Unsupervised ensemble learning refers to methods devised for a particular task that combine data provided by decision learners, taking into account their reliability, which is usually inferred from the data. Here, the variant calling step of next-generation sequencing technologies is formulated as an unsupervised ensemble classification problem. A variant calling algorithm based on the expectation-maximization algorithm is further proposed that estimates the maximum-a-posteriori decision among a number of classes larger than the number of different labels provided by the learners. Experimental results with real human DNA sequencing data show that the proposed algorithm is competitive with state-of-the-art variant callers such as GATK, HTSLIB, and Platypus. © 2022 The Author(s). Published by Elsevier Ltd. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
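
To make the idea of unsupervised ensemble classification concrete, the following is a minimal sketch of a generic expectation-maximization scheme in the style of Dawid and Skene. It is not the algorithm proposed in the paper: it assumes the true classes and the learners' labels share the same label set, whereas the abstract describes a setting with more classes than labels. The function names `em_ensemble` and `map_decisions`, the `smoothing` parameter, and the toy data are all illustrative assumptions.

```python
import math


def em_ensemble(labels, n_classes, n_iter=50, smoothing=0.01):
    """Generic EM for combining learners of unknown reliability.

    labels[i][j] is the integer label (0..n_classes-1) that learner j
    assigned to item i. Returns (posteriors, priors, confusions).
    Assumption: classes and labels share one label set.
    """
    n_items = len(labels)
    n_learners = len(labels[0])

    # Initialise the posteriors with a smoothed (soft) majority vote.
    post = []
    for row in labels:
        counts = [smoothing] * n_classes
        for lab in row:
            counts[lab] += 1.0
        total = sum(counts)
        post.append([c / total for c in counts])

    priors, conf = [], []
    for _ in range(n_iter):
        # M-step: smoothed class priors.
        denom = n_items + n_classes * smoothing
        priors = [
            (sum(p[k] for p in post) + smoothing) / denom
            for k in range(n_classes)
        ]
        # M-step: one confusion matrix per learner; conf[j][k][l] estimates
        # P(learner j says l | true class k), i.e. the learner's reliability.
        conf = []
        for j in range(n_learners):
            mat = [[smoothing] * n_classes for _ in range(n_classes)]
            for i in range(n_items):
                for k in range(n_classes):
                    mat[k][labels[i][j]] += post[i][k]
            conf.append([[v / sum(r) for v in r] for r in mat])

        # E-step: posterior over the true class, computed in the log domain.
        post = []
        for row in labels:
            logp = [
                math.log(priors[k])
                + sum(math.log(conf[j][k][row[j]]) for j in range(n_learners))
                for k in range(n_classes)
            ]
            m = max(logp)
            w = [math.exp(v - m) for v in logp]
            s = sum(w)
            post.append([v / s for v in w])
    return post, priors, conf


def map_decisions(post):
    """Maximum-a-posteriori class for each item."""
    return [max(range(len(p)), key=p.__getitem__) for p in post]


# Toy data (made up): learners 0 and 1 always agree, learner 2 is erratic.
toy_labels = [[0, 0, 0], [1, 1, 1], [0, 0, 1], [1, 1, 0], [0, 0, 0], [1, 1, 1]]
toy_post, toy_priors, toy_conf = em_ensemble(toy_labels, n_classes=2)
toy_map = map_decisions(toy_post)
```

In this sketch the confusion matrices play the role of the learners' reliabilities, which are estimated from the data alone, and the final decision for each item is the class with the largest posterior probability.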