Robust active learning with binary responses
dadun.citation.endingPage | 14 | es_ES |
dadun.citation.number | 220 | es_ES |
dadun.citation.publicationName | Journal of Statistical Planning and Inference | es_ES |
dadun.citation.startingPage | 1 | es_ES |
dc.contributor.author | López-Fidalgo, J. (Jesús) | |
dc.contributor.author | Wiens, D.P. (Douglas P.) | |
dc.date.accessioned | 2022-08-04T12:49:59Z | |
dc.date.available | 2022-08-04T12:49:59Z | |
dc.date.issued | 2022 | |
dc.description.abstract | We introduce a method of Robust Learning (‘robl’) for binary data, and propose its use in situations where Active Learning is appropriate, and where sampling the predictors is easy and cheap, but learning the responses is hard and expensive. We seek robustness against both modelling errors and the mislabelling of the binary responses. Thus we aim to sample effectively from the population of predictors, and learn the responses only for an ‘influential’ sub-population. This is carried out by probability weighted sampling, for which we derive optimal ‘unbiased’ sampling weights, and weighted likelihood estimation, for which we also derive optimal estimation weights. The robustness issues can lead to biased estimates and classifiers; it is somewhat remarkable that our weights eliminate the mean of the bias – which is a random variable as a result of the sampling – due to both types of errors mentioned above. These weights are then tailored to minimize the mean squared error of the predicted values. Simulation studies indicate that when bias is of significant concern, robl allows for substantial reductions, relative to Passive Learning, in the prediction errors. The methods are then illustrated in real-data analyses. | es_ES |
dc.description.note | This is an open access article under the CC BY-NC-ND license | es_ES |
dc.description.sponsorship | Jesús López–Fidalgo is sponsored by Ministerio de Economía y Competitividad, Spain MTM2016-80539-C2-1-R and by Program Salvador de Madariaga PRX18/00339. The research of Douglas P. Wiens is supported by the Natural Sciences and Engineering Council of Canada. The authors thank Linglong Kong, José Moler, Haiying Wang and Zhichun Zhai for helpful comments. | es_ES |
dc.identifier.citation | López-Fidalgo, J. (Jesús); Wiens, D.P. (Douglas P.). "Robust active learning with binary responses". Journal of Statistical Planning and Inference. (220), 2022, 1 - 14 | es |
dc.identifier.doi | 10.1016/j.jspi.2022.01.004 | |
dc.identifier.issn | 0378-3758 | |
dc.identifier.uri | https://hdl.handle.net/10171/63880 | |
dc.language.iso | eng | es_ES |
dc.publisher | Elsevier | es_ES |
dc.relation.center | Escuela de Ingeniería - TECNUN | |
dc.relation.department | Departamento de Organización industrial | |
dc.rights | info:eu-repo/semantics/openAccess | es_ES |
dc.subject | Logistic regression | es_ES |
dc.subject | Loss function | es_ES |
dc.subject | Mislabelling | es_ES |
dc.subject | Optimal subsampling | es_ES |
dc.subject | Probit regression | es_ES |
dc.subject | Sequential sampling | es_ES |
dc.title | Robust active learning with binary responses | es_ES |
dc.type | info:eu-repo/semantics/article | es_ES |
dspace.entity.type | Publication | es |
relation.isAuthorOfPublication | 169ec035-21e6-475f-be38-215b04163b04 | |
relation.isAuthorOfPublication | e36bb69a-1713-4607-8d8a-1fde7b616e15 | |
relation.isAuthorOfPublication.latestForDiscovery | 169ec035-21e6-475f-be38-215b04163b04 |