Sangam: A Confluence of Knowledge Streams

Sequence labeling with multiple annotators

Show simple item record

dc.contributor Massachusetts Institute of Technology. Department of Civil and Environmental Engineering
dc.contributor Pereira, Francisco
dc.contributor Pereira, Francisco C.
dc.creator Rodrigues, Filipe
dc.creator Ribeiro, Bernardete
dc.creator Pereira, Francisco C.
dc.date 2014-02-28T17:08:04Z
dc.date 2014-02-28T17:08:04Z
dc.date 2013-10
dc.date 2012-11
dc.date.accessioned 2023-03-01T18:06:02Z
dc.date.available 2023-03-01T18:06:02Z
dc.identifier 0885-6125
dc.identifier 1573-0565
dc.identifier http://hdl.handle.net/1721.1/85189
dc.identifier Rodrigues, Filipe, Francisco Pereira, and Bernardete Ribeiro. “Sequence Labeling with Multiple Annotators.” Mach Learn (October 4, 2013).
dc.identifier https://orcid.org/0000-0001-5457-9909
dc.identifier.uri http://localhost:8080/xmlui/handle/CUHPOERS/278745
dc.description The increasingly popular use of Crowdsourcing as a resource to obtain labeled data has been contributing to the wide awareness of the machine learning community to the problem of supervised learning from multiple annotators. Several approaches have been proposed to deal with this issue, but they disregard sequence labeling problems. However, these are very common, for example, among the Natural Language Processing and Bioinformatics communities. In this paper, we present a probabilistic approach for sequence labeling using Conditional Random Fields (CRF) for situations where label sequences from multiple annotators are available but there is no actual ground truth. The approach uses the Expectation-Maximization algorithm to jointly learn the CRF model parameters, the reliability of the annotators and the estimated ground truth. When it comes to performance, the proposed method (CRF-MA) significantly outperforms typical approaches such as majority voting.
dc.format application/pdf
dc.language en_US
dc.publisher Springer-Verlag
dc.relation http://dx.doi.org/10.1007/s10994-013-5411-2
dc.relation Machine Learning
dc.rights Article is made available in accordance with the publisher's policy and may be subject to US copyright law. Please refer to the publisher's site for terms of use.
dc.source Francisco Pereira
dc.title Sequence labeling with multiple annotators
dc.type Article
dc.type http://purl.org/eprint/type/JournalArticle


Files in this item

Files Size Format View
Pereira_Sequence Labeling.pdf 665.0Kb application/pdf View/Open

This item appears in the following Collection(s)

Show simple item record

Search DSpace


Advanced Search

Browse