EQA-PMR-large

我要开发同款
匿名用户2024年07月31日
34阅读
所属分类ai、roberta、Pytorch、问答、NeurIPS 2023、QA、Alibaba、F1、nlp
开源地址https://modelscope.cn/models/iic/EQA-PMR-large
授权协议Apache License 2.0

作品详情

EQA-PMR-large

EQA-PMR-large is initialized with PMR-large and further fine-tuned on 6 Extractive Question Answering (EQA) training data from training split of MRQA.

The model performance on the in-dev sets are:

SQuAD NewsQA HotpotQA NaturalQuestions TriviaQA SearchQA
RoBERTa-large (single-task model) 94.2 73.8 81.6 83.3 85.1 85.7
PMR-large (single-task model) 94.5 74.0 83.6 83.8 85.1 88.3
EQA-PMR-large (multi-task model) 94.2 73.7 66.9 82.3 85.4 88.7

Note that the performance of RoBERTa-large and PMR-large are single-task fine-tuning, while EQA-PMR-large is a multi-task fine-tuned model. As it is fine-tuned on multiple datasets, we believe that EQA-PMR-large has a better generalization capability to other EQA tasks than PMR-large and RoBERTa-large.

How to use

You can try the codes from this repo for both training and inference.

BibTeX entry and citation info

@article{xu2022clozing,
  title={From Clozing to Comprehending: Retrofitting Pre-trained Language Model to Pre-trained Machine Reader},
  author={Xu, Weiwen and Li, Xin and Zhang, Wenxuan and Zhou, Meng and Bing, Lidong and Lam, Wai and Si, Luo},
  journal={arXiv preprint arXiv:2212.04755},
  year={2022}
}
声明:本文仅代表作者观点,不代表本站立场。如果侵犯到您的合法权益,请联系我们删除侵权资源!如果遇到资源链接失效,请您通过评论或工单的方式通知管理员。未经允许,不得转载,本站所有资源文章禁止商业使用运营!
下载安装【程序员客栈】APP
实时对接需求、及时收发消息、丰富的开放项目需求、随时随地查看项目状态

评论