MultiNLI数据集
官方网址:https://cims.nyu.edu/~sbowman/multinli/
The Multi-Genre Natural Language Inference (MultiNLI) corpus is a crowd-sourced collection of 433k sentence pairs annotated with textual entailment information. The corpus is modeled on the SNLI corpus, but differs in that covers a range of genres of spoken and written text, and supports a distinctive cross-genre generalization evaluation. The corpus served as the basis for the shared task of the RepEval 2017 Workshop at EMNLP in Copenhagen.
多体裁自然语言推理语料库是一个由433k个句子对组成的集合,这些句子对带有文本蕴涵信息。语料库以SNLI语料库为模型,但不同之处在于它涵盖了口语和书面语篇的一系列体裁,并支持独特的跨体裁泛化评价。语料库是哥本哈根EMNLP 2017年RepEval研讨会共同任务的基础。
大家可以到官网地址下载数据集,我自己也在百度网盘分享了一份。可关注本人公众号,回复“2020081302”获取下载链接。
只要自己有时间,都尽量写写文章,与大家交流分享。
本人公众号:
****博客地址:https://blog.****.net/ispeasant