The MultiNLI Dataset

Official website: https://cims.nyu.edu/~sbowman/multinli/

The Multi-Genre Natural Language Inference (MultiNLI) corpus is a crowd-sourced collection of 433k sentence pairs annotated with textual entailment information. The corpus is modeled on the SNLI corpus, but differs in that it covers a range of genres of spoken and written text and supports a distinctive cross-genre generalization evaluation. The corpus served as the basis for the shared task of the RepEval 2017 Workshop at EMNLP in Copenhagen.

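You can download the dataset from the official website. I have also shared a copy on Baidu Netdisk: follow my WeChat official account and reply "2020081302" to get the download link.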
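Once the archive is downloaded, a quick sanity check is to count the labels per genre. Below is a minimal sketch, not an official loader. It assumes the JSON Lines files from the distribution (for example a file named multinli_1.0_train.jsonl, which is my assumption about the file name), with the fields "gold_label", "sentence1", "sentence2" and "genre", and it assumes that a gold label of "-" marks a pair without annotator consensus. The function names are my own.

```python
import json
from collections import Counter


def read_pairs(lines):
    """Yield (genre, premise, hypothesis, label) tuples from JSON Lines text.

    `lines` can be an open file handle or any iterable of strings.
    """
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        record = json.loads(line)
        label = record.get("gold_label")
        # Assumption: "-" means the annotators did not reach a consensus.
        if label is None or label == "-":
            continue
        genre = record.get("genre", "unknown")
        yield genre, record["sentence1"], record["sentence2"], label


def label_counts_by_genre(lines):
    """Count sentence pairs for each (genre, label) combination."""
    counts = Counter()
    for genre, _premise, _hypothesis, label in read_pairs(lines):
        counts[(genre, label)] += 1
    return counts
```

To use it, open one of the .jsonl files with encoding="utf-8" and pass the file handle to label_counts_by_genre. Comparing the genres in the training file with those in the two development files is an easy way to see what the cross-genre evaluation is about.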

Whenever I have time, I try to write articles to share and exchange ideas with everyone.


**** blog address: https://blog.****.net/ispeasant
