Skip to Content
No preview available

Actions

Download Analytics Citations

Export to: EndNote  |  Zotero  |  Mendeley

Collections

This file is in the following collections:

MuLD: The Multitask Long Document Benchmark [dataset]

Hotpot Annotated - valid [dataset] Open Access

MuLD (Multitask Long Document Benchmark) is a set of 6 NLP tasks where the inputs consist of at least 10,000 words. The benchmark covers a wide variety of task types including translation, summarization, question answering, and classification. Additionally there is a range of output lengths from a single word classification label all the way up to an output longer than the input text.

Descriptions

Resource type
Dataset
Contributors
Data collector: Hudson, G Thomas 1
1 Durham Univesity
Funder
Research methods
Other description
Keyword
nlp
multitask
long document
Subject
Location
Language
Cited in
Identifier
ark:/32150/r21j92g751c
Rights
MIT Licence (MIT)

Publisher
Durham University
Date Created

File Details

Depositor
G.T. Hudson
Date Uploaded
Date Modified
3 May 2022, 13:05:22
Audit Status
Audits have not yet been run on this file.
Related Files
hotpot_annotated_train.json
narrativeqa_train.json.bz2
vlsp_test.json.bz2
style_change_validation.json.bz2
narrativeqa_test.json.bz2
opensubtitles_train.json.bz2
style_change_train.json.bz2
narrativeqa_validation.json.bz2
character_id_test.json.bz2
style_change_test.json.bz2
character_id_validation.json.bz2
opensubtitles_test.json.bz2
character_id_train.json.bz2
Characterization
File format: plain (Plain text)
Mime type: text/plain
File size: 699516701
Filename: hotpot_annotated_valid.json
Original checksum: 6ef753c124dd3bf6b910d74f9e107e7a
Well formed: true
Valid: true
Character set: US-ASCII
Activity of users you follow
User Activity Date