Skip to Content
× You are about to create a new metadata only record. This record does not auto assign a DOI. To allocate a new DOI use the 'Upload data and allocate DOI' option.

MuLD: The Multitask Long Document Benchmark [dataset]

MuLD (Multitask Long Document Benchmark) is a set of 6 NLP tasks where the inputs consist of at least 10,000 words. The benchmark covers a wide variety of task types including translation, summarization, question answering, and classification. Additionally there is a range of output lengths from a single word classification label all the way up to an output longer than the input text.

Descriptions

Collection icon

Actions

Items in this Collection

Sort the listing of items    
List of items in this collection
  Title Date Uploaded Visibility Action
  26 April 2022 Open Access
File Name:
style_change_test.json.bz2
File Format:
x-bzip2 (bzip2 compressed data, block size = 900k, BZ2, Bzip2)
Creator:
Depositor:
G.T. Hudson
Edit Access:
Users: mjxs37
  26 April 2022 Open Access
File Name:
opensubtitles_test.json.bz2
File Format:
x-bzip2 (bzip2 compressed data, block size = 900k, BZ2, Bzip2)
Creator:
Depositor:
G.T. Hudson
Edit Access:
Users: mjxs37
  26 April 2022 Open Access
File Name:
character_id_validation.json.bz2
File Format:
x-bzip2 (bzip2 compressed data, block size = 900k, BZ2, Bzip2)
Creator:
Depositor:
G.T. Hudson
Edit Access:
Users: mjxs37
  26 April 2022 Open Access
File Name:
character_id_train.json.bz2
File Format:
x-bzip2 (bzip2 compressed data, block size = 900k, BZ2, Bzip2)
Creator:
Depositor:
G.T. Hudson
Edit Access:
Users: mjxs37