Corpus:
- file: uk-statutecorpus_passages_2020-2024_v1.0.jsonl.gz
- passages: 124796
- years: 2020–2024

Evaluation dataset (Eval100):
- file: uk-statutecorpus_eval100_2020-2024_v1.0.jsonl
- queries: 100
- judgements: 300
- rel distribution: {3: 100, 2: 100, 1: 100}

Distillation:
- file: uk-statutecorpus_distill_voyage-rerank-2.5_teacher-score_5221_v1.0.jsonl
- examples: 5221
- teacher_score min: 0.0001382519694743678
- teacher_score max: 0.9999208450317383
- teacher_score mean: 0.5798115107355502
- teacher_score std: 0.3913850324613577
