Exporters¶
Output format writers. Each writes one JSONL file into the output directory.
curatorkit.exporters ¶
AlpacaExporter ¶
CorpusExporter ¶
DPOExporter ¶
Bases: BaseExporter
Export preference data in TRL DPO format.
Only samples with task_type "preference" or "implicit_preference" are written. All others are skipped (not rejected — skipping is intentional when exporting a multi-task pipeline subset).
GRPOExporter ¶
Bases: BaseExporter
Export to GRPO group rollout format.
Uses DataSample.responses and DataSample.reward_scores if populated. Falls back to empty arrays when no rollouts have been generated.