Text data for fine-tuning.

A free, open dataset of clean text for training language models. Community-rated and filtered.

1. Upload

Share text, markdown, or HTML documents. We're looking for useful, non-harmful content.

2. Rate

Vote on documents. Help filter out low-quality or antisocial content.

3. Download

Get the curated dataset filtered by community ratings.