Skip to content

Legacy data layout

Latest

Choose a tag to compare

@carolisteia carolisteia released this 22 Jul 15:13
· 18 commits to main since this release

This release preserves the original dataset structure, with raw/, split/, and non_split/ directories, prior to restructuring.

It serves as a historical reference point before refocusing this repository exclusively on the De Regimine Principum data.
All multilingual training data will be moved to dedicated repositories for better separation of scope.