NLHF

Note that this training code works for a small preference dataset from stanford, so try it out and run the training code if you feel interested.

Credit: @BojanFaletic, @Hong, Claude3 and GPT4

Name		Name	Last commit message	Last commit date
Latest commit History 24 Commits
NLHF_Lemma_1_proof.pdf		NLHF_Lemma_1_proof.pdf
NLHF_Lemma_2_proof.pdf		NLHF_Lemma_2_proof.pdf
README.md		README.md
nlhf.py		nlhf.py

Provide feedback