1 change: 1 addition & 0 deletions corpora_and_data_resources.rst
@@ -176,6 +176,7 @@ Natural Language Inference (NLI)

* `Hebrew Paraphrase Dataset <https://github.com/NNLP-IL/Hebrew-Paraphrase-Dataset>`_ {`CC BY 4.0 <https://creativecommons.org/licenses/by/4.0/>`_} - A high-quality paraphrase dataset in Hebrew, consisting of 9785 instances. The dataset includes both paragraph-level (75%) and sentence-level (25%) paraphrases generated with the help of a large language model. Among these, 300 instances have been manually validated as gold standard examples.

* `LCHAIM Dataset <https://huggingface.co/datasets/HebArabNlpProject/LCHAIM>`_ {`CC BY 4.0 <https://creativecommons.org/licenses/by/4.0/>`_} - A long-context, multi-premise NLI dataset, translated and validated from CoNTRoL, consisting of 8,325 pairs in Hebrew. Published at ACL 2025.

Paraphrase Detection and Generation
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
2 changes: 2 additions & 0 deletions models_tools_services.rst
@@ -348,6 +348,8 @@ Fine-Tuned Language Models

* `Universal Language Model Fine-tuning for Text Classification (ULMFiT) in Hebrew <https://github.com/hanan9m/hebrew_ULMFiT?fbclid=IwAR0wJkoxmaCmhuZnSVOLBo1Mo362v6-66PmXutOr9FhhoItIHoqG_2MzV8E>`_ - The weights (i.e. a trained model) for a Hebrew version of Howard and Ruder's ULMFiT model. Trained on the Hebrew Wikipedia corpus.

* `LongHero-LCHAIM <https://huggingface.co/oriel9p/LongHero-FT-HebNLI-LCHAIM>`_ - A LongHero model fine-tuned on HebNLI and then on the LCHAIM long-context NLI dataset.

Multilingual Models
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
