@hitz-zentroa

HiTZ zentroa

HiTZ is a reference center on Language Technologies. Its aim is to promote research, training, technological transfer and innovation in Artificial Intelligence.

Pinned

  1. lm-contamination Public

    The LM Contamination Index is a manually created database of contamination evidence for LMs.

    Python · 75 stars · 4 forks

Repositories

Showing 10 of 15 repositories
  • arena Public

    Arena for evaluating Large Language Models

    0 stars · 0 forks · 0 issues · 0 pull requests · Updated Nov 7, 2024
  • accelerate Public Forked from huggingface/accelerate

    🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support

    Python · 0 stars · Apache-2.0 · 977 forks · 0 issues · 0 pull requests · Updated Nov 1, 2024
  • eval-MCG-COLING-2025
    Jupyter Notebook · 0 stars · Apache-2.0 · 0 forks · 0 issues · 0 pull requests · Updated Oct 31, 2024
  • GoLLIE Public

    Guideline-following Large Language Model for Information Extraction

    Python · 311 stars · Apache-2.0 · 21 forks · 2 issues · 1 pull request · Updated Oct 27, 2024
  • cn-eval Public
    Jupyter Notebook · 1 star · Apache-2.0 · 0 forks · 0 issues · 0 pull requests · Updated Oct 16, 2024
  • critical_questions_generation
    Python · 0 stars · 0 forks · 0 issues · 0 pull requests · Updated Oct 10, 2024
  • ses-lemma Public

    Evaluating Shortest Edit Script Methods for Contextual Lemmatization

    Python · 0 stars · Apache-2.0 · 0 forks · 0 issues · 0 pull requests · Updated Jul 18, 2024
  • xnli-eu Public

    XNLIeu: a dataset for cross-lingual NLI in Basque

    Jupyter Notebook · 0 stars · 0 forks · 0 issues · 0 pull requests · Updated Jul 1, 2024
  • latxa Public

    Latxa: An Open Language Model and Evaluation Suite for Basque

    Shell · 25 stars · MIT · 0 forks · 1 issue · 0 pull requests · Updated Jun 11, 2024
  • This-is-not-a-Dataset Public

    We introduce a large, semi-automatically generated dataset of ~400,000 descriptive sentences about commonsense knowledge that can be true or false. Negation is present, in different forms, in about 2/3 of the corpus, and we use the dataset to evaluate LLMs.

    Python · 11 stars · Apache-2.0 · 1 fork · 0 issues · 0 pull requests · Updated May 13, 2024
