Skip to content
Change the repository type filter

All

    Repositories list

    • Official repository for FactMM-RAG: Fact-Aware Multimodal Retrieval Augmentation for Accurate Medical Radiology Report Generation [NAACL 2025]
      Python
      MIT License
      0101Updated Mar 2, 2025Mar 2, 2025
    • Craw4LLM

      Public
      Official repository for "Craw4LLM: Efficient Web Crawling for LLM Pretraining"
      Python
      MIT License
      5055100Updated Feb 24, 2025Feb 24, 2025
    • Official repository for Montessori-Instruct: Generate Influential Training Data Tailored for Student Learning [ICLR 2025]
      Python
      MIT License
      34100Updated Jan 24, 2025Jan 24, 2025
    • RAGViz

      Public
      Official repository for RAGViz: Diagnose and Visualize Retrieval-Augmented Generation [EMNLP 2024]
      TypeScript
      MIT License
      117910Updated Jan 18, 2025Jan 18, 2025
    • Interpret and control dense embedding via sparse autoencoder.
      Python
      MIT License
      0300Updated Dec 31, 2024Dec 31, 2024
    • MATES

      Public
      Official repository for MATES: Model-Aware Data Selection for Efficient Pretraining with Data Influence Models [NeurIPS 2024]
      Python
      MIT License
      76030Updated Nov 14, 2024Nov 14, 2024
    • esae

      Public
      Python
      0000Updated Oct 29, 2024Oct 29, 2024
    • Python
      0100Updated Oct 23, 2024Oct 23, 2024
    • Python
      1600Updated Aug 23, 2024Aug 23, 2024
    • Python
      0300Updated Jun 20, 2024Jun 20, 2024