Skip to content

KaguraRuri/MedEthicEval

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 

Repository files navigation

MedEthicEval Evaluation Benchmark

This repository contains a benchmark for evaluating large language models (LLMs) on their ability to handle medical ethics tasks. The primary focus of the benchmark is to assess LLMs' performance in the domains of medical knowledge and ethical decision-making.

Repository Structure

The repository currently contains the following folder:

  • dataset/
    This folder contains the datasets used for evaluating LLMs on medical ethics tasks. It includes data for various tasks such as:
    • Knowledge Evaluation: Assessing the model's grasp of medical ethics knowledge.
    • Detecting Violations: Evaluating the model's ability to identify violations of medical ethics.
    • Priority Dilemma: Testing the model’s decision-making in ethically charged dilemmas with clear priorities.
    • Equilibrium Dilemma: Evaluating how well the model handles ethically neutral or balanced dilemmas.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published