Size of the test set (annotations.json) #3

qishenghu · 2024-04-15T15:10:41Z

Thanks for the good work.

According to the arxiv paper, there should be approximately 1,000 annotated response for FAVABENCH detection task. But seems like the 'annotations.json' file from this link (https://huggingface.co/datasets/fava-uw/fava-data/tree/main) contains only 460 records. Could you kindly help me understand which file might be the correct annotated response for FAVABENCH?

Thanks!

khunkin · 2024-04-18T08:53:07Z

The 'annotations.json' file from this link(submitted on Jan 15, 2024) should correspond to version 2(submitted on Jan 17, 2024) of the submission on arXiv, which states: "Our benchmark consists of about 400 responses of ChatGPT and Llama2-Chat 70B." And the version 3 (submitted on Feb 21, 2024) mentions: "...annotating approximately 1,000 responses of three widely used LMs." To date (Apr 18, 2024), the authors have not updated the data to the latest version. Hope that the authors will update the data.

Thanks!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Size of the test set (annotations.json) #3

Size of the test set (annotations.json) #3

qishenghu commented Apr 15, 2024

khunkin commented Apr 18, 2024

Size of the test set (annotations.json) #3

Size of the test set (annotations.json) #3

Comments

qishenghu commented Apr 15, 2024

khunkin commented Apr 18, 2024