Exploiting Graph Structures for Reproducibility Badge Prediction in Academic Papers
The number of publications is growing at an accelerated pace, leaving researchers with insufficient time to assess every paper that is published. However, by harnessing the power of machine learning and transfer learning, we can develop models that can predict the likelihood of reproducibility badges for these papers. The primary objective of the project is to construct graphs connecting different scholarly papers from the ACM digital library dataset and build predictive models using graph neural networks to constrast them with tabular machine learning models in estimating the likelihood of research papers receiving reproducibility badges.
--------------------------------------TBA
In an effort to build a consolidated repository of datasets pertaining to reproducibility of scholarly articles, we initiated reproducibility/datasets. The central idea here was to work towards studying "All things Reproducibility in Science". Data collection for this repository is included within the ACMBadges sub-directory, which is one of the data sources part of the comprehensive data collection efforts. Specifically, the fourth phase included the largest collection of Badged and Unbadged articles from ACM Digital Library.
This work is supported in part by NSF Grant No. 2022443.
If you find this work useful, please cite our paper:
TBA