DKPro Statistics is a collection of open-licensed statistical tools written in Java. The software library is divided into the following modules:
-
DKPro Agreement (dkpro-statistics-agreement) is a module for computing multiple inter-rater agreement measures using a shared interface and data model. Based on this model, the software allows for analyzing coding (i.e., assigning categories to fixed items) and unitizing studies (i.e., segmenting the data into codable units). The software has been described in our COLING 2014 demo paper (see below).
-
DKPro Correlation (dkpro-statistics-correlation) is a module for computing correlation and association measures. It is currently under construction.
-
DKPro Significance (dkpro-statistics-significance) is a module for assessing statistical significance. It is currently under construction.
The latest version of DKPro Statistics is available via Maven Central. If you use Maven as your build tool, then you can add DKPro Statistics as a dependency in your pom.xml file:
org.dkpro.statistics dkpro-statistics 2.1.0In addition to that, you can add each of the modules described above separately (e.g., artifactId dkpro-statistics-agreement).
DKPro Statistics is available as open source software under the Apache License 2.0 (ASL). The software thus comes "as is" without any warranty (see license text for more details).
A more detailed description of DKPro Statistics is available in our scientific articles:
- Christian M. Meyer, Margot Mieskes, Christian Stab, and Iryna Gurevych: DKPro Agreement: An Open-Source Java Library for Measuring Inter-Rater Agreement, in: Proceedings of the 25th International Conference on Computational Linguistics: System Demonstrations (COLING), p. 105–109, August 2014. Dublin, Ireland. http://www.aclweb.org/anthology/C/C14/C14-2023.pdf
Please cite our COLING 2014 paper if you use the software in your scientific work.
Prior to being available as open source software, DKPro Statistics has been a research project at the Ubiquitous Knowledge Processing (UKP) Lab of the Technische Universität Darmstadt, Germany. The following people have mainly contributed to this project (in alphabetical order):
- Richard Eckart de Castilho
- Iryna Gurevych
- Roland Kluge
- Christian M. Meyer
- Margot Mieskes
- Christian Stab
- Torsten Zesch