Constructing CIs for 'the' Generalization Error

This is the code to reproduce the experiments from the Paper: Constructing CIs for 'the' Generalization Error.

The content of this repository is:

./analysis/ contains the code to process the results and to reproduce all the figures.
./data/ contains some input data and is the folder where generated datasets are stored.
./datamodels/ contains the code related to the generation of the datasets.
./experiments contains the code for the main experiments to investigate the inference methods.
./figures is the folder where the figures are stored. Only the figures from the main paper are stored here, some additional l
./inferGE/ is the R package that implements confidence interval methods that are being compared. This is research code. If you want to use the well-performing inference methods in R use this repository: https://github.com/mlr-org/mlr3inferr.
./renv/ and renv.lock are for the reproducible R environment
./results/ contains the final results (such as figures, tables) included in the paper

Reproducibility

The instructions to reproduce the experiments are separated into:

the dataset generation, see ./datamodels/README.md. Note that the resulting datasets are also made available on OpenML, so the main experiments can be reproduced without this step. Note: The code and instructions to reproduce the results are still being cleaned up.
the main experiments, see ./experiments/README.md. Note: The code and instructions to reproduce the results are still being cleaned up.
To recreate the preprocessing and the figures from the paper you need to first download the additional material from zenodo: https://zenodo.org/records/13744382. Specifically, move the content of the results data into the ./results subdirectory of this repository.

Downloading the datasets

Because of the large size of the benchmark datasets, it is important to download them in parquet format. However, the download might still fail. In this case, simply retry until it works.

Extending the experiment

In order to evaluate new inference methods, the following steps need to be followed:

In case the inference method requires a new resampling method that is not yet implemented in mlr3, you need to implement a new Resampling class, e.g. by adding it to the inferGE R package in the folder with the same name. For an example, e.g. see ResamplingNestedCV.
Implement the inference method itself, e.g. in the inferGE packages. As an example see the infer_bates.R file which uses the resample result ccreated by ResamplingNestedCV.
Add the resampling method to the experiment definition from ./experiments/resample.
Add the inferece method to the definitions from ./experiments/ci.R
Run the resample experiment and then the CIs.

Don't hesitate to contact us if you want to reuse this code!

Converting the Files to another format

If you don't want to work with R but still work with the results via e.g. python, you can achieve this by:

Starting the R interpreter
Read in the relevant .rds file using readRDS(<path>)
Write the data e.g. to CSV using the write.csv function.

Name		Name	Last commit message	Last commit date
Latest commit History 317 Commits
analysis		analysis
attic		attic
data		data
datamodels		datamodels
experiments		experiments
figures		figures
inferGE		inferGE
renv		renv
results		results
.Rprofile		.Rprofile
.gitignore		.gitignore
.gitkeep		.gitkeep
.lintr		.lintr
CITATION.cff		CITATION.cff
Makefile		Makefile
README.md		README.md
batchtools.conf.R		batchtools.conf.R
environment.yml		environment.yml
paper_2023_ci_for_ge.Rproj		paper_2023_ci_for_ge.Rproj
renv.lock		renv.lock
slurm_wyoming.tmpl		slurm_wyoming.tmpl

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Constructing CIs for 'the' Generalization Error

Reproducibility

Downloading the datasets

Extending the experiment

Converting the Files to another format

About

Releases

Packages

Contributors 2

Languages

slds-lmu/paper_2023_ci_for_ge

Folders and files

Latest commit

History

Repository files navigation

Constructing CIs for 'the' Generalization Error

Reproducibility

Downloading the datasets

Extending the experiment

Converting the Files to another format

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages