Assessing Adversarial Effects of Noise in Missing Data Imputation

Codebase for the conference paper "Assessing Adversarial Effects of Noise in Missing Data Imputation", accepted and presented at the 34th Brazilian Conference on Intelligent Systems (BRACIS).

Paper Details

Authors: Arthur Dantas Mangussi, Ricardo Cardoso Pereira, Pedro Henriques Abreu, and Ana Carolina Lorena
Abstract: In real-world scenarios, a wide variety of datasets contain inconsistencies. One example of such inconsistency is missing data (MD), which refers to the absence of information in one or more variables. Missing imputation strategies emerged as a possible solution for addressing this problem, which can replace the missing values based on mean, median, or Machine Learning (ML) techniques. The performance of such strategies depends on multiple factors. One factor that influences the missing value imputation (MVI) methods is the presence of noisy instances, described as anything that obscures the relationship between the features of an instance and its class, having an adversarial effect. However, the interaction between MD and noisy instances has received little attention in the literature. This work fills this gap by investigating missing and noisy data interplay. Our experimental setup begins with generating missingness under the Missing Not at Random (MNAR) mechanism in a multivariate scenario and performing imputation using seven state-of-the-art MVI methods. Our methodology involves applying a noise filter before performing the imputation task and evaluating the quality of the imputation directly. Additionally, we measure the classification performance with the new estimates. This approach is applied to both synthetic data and 11 real-world datasets. The effects of noise filtering before imputation are evaluated. The results show that noise preprocessing before the imputation task improves the imputation quality and the classification performance for imputed datasets.
Year: 2024
Published in: Will be available as soon as the conference proceedings are published.
DOI: Will be available as soon as the conference proceedings are published.
Contact: [email protected]
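
To make the evaluation idea in the abstract concrete, here is a minimal sketch of one possible reading of it: values are removed under an MNAR-style rule (the largest values of a feature go missing), the gaps are filled with a simple mean imputer, and imputation quality is measured as the mean absolute error on the removed entries. The function names (mnar_mask, mean_impute, mae), the "drop the largest values" rule, and the mean imputer are all assumptions made for illustration. They are not taken from this repository, and the noise-filtering step is left out because the abstract does not name the filter used.

```python
from statistics import mean


def mnar_mask(values, rate):
    """Hypothetical MNAR rule: hide the largest `rate` fraction of values.

    Missingness depends on the unobserved value itself, which is what
    makes the mechanism "not at random".
    """
    n_missing = int(len(values) * rate)
    # Indices ordered from the largest value to the smallest.
    order = sorted(range(len(values)), key=lambda i: values[i], reverse=True)
    missing = set(order[:n_missing])
    return [None if i in missing else v for i, v in enumerate(values)]


def mean_impute(column):
    """Replace every missing entry (None) with the mean of the observed ones."""
    observed = [v for v in column if v is not None]
    fill = mean(observed)
    return [fill if v is None else v for v in column]


def mae(truth, imputed, masked):
    """Mean absolute error computed only on the entries that were hidden."""
    hidden = [i for i, v in enumerate(masked) if v is None]
    return sum(abs(truth[i] - imputed[i]) for i in hidden) / len(hidden)


# Toy feature column (illustrative data, not from the paper's datasets).
truth = [1.0, 2.0, 3.0, 4.0, 10.0]
masked = mnar_mask(truth, rate=0.2)   # hides the single largest value
imputed = mean_impute(masked)         # fills it with the observed mean
print(mae(truth, imputed, masked))    # 7.5
```

Under MNAR the observed mean is biased towards the values that remain, which is why the error on the hidden entry is large in this toy case.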

Paper and Presentation

The original paper can be accessed here
The PDF presentation is available here

Dependencies

You'll need a working Python environment to run the code. The required dependencies are specified in the file requirements.txt.

You can install all required dependencies by running:

pip install -r requirements.txt

Citation

If you use this work, please cite:

BibTeX entry:

Will be available as soon as the conference proceedings are published.

Acknowledgements

The authors gratefully acknowledge the Brazilian funding agency FAPESP (Fundação de Amparo à Pesquisa do Estado de São Paulo) under grants 2022/10553-6, 2023/13688-2, and 2021/06870-3. Moreover, this research was supported by the Portuguese Recovery and Resilience Plan (PRR) through project C645008882-00000055 Center for Responsible AI.

