Dengue Cases Nepal 2024 Data Extraction

This project is a Python script that extracts tabular data from a PDF file of Epidemiology and Disease Control Division Department of Health Services, Ministry of Health & Population (containing dengue cases data) and saves it to an Excel file. The extracted data can then be used for further analysis or reporting.

ℹ️ Visit the public data download platform to view dengue situation reports for Nepal 2024 on https://konishon.github.io/data-dengue-situation-report-nepal-2024/

Features

Extract tables from a PDF document using pdfplumber.
Convert the extracted tables into a pandas DataFrame.
Save the DataFrame to an Excel file with a name derived from the original PDF file.

Installation

To install the required packages, follow these steps:

Clone the repository or download the script files.
Install the necessary Python dependencies using pip:

pip install -r requirements.txt

Usage

Generating CSVs from PDFs

python dengue_cases_nepal_data_extraction.py "data/PDFs/extracted/67049d6319129-2.pdf" --auto-clean

This will generate an CSV file named dengue_cases_nepal_extracted_tables.CSV

Generatic Static Public Data Download Page

python generate_html.py data/CSVs/66f3af5ec20b2-2_extracted_tables.csv

This will generate the public data download page avalaible at https://konishon.github.io/data-dengue-situation-report-nepal-2024/

Error Handling

If the PDF file doesn't exist, the script will raise a FileNotFoundError.
If no tables are found in the PDF, the script will raise a ValueError.

License

This project is licensed under the MIT License. Feel free to use and modify it according to your needs.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
data		data
docs		docs
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
dengue_cases_nepal_data_extraction.py		dengue_cases_nepal_data_extraction.py
generate_html.py		generate_html.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Dengue Cases Nepal 2024 Data Extraction

Features

Installation

Usage

Generating CSVs from PDFs

Generatic Static Public Data Download Page

Error Handling

License

About

Releases 2

Packages

Languages

License

konishon/data-dengue-situation-report-nepal-2024

Folders and files

Latest commit

History

Repository files navigation

Dengue Cases Nepal 2024 Data Extraction

Features

Installation

Usage

Generating CSVs from PDFs

Generatic Static Public Data Download Page

Error Handling

License

About

Resources

License

Stars

Watchers

Forks

Releases 2

Packages 0

Languages

Packages