Skip to content

Latest commit

 

History

History
164 lines (123 loc) · 7.63 KB

README.md

File metadata and controls

164 lines (123 loc) · 7.63 KB

gerrytools

CircleCI codecov PyPI version docs website Code style: black Imports: isort

A companion to GerryChain, GerryTools is a robust suite of geometric and algorithmic tools to analyze districting plans and related data. GerryTools is actively developed and used by the MGGG Redistricting Lab and our collaborators to prepare accurate, precise, and clean information for our projects. It is distributed under a 3-Clause BSD License.

Installation

⚠️ WARNING: The latest release of this package requires that the user have Rust installed on their computer. This is due to a bug in one of the dependencies of GerryTools and only appears for users running Python version 3.12. If you are on Linux or MacOS, then please run the command

curl --proto '=https' --tlsv1.2 -sSf https://sh.rustup.rs | sh

from your terminal. If you are on Windows, download and run the exe file at this link This Warning will be removed when the bug is patched.

Using pip (recommended)

To install GerryTools from PyPi, run

pip install gerrytools

from the command line. if you would like to use the mgrp and ben modules as well, you can invoke

pip install gerrytools[mgrp]

you will need to make sure that Docker Desktop is installed on your machine an updated to version >= 4.28.0. For more information on getting this set up, please see our documentation page

Usage

GerryTools is split up into multiple sub-packages, each designed to simplify and standardize redistricting workflows.

  • gerrytools.ben BEN (binary-ensemble) is our general purpose compression algorithm for working with ensembles of plans. In general, the ben algorithm can improve the storage of an ensemble of plans by an order of magnitude. When combined with the special XBEN (eXtreme BEN) portion of the algorithm, many ensembles of plans can be compressed small enough to fit into an email (~25Mb).

  • gerrytools.data deals with the retrieval and processing of data. Here, you can find tools for grabbing decennial Census ('10 and '20), ACS 5-year ('12-'20), ACS CVAP Special Tab ('12-'20), districtr portal, and 2020 decennial Census geometric data. You can also find tools for moving CVAP data to other levels of geometry (e.g. prorating 2019 CVAP on 2019 Census tracts to 2020 blocks).

  • gerrytools.geometry provides facilities for dealing with geometric and related data. There are tools for translating and evaluating GerryChain Partitions, performing fast geometric dissolutions, creating unit maps (e.g. 2020 blocks to 2020 VTDs), creating dual graphs for GerryChain, and optimization algorithms for renaming districts.

  • gerrytools.mgrp this module uses a Docker container to allow users to access several ensemble methods for generating districting plans on a state. In particular our Rust implementation of our gerrychain library, frcw, the Julia implementation of Forest Recom, and the R/C++ implementation of Sequential Monte Carlo (SMC) are available through this module.

  • gerrytools.plotting contains methods for generating extremely high-quality Lab-standard data visualizations.

  • gerrytools.scoring provides a vast array of redistricting plan scores. These can be used standalone or as GerryChain updaters.

  • gerrychain.utilities has ease-of-use methods for renaming directories containing shapefiles (which comes in handy more often than you'd think) and making JSON objects out of Python objects (useful when trying to organize information for many districting plans in a standard format).

Contributing

GerryTools is an active project, and has multiple contributors. If you'd like to contribute, here are a few house rules:

  1. After cloning this repository, run sh setup.sh to download and install necessary git hooks and linting configurations.

  2. Follow the PEP8 style guide. After installing the above git hooks, linting is performed before every push. PEP8 errors can be automatically corrected by running autopep8 --in-place --aggressive -r gerrytools on the command line from the root directory.

  3. Write tests. All changes, major or minor, must be accompanied by testing code. Code and tests will be immediately reviewed by Lab maintainers.

  4. Test coverage must stay at least the same; this can be checked by running pytest --cov=evaltools after the tests are added to tests/.

  5. Write documentation. All changes should be documented via docstrings, and code should be repletely commented. It's much easier to decipher commented code! Docstring documentation is compiled on every commit via git hooks.

We look forward to your contributions!