- Installation
- Prepare Data for Filecoin Deal
- Make Filecoin Deal
- Enrich STAC server with CIDs
- Contributing
- FAQ
- License
- SP - Storage Provider
- CID - Content Identifier
- CAR - Content-Addressable aRchive
- Piece - The main unit of negotiation for data that users store on the Filecoin network.
- Sector - A sector is a fixed-size (32GiB) chunk of data that is stored on the Filecoin network. A sector is made up of one or more pieces. A sector is the smallest unit of storage that a storage provider can commit to storing on the Filecoin network.
- Deal - A deal is a commitment by a storage provider to store a piece of data for a client. A deal is made up of a piece and a sector. A deal is the smallest unit of storage that a client can commit to storing on the Filecoin network.
- STAC - SpatioTemporal Asset Catalog
- S3 - Amazon Simple Storage Service
- Singularity - A tool for packing data into CAR files and creating Filecoin deals.
This project uses Poetry for dependency management and is compatible with python versions >3.9 and <3.12. To install all necessary dependencies, follow these steps:
Note: Mac OS with M1 chip may encounter problems A general solution is to use
brew install
for the missing executable. See description and solution at pyproj4/pyproj#1027
-
Install Poetry if you haven't already:
curl -sSL https://install.python-poetry.org | python3 -
-
Clone the repository:
git clone https://github.com/easierdata/easier.git cd easier
-
Install the dependencies:
poetry install
Included in the repo root is a .vscode
folder that contains VSCode files with predefined settings. Recommended VSCode extensions are also included that can be installed for a more tailored development experience for this project. When this workspace is opened for the first time, VSCode will prompt to install the recommended extensions. You can also review this list through Extensions -> Show Recommended Extensions command.
Note: Workspace settings override user settings.
The linters Black, Mypy and Ruff have been integrated into a pre-commit configuration file (file named .pre-commit-config.yaml
) as to ensure that no bugs or issues appear in your code before pushing changes to your repo. To install the pre-commit git hooks:
-
Run the following command:
pre-commit install
now
pre-commit
will run automatically ongit commit
. -
(Optional) Run against all files in your repo with the following command:
pre-commit run --all-files
Note: mypy is disabled by default. To enable, uncomment the
mypy
section in the.pre-commit-config.yaml
file and re-run thepre-commit install
command. Additionally, set themypy
properties found in .vscode/settings.json
totrue
to enable the mypy extension in VSCode.
Adding pre-commit plugins to your project is done with the .pre-commit-config.yaml
configuration file. To view a list of all supported plugins, visit this link.
We use the Fast API STAC Server to serve STAC metadata and respond to STAC API requests. The server is configured to use a PostgreSQL database to store STAC metadata. The server and database are run in Docker containers and spun up with docker compose.
Once installed, you can run the server and database with the following command:
make docker-run-pgstac
See the Singularity documentation for installation steps.
TODO
See Building and Installing Boost for installation steps.
Currently, there is a performance hit when multiple sectors are accessed when retrieving files on Filcoin. For geospatial data, there is an opportunity for optimization by packing data that is 'nearer' to each other in each sector. This can reduce the number of sectors accessed when retrieving data. See [this blog post] for more info.
by the Storage Provider. If you are working with a SP, ask them if they can perform this step for you.
Before making a Filecoin deal, the data will need to be serialized into a “Content-Addressable aRchive” (.car) file, which is in raw bytes format. This .car file represents a Filecoin Piece which is the main unit of negotiation for data that users store on the Filecoin network.
We will use Singularity to pack data into .CAR files. See the Singularity Docs for more info. Example Python Notebook to pack CAR files
*In order to finalize a Filecoin deal, the storage provider will need access to the data (raw or placed into .CARs). There are a few ways to do this. If you are not working with a SP, you can skip this step.
A - Transfer the data manually by flying/mailing physical hard drives to the SP.
B - Transfer Data to S3 Bucket holding area
When we worked with an SP, we used Rsync to transfer the data to an S3 bucket. The SP then pulled the data from the S3 bucket and placed it into Filecoin deals.
Script we used to transfer the data to S3
C - Set up an HTTP server on your own infrastructure so the data can be pushed/pulled to the SP.
sudo apt install nginx
Edit /etc/nginx/sites-available/default
and add below lines
server {
...
location / {
root /home/user/outDir;
}
...
}
where outDir
is the directory where the CAR files are located.
If you are working with a SP, you can stop here. If you are not working with a SP, you will need to continue with the following steps.
We'll point you to the singularity docs for this step. See this section of the docs for more info on how to create a deal.
If you are working with an SP, you will need to get the CIDs from them. Here is a script you can follow to ask the SP for the CIDs. Once you have the CIDs, you can run the following command to enrich the STAC server with the CIDs.
$ python3 code/pyScripts/validate_stac.py --stac_file testdata/<filename>.json
$ make run-landsat-pgstac
$ curl http://<domain>/stac/collections
$ curl http://<domain>/stac/search?bbox=-122.5,37.5,-122.3,37.7&datetime=2019-01-01/2019-12-31
- Fetch Landsat Data via API
- Pack car files with Singularity
- Web3-enriched STAC-Server
-- TODO
-- TODO
-
Option 1
- 🍴 Fork this repo!
-
Option 2
- 👯 Clone to your local machine using:
$ git clone [email protected]:easierdata/easier.git
- 👯 Clone to your local machine using:
- HACK AWAY! 🔨🔨🔨
- 🔃 Create a new pull request using:
<a href="https://github.com/easierdata/easier/compare" rel="noopener noreferrer" target="_blank">
https://github.com/easierdata/easier/compare</a>
.
-- TODO