EABlock

EABlock is an engine-agnostic computational block that solves the problem of entity alignment as part of a declarative knowledge graph creation pipeline. It comprises a set of functions, defined using the Function Ontology (FnO) and callable from RML mapping rules, together with an efficient strategy for evaluating them. The functions in EABlock rely on an external engine for the named entity recognition (NER) and entity linking (EL) tasks that entity alignment requires; Falcon2.0, for instance, performs both NER and EL. The EABlock interpreter follows an eager evaluation strategy, which executes the EABlock functions before the RML mapping rules are executed.
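
To make the idea of eager evaluation concrete, here is a minimal Python sketch. It is not EABlock's implementation: `link_entity` is a hypothetical stand-in for an NER/EL engine such as Falcon2.0, the `example.org` IRIs and the column names are invented, and caching results per distinct input value is only an assumption about how repeated calls could be avoided. The sketch materializes the function output as a new column before any mapping rule would run.

```python
from typing import Callable, Dict, Iterable, List

Row = Dict[str, str]


def link_entity(mention: str) -> str:
    """Hypothetical stand-in for an NER/EL engine (e.g. Falcon2.0)."""
    # Toy lookup table; a real engine would be queried here instead.
    toy_index = {
        "aspirin": "http://example.org/entity/Aspirin",
        "ibuprofen": "http://example.org/entity/Ibuprofen",
    }
    return toy_index.get(mention.strip().lower(), "")


def eager_evaluate(
    rows: Iterable[Row],
    input_column: str,
    output_column: str,
    function: Callable[[str], str],
) -> List[Row]:
    """Apply `function` up front and store its result in a new column."""
    rows = list(rows)
    # Assumption: evaluate once per distinct input value, then reuse the result.
    distinct_values = {row[input_column] for row in rows}
    cache = {value: function(value) for value in distinct_values}
    return [{**row, output_column: cache[row[input_column]]} for row in rows]


# Invented source data: the function is applied to the "drug" column.
source = [
    {"id": "1", "drug": "Aspirin"},
    {"id": "2", "drug": "ibuprofen"},
    {"id": "3", "drug": "Aspirin"},
]

materialized = eager_evaluate(source, "drug", "drug_entity", link_entity)
for row in materialized:
    print(row["id"], row["drug_entity"])
```

In this sketch, mapping rules evaluated afterwards would simply read the precomputed `drug_entity` column instead of calling the function themselves.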

Figure: the EABlock pipeline.

Publication:

The EABlock research paper, "EABlock: A Declarative Entity Alignment Block for Knowledge Graph Creation Pipelines", is published in the proceedings of SAC 2022, the ACM/SIGAPP Symposium on Applied Computing.

How to use EABlock?

  1. Create your own configuration (.ini) file, following the example provided in config.ini.
  2. Choose one of the options below to run EABlock:

Run with Python 3:

pip install -r requirements.txt
python3 /PATH_TO_EABlock/Functions_Interpreter/run_translator.py /PATH_TO_YOUR_CONFIG_FILE/YOUR_CONFIG_FILE.ini
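
If you prefer to launch the interpreter from your own Python code, the command above can be wrapped as follows. This is only a convenience sketch: `build_command` and `run_eablock` are hypothetical helper names, not part of EABlock, and both paths are arguments you must supply yourself.

```python
import subprocess
import sys
from pathlib import Path
from typing import List


def build_command(eablock_dir: str, config_file: str) -> List[str]:
    """Build the argument list for run_translator.py (hypothetical helper)."""
    script = Path(eablock_dir) / "Functions_Interpreter" / "run_translator.py"
    # sys.executable reuses the interpreter that is running this script.
    return [sys.executable, str(script), str(Path(config_file))]


def run_eablock(eablock_dir: str, config_file: str) -> int:
    """Run the interpreter and return its exit code (hypothetical helper)."""
    if not Path(config_file).is_file():
        raise FileNotFoundError(f"Config file not found: {config_file}")
    completed = subprocess.run(build_command(eablock_dir, config_file), check=False)
    return completed.returncode
```

Calling `run_eablock` with the EABlock checkout directory and your config file is then equivalent to running the `python3` command above by hand, with an early error if the config file does not exist.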

Use the Docker image directly:

# move to docker-compose directory
cd docker

# run the docker instance
docker-compose up -d

# run EABlock inside the container
docker exec -it EABlock python3 /EABlock/run_translator.py /source/config-test.ini

Build the Docker image locally:

cd Functions_Interpreter

# Docker repository names must be lowercase
docker build -t sdmtib/eablock:1.0 .

Reproducibility:

We have empirically studied the performance of knowledge graph creation pipelines that use EABlock, as well as the resulting knowledge graphs generated by such pipelines. All results from the three categories of experimental studies performed in the paper to evaluate EABlock can be reproduced. The setup code is available here, and all data and intermediate datasets (the outputs of the code) are accessible here. The paths to the datasets for each experiment category are listed in separate DATASETS txt files (example).

Authors