error parse quad error #2

Open
zengxianhua opened this issue Dec 14, 2019 · 7 comments

@zengxianhua

When processing the RDF representation of the HumanDiseaseOntology (URL: https://github.com/DiseaseOntology/HumanDiseaseOntology/blob/master/src/ontology/doid.owl), parsing errors occur, such as:
InstanceAggregator:194 - Error parsing quad [urn:x-arq:DefaultGraphNode -3be79fdb:16f03ccdbaf:-5dd9 http://www.w3.org/1999/02/22-rdf-syntax-ns#first http://purl.obolibrary.org/obo/DOID_0111104]: Subject has to be a URI, encountered: -3be79fdb:16f03ccdbaf:-5dd9

Can you give any advice?

@prihoda
Collaborator

prihoda commented Dec 16, 2019

You will have to convert the OWL file to an N-Triples file, e.g. using http://www.l3s.de/~minack/rdf2rdf/

@zengxianhua
Author

zengxianhua commented Dec 17, 2019

Thanks for your answer. I tried it and got an N-Triples file, but the same errors appeared. Any advice?
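One possible reading of the error quoted earlier: the subject `-3be79fdb:16f03ccdbaf:-5dd9` looks like an internal blank node identifier rather than a URI, and the `rdf:first` predicate suggests it belongs to an RDF list. If that is the case, converting to N-Triples would not remove such statements, because blank nodes are still written as subjects (prefixed with `_:`). A minimal sketch for counting them follows. The helper name `count_blank_subjects` and the two sample statements are made up for illustration so that the snippet runs on its own; with the real data you would pass the path to doid.nt instead.

```shell
# Count N-Triples statements whose subject is a blank node (starts with "_:").
# count_blank_subjects is a hypothetical helper, not part of any tool in this thread.
count_blank_subjects() {
  # grep -c prints the number of matching lines; it exits with 1 when there are none,
  # so "|| true" keeps the function from reporting a failure in that case.
  grep -c '^[[:space:]]*_:' "$1" || true
}

# Illustrative sample only: one blank-node subject, one URI subject.
sample="$(mktemp)"
cat > "$sample" <<'EOF'
_:b0 <http://www.w3.org/1999/02/22-rdf-syntax-ns#first> <http://purl.obolibrary.org/obo/DOID_0111104> .
<http://purl.obolibrary.org/obo/DOID_0111104> <http://www.w3.org/2000/01/rdf-schema#label> "example" .
EOF

blank_count="$(count_blank_subjects "$sample")"
echo "blank-node subjects: $blank_count"
rm -f "$sample"
```

A non-zero count would be consistent with "Subject has to be a URI" messages being logged per statement while the rest of the file is still processed.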

@prihoda
Collaborator

prihoda commented Dec 18, 2019

Can you attach the N-Triples file here?

@zengxianhua
Author

doid.nt.zip
Thanks for your response. Here is the doid.nt zip file.

@prihoda
Collaborator

prihoda commented Jan 4, 2020

What command are you running? Parsing and showing some basic stats works for me with your doid.nt file:

mvn exec:java -Dexec.args="stats \
--input.file /path/to/doid.nt \
--input.batchSize 1000000 \
--stat OBJECT_URI_COUNT --stat SUBJECT_URI_COUNT"

@zengxianhua
Author

Thank you very much for your response. Here is my script, which fails. Any suggestions?

mvn exec:java -Dexec.args="convert \
--input.file doid.nt \
--output.target CSV \
--output.folder ~/Desktop/0"

@prihoda
Collaborator

prihoda commented Jan 13, 2020

One thing I had to fix in your command was to use absolute paths for the input and output files; maybe that is your error. Otherwise, only warnings were shown and the pipeline finished successfully. The output is stored in the output folder, in subfolders named after each table, e.g. ~/Desktop/0/annotationproperty/part-00000.
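The absolute-path fix could be sketched as follows. This is only an illustration: the `abs_path` helper is hypothetical, the flags are the ones quoted in this thread, and the snippet prints the resulting command instead of running it. It also assumes the paths contain no spaces.

```shell
# Resolve possibly relative paths to absolute ones before building the arguments.
# abs_path is a hypothetical helper; it only prefixes the current directory
# and does not check that the path exists.
abs_path() {
  case "$1" in
    /*) printf '%s\n' "$1" ;;            # already absolute
    *)  printf '%s/%s\n' "$PWD" "$1" ;;  # relative: prefix the working directory
  esac
}

input_file="$(abs_path doid.nt)"
# Use $HOME rather than "~", since "~" is not expanded inside double quotes.
output_folder="$(abs_path "$HOME/Desktop/0")"

args="convert --input.file $input_file --output.target CSV --output.folder $output_folder"

# Print the command for inspection; drop the leading "echo" to run it.
echo mvn exec:java "-Dexec.args=\"$args\""
```

Using `$HOME` instead of `~` may matter here, because the tilde in the original command sits inside a double-quoted string and the shell passes it through unexpanded.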
