Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Schema.rdfs.org files maintenance #61

Open
mmr99869 opened this issue Nov 17, 2014 · 3 comments
Open

Schema.rdfs.org files maintenance #61

mmr99869 opened this issue Nov 17, 2014 · 3 comments

Comments

@mmr99869
Copy link

At http://schema.rdfs.org/ website I could find different format files in which schema.org terms are represented. However, those files do not have the latest version of schema.org available.

For instance, the property "hasPart" of http://schema.org/CreativeWork type does not appear on the JSON file (neither on the other schema.rdfs.org file formats such as RDF/XML, CSV, and so on).

I would like to know where I could find the latest version of schema.org in JSON file format.

In the scrapers sub-project website (https://github.com/mhausenblas/schema-org-rdf/tree/master/scrapers) I could find a process based on a Python script to generate a JSON file with schema.org terms. However, I would like to have a website link (such as the link for the all.json file available at schema.rdfs.org: http://schema.rdfs.org/all.json).

Is that possible?

Thank you,

Martín Menes Rouco

@Jonadabe
Copy link

Hi there,

In the meantime you can use http://www.easyrdf.org/converter with this link http://schema.org/docs/schema_org_rdfa.html and convert it to any format as you need!

Just a thought.

@mmr99869
Copy link
Author

Hi Jonadabe,

After analyzing your website link (http://schema.org/docs/schema_org_rdfa.html) and a month of work (this trouble with schema.org was just a part of a bigger project I made for the Universitat Politècnica de Catalunya - BarcelonaTech), I would like to share with you (and the other software developers here) an application I made to automatically obtain the latest schema.org version available as a JSON file.

First, I thought to do what you explained about the website link and the Easy RDF Converter. However, the converter did not make a good JSON conversion and the website link is not up-to-date to schema.org v1.91. Also, I had troubles with the link because the internal HTML style is not respected and there are some out-dated schema.org components.

Taking into account that I needed a JSON file with the latest schema.org version, I decided to develop an application to do so.

Website Schema.org Designer JSON Parser is an application that connects to http://www.schema.org/ website and analyzes its HTML data in order to make a JSON file representing the latest schema.org.

That application is part of the "Website Schema.org Designer" project and you can view it and download it from here:

http://mpi.upc.edu/gmc-en/tools/schemaorg/

I hope this helps to other software developers with the same problem ;)

Thank you,

Martín Menes Rouco

@janober
Copy link

janober commented Nov 13, 2015

Fixed the script and created a pull request #66 . So if anybody wants to scrape by themself again they can. For all other ones did I create the following website where you can simply download the up to date versions (JSON, CSV & Turtle): http://schema.link.fish/

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants