-
Notifications
You must be signed in to change notification settings - Fork 17
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
reading wikiart captions from jsonl files #28
Comments
import json with open("./StyleGallery.jsonl", 'r') as f: |
Thanks for your reply. This outputs all the tags, so how can I tell if it belongs to the wikiart dataset? Because MultiGen-20M and JourneyDB also have "content_prompt". |
Word "wikiart" should be in data["image_file"]. |
I see what you mean, I tried the following code: with open("./json_files/StyleGallery.jsonl", 'r') as f: with open("./json_files/StyleGallery.jsonl", 'r') as f: |
Hi, i found something wrong in our StyleGallery.jsonl, i will update the correct version soon. |
It might take two hours. |
Hi, i have updated it in here. |
Hello, I want to read the caption of the wikiart section in the latest JSONL file. I am using the following code, but I am unable to read it.
##################################################
import json
import jsonlines
import pprint
with open('./json_files/StyleGallery.jsonl') as file:
for line in jsonlines.Reader(file):
if "img_file" in line :
pprint.pprint(line["img_file"])
The text was updated successfully, but these errors were encountered: