Add ability to accept Excel metadata files, to resolve encoding problems #28

kerchner · 2019-12-18T17:30:46Z

The current batch loader CAN correctly process CSV files that contain encoded text (for example, ﺎﺴﺘﻣﺍﺭﺓ ﺶﻛﻭﻯ, resulting in a valid JSON file that is ingested correctly by GW ScholarSpace's rake task. However, typical usage is that metadata is developed in Microsoft Excel, and the metadata file is saved from Excel as a CSV file. Saving as CSV results in a CSV file that garbles the encoded text.

Adding the ability to use an Excel-format metadata spreadsheet would avoid this loss of encoding information.

This should resolve #18 and #23 . This may also provide guidance for a (related, but not identical) solution to issues where https://github.com/gwu-libraries/etd-loader receives text from ProQuest that includes special characters.

The text was updated successfully, but these errors were encountered:

kerchner assigned somanath304 Dec 18, 2019

kerchner added this to the 0.3 milestone Jan 3, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add ability to accept Excel metadata files, to resolve encoding problems #28

Add ability to accept Excel metadata files, to resolve encoding problems #28

kerchner commented Dec 18, 2019 •

edited

Loading

Add ability to accept Excel metadata files, to resolve encoding problems #28

Add ability to accept Excel metadata files, to resolve encoding problems #28

Comments

kerchner commented Dec 18, 2019 • edited Loading

kerchner commented Dec 18, 2019 •

edited

Loading