Indian electoral rolls containing the names of Indian voters are available in multiple places:
https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/MUEGDT
https://zenodo.org/communities/india-religion-politics-raw/records?q=&l=list&p=1&s=10&sort=newest
These datasets have the names of most Indian voters (I'm not sure which languages they are in, as I haven't actually seen them).
Both of them are access-restricted, but you folks might get access if you request it.
This data also has PII, even though it is indeed published by the ECI for public consumption. Care needs to be taken in filtering out the address information and the voter ID information.
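To make the filtering step concrete, here is a minimal sketch of how address and voter ID fields could be dropped from already-parsed records. The field names (`voter_id`, `house_number`, `address`) are hypothetical, and the ID pattern (three letters followed by seven digits) is an assumption about how voter ID numbers usually look, so both would need checking against the actual data.

```python
import re

# Assumption: voter ID numbers look like three uppercase letters
# followed by seven digits. Verify this against the real data.
VOTER_ID_PATTERN = re.compile(r"\b[A-Z]{3}[0-9]{7}\b")

# Hypothetical field names; the real datasets may use different ones.
PII_FIELDS = {"voter_id", "house_number", "address"}


def strip_pii(record):
    """Return a copy of `record` without the PII fields.

    String values in the remaining fields are also scanned, and anything
    that looks like a voter ID is replaced with a placeholder.
    """
    cleaned = {}
    for key, value in record.items():
        if key in PII_FIELDS:
            continue  # drop the whole field
        if isinstance(value, str):
            value = VOTER_ID_PATTERN.sub("[REDACTED]", value)
        cleaned[key] = value
    return cleaned


if __name__ == "__main__":
    # Made-up sample record, for illustration only.
    sample = {
        "name": "Asha Kumari",
        "voter_id": "ABC1234567",
        "address": "12 MG Road",
        "note": "ID ABC1234567 verified",
    }
    print(strip_pii(sample))
```

Dropping known fields covers the structured case; the regex pass is a second line of defence for IDs that leak into free-text fields. Addresses in free text are harder to catch and would likely need manual review.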
Alternatively, I have this year's data as PDFs. From what I have seen, it has the names of Indian voters in local languages and, for some states, in English and a third language (not sure if Bhashini was used to transliterate these). But the names need to be extracted with OCR, and the original dataset is about 5 TB.
If you folks think this is a useful dataset, I can provide access.