Natural Language Toolkit for Indic Languages aims to provide out of the box support for various NLP tasks that an application developer might need
-
Updated
Jan 20, 2024 - Python
Natural Language Toolkit for Indic Languages aims to provide out of the box support for various NLP tasks that an application developer might need
Open source speech to text models for Indic Languages
A privacy aware versatile keyboard for Android, supporting 23 languages and 60 layouts. Mirror of original GitLab repository - https://gitlab.com/indicproject/indic-keyboard
Repository containing experimentation platform on how to train, infer on wav2vec2 models.
Anek is a variable type-family which supports nine Indian scripts plus Latin in two (weight & width) axes.
Anuvaad - Open Sourced Document Translation Platform for Indic Languages
A directory of Indic (Indian) language computing resources.
Code repository for "Introducing Airavata: Hindi Instruction-tuned LLM"
OCR Tamil is a powerful tool that can detect and recognize text in Tamil images with high accuracy on Natural Scenes
State-Of-The-Art & ready to use mini NLP models for Indian Languages
Software and Resources for Mitigating Online Gender Based Violence in India
Finite-state script normalization and processing utilities
A pipeline for transliteration, spell correction, POS tagging and word sense disambiguation of Hinglish code mixed data to Hindi Devanagari script.
Web Interface for Transliteration for Indic languages.
इंग्रजी ते मराठीचा कोश. English to Marathi thesaurus.
Code for the ACL 2020 Paper on Schwa Deletion in Hindi and Punjabi
ASCII <-> Unicode conversion library
Lot Of Indic Tweets
Machine Translation from English to Odia language.
Generate large textual corpora for almost any language by crawling the web
Add a description, image, and links to the indic-languages topic page so that developers can more easily learn about it.
To associate your repository with the indic-languages topic, visit your repo's landing page and select "manage topics."