Bilge is an NLP analysis service, specifically built to run NER and Sentiment Analysis models on posts scraped using mergen.
Using Redis pub/sub channels to recieve posts, PostgreSQL to store the analysis results. Redis is also used to schedule NLP tasks for bilge workers to process.
Altough easily modifiable, bilge has minimal 'data cleaning' features. This service was built according to the needs of my OSINT project.
Examining the average monthly sentiment of posts in English that mention 'Bitcoin'.
Sample visualizations of news analysed between January-November 2021, in my Metabase dashboard.
Sample config file (config.json)
{
"db_name" : "posts",
"db_user" : "user",
"db_password" : "password",
"db_host" : "127.0.0.1",
"db_port" : 5432,
"redis_host" : "localhost",
"redis_port" : 6379,
"redis_db" : "0"
}
Sentiment Analysis
- cardiffnlp/twitter-roberta-base-sentiment (English)
- savasy/bert-base-turkish-sentiment-cased (Turkish)
Named Entity Recognition
- Spacy's "en_core_web_trf" (English)