Skip to content
View DesmondChoy's full-sized avatar

Block or report DesmondChoy

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
DesmondChoy/README.md

Hello, my name is Desmond

I am a Data Science Manager at Pacific Life Re, where I primarily operate as an individual contributor (Shiny apps linked to actuarial assumptions, 100% local LLM RAG chatbot, ML predictive models, etc) but occasionally help to lead large team projects as scrum master.

I relish the journey of learning, growing and applying my knowledge in data science, machine learning, and AI to build new tools and explore interesting datasets. Perhaps that's why I'm drawn to both my current field and past (investment research & portfolio management) as they are multidisciplinary in nature.

Off work, I enjoy tinkering with open source tools to build and experiment, as well as exploring unusual datasets. Please refer to Project Highlights below for more details.

🔗 Connect with me

LinkedIn Twitter

🏗️ Projects Highlights

Building GenAI apps to solve real world problems:

  • Multi-Agent Recommender System (GitHub link)
    You want certain coverage but don't wish to spend time crawling through all the different policies out there?
    Let a system of agents help you - they work together to provide personalized recommendations for travel insurance.
    Customizable and scalable.
  • AI Voice Assistant (GitHub link)
    Have a fluid life-like conversation when buying insurance.
    Customers’ transcript (unstructured) is converted to structured output which can be used for downstream LLM/ML workflows.
  • Podcast Summarizer (GitHub link)
    Too busy to listen to all the interesting podcasts out there?
    Try condensing them into nuggets of wisdom. Utilizes text splitting and multi-step prompting.
  • Retrieval Augmented Generation (GitHub link)
    RAG over financial documents. Retrieval methods are evaluated.

ML end-to-end pipeline: What Makes A Cup Of Coffee Memorable?

  • Showcased detailed feature engineering pipeline (thresholding, knn imputing, normalizing, etc)
  • Trained, resampled, and tuned hyperparameters for LASSO, Random Forest, and XGBoost models, then evaluating models on test data
  • Utilized variable importance to understand top predictors across models (one of my created feature ranked highest)

Full list of projects: Python | R

⚡ Technologies

Below is a list of open source technologies (frameworks, libraries, and languages) I regularly use and am familiar with.

🗣️ Programming Languages
Python, R, SQL

🧹 Data Tidying, Visualization, EDA
Python: Polars, Pandas, Plotnine
R: Tidyverse, ggplot2, Shiny

🤖 Machine Learning
Python: Scikit-Learn, XGBoost
R: Tidymodels

🎲 Deep Learning / LLMs
Transformers, PyTorch, Langchain, CrewAI

☁️ Cloud
AWS, Github, Bitbucket

Pinned Loading

  1. char-kuay-teow-or-kl-hokkien-mee char-kuay-teow-or-kl-hokkien-mee Public

    Using Computer Vision models to discern different noodle dishes

    Jupyter Notebook

  2. podcast_transcript podcast_transcript Public

    transcribing podcast audio, then summarizing it into meaningful nuggets of wisdom

    Jupyter Notebook

  3. ai_underwriter ai_underwriter Public

    Take a risk assessment given by an AI voice assistant underwriter

    Jupyter Notebook 5

  4. agents_recsys agents_recsys Public

    Multi-agent workflow to facilitate insurance recommendations

    Jupyter Notebook