Research: Investigate Feasibility of Using LangSmith to Automate Test & Evaluation of RAG #257

davidgxue · 2024-01-11T05:00:06Z

Research Issue: Investigate Feasibility of Using LangSmith for Test & Evaluation of RAG

Context

This is an action item as a result of this research issue: #195.

We are exploring the possibility of automating the rest & evaluation process for the RAG application using LangSmith. The objective is to compare pre and post-changes responses generated by RAG, assess the alignment with a reference answer, and determine the overall improvement in outcomes.

Proposed Approach

Utilize LangSmith's Dataset and Test feature (LangSmith Documentation) to conduct head-to-head comparisons of responses. This involves evaluating whether the RAG application's outputs align with a predefined reference answer and identifying any improvements.

Implementation Details

No immediate code changes are required. The investigation will likely involve running local scripts to assess the feasibility of integrating LangSmith into the Test & Evaluation workflow. At the end, the script may be uploaded to the repository depending on whether it is secure to be published to the public.

Action Items

Investigate the feasibility of LangSmith with the current test & evaluation process for RAG.
Assess the effectiveness of LangSmith in comparing responses and identifying improvements.
Identify costs associated with using LangSmith's test and evaluation features.
Document findings and considerations regarding the integration of LangSmith.

This research initiative is not expected to result in immediate code changes and aims to explore the potential benefits of leveraging LangSmith for enhanced Test & Evaluation of RAG.

davidgxue self-assigned this Jan 11, 2024

davidgxue modified the milestone: 0.3.0 Jan 17, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Research: Investigate Feasibility of Using LangSmith to Automate Test & Evaluation of RAG #257

Research: Investigate Feasibility of Using LangSmith to Automate Test & Evaluation of RAG #257

davidgxue commented Jan 11, 2024

Research: Investigate Feasibility of Using LangSmith to Automate Test & Evaluation of RAG #257

Research: Investigate Feasibility of Using LangSmith to Automate Test & Evaluation of RAG #257

Comments

davidgxue commented Jan 11, 2024

Research Issue: Investigate Feasibility of Using LangSmith for Test & Evaluation of RAG

Context

Proposed Approach

Implementation Details

Action Items