# Experiments: Synthetic Data Generation & Evaluation
This folder contains scripts for running batch experiments and evaluations on your RAG pipeline using LangSmith.
## Contents
- `evaluate_on_dataset.py`: runs your RAG pipeline on all questions in the LangSmith dataset and logs the predictions.
- `evaluate_predictions.py`: runs automated evaluation (Correctness, Helpfulness, Dopeness) on the predictions using LangSmith evaluators.
## Prerequisites
- Python 3.10+
- All project dependencies installed (see project root requirements)
- API keys set as environment variables:
  - `OPENAI_API_KEY`
  - `LANGCHAIN_API_KEY`
- (Optional) Vectorstore location:
  - `VECTORSTORE_PATH` (default: `/tmp/vectorstore`)
- LangSmith Tracing:
  - `LANGCHAIN_TRACING_V2` (must be set to `true` to enable tracing in LangSmith)
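For reference, here is a minimal sketch of how a script might resolve this configuration; the variable names other than the environment keys are illustrative, not taken from the actual scripts:

```python
import os

# Required keys: fail fast if they are missing.
OPENAI_API_KEY = os.environ["OPENAI_API_KEY"]
LANGCHAIN_API_KEY = os.environ["LANGCHAIN_API_KEY"]

# Optional: where the vectorstore lives on disk.
VECTORSTORE_PATH = os.environ.get("VECTORSTORE_PATH", "/tmp/vectorstore")

# Tracing is only enabled when the variable is literally the string "true".
TRACING_ENABLED = os.environ.get("LANGCHAIN_TRACING_V2", "").lower() == "true"
```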
## Usage
Run the RAG pipeline and log predictions:
```bash
export OPENAI_API_KEY=sk-...
export LANGCHAIN_API_KEY=ls-...
export LANGCHAIN_TRACING_V2=true
export VECTORSTORE_PATH=/tmp/vectorstore  # or your preferred path
python evaluate_on_dataset.py
```

This processes all questions in the LangSmith dataset and logs your app's predictions.
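For orientation, a minimal sketch of the loop `evaluate_on_dataset.py` presumably runs; `DATASET_NAME`, the placeholder `rag_pipeline`, and the `"question"` input key are assumptions, not the script's exact code:

```python
from langsmith import Client, traceable

client = Client()  # reads LANGCHAIN_API_KEY from the environment
DATASET_NAME = "my-rag-dataset"  # placeholder; must match your LangSmith dataset


@traceable  # each call is logged as a trace when tracing is enabled
def rag_pipeline(question: str) -> str:
    # Placeholder for the real RAG chain; replace with your pipeline's invoke.
    return f"(answer for: {question})"


for example in client.list_examples(dataset_name=DATASET_NAME):
    question = example.inputs["question"]  # assumes a "question" input key
    print(rag_pipeline(question))
```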
Run evaluation on predictions:
```bash
python evaluate_predictions.py
```

This scores your predictions for correctness, helpfulness, and dopeness, and logs the results to LangSmith.
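As a sketch of what this evaluation step can look like with the `langsmith` SDK's `evaluate` harness and its `LangChainStringEvaluator` wrapper (the target function, dataset name, and the "dopeness" criterion wording below are assumptions):

```python
from langsmith.evaluation import LangChainStringEvaluator, evaluate

# Built-in LangChain criteria for correctness (reference-based) and helpfulness,
# plus a custom "dopeness" criterion defined inline.
correctness = LangChainStringEvaluator("labeled_criteria", config={"criteria": "correctness"})
helpfulness = LangChainStringEvaluator("criteria", config={"criteria": "helpfulness"})
dopeness = LangChainStringEvaluator(
    "criteria",
    config={"criteria": {"dopeness": "Is this answer dope, i.e. cool and impressive?"}},
)


def predict(inputs: dict) -> dict:
    # Placeholder target; replace with a call into your RAG pipeline.
    return {"output": f"(answer for: {inputs['question']})"}


evaluate(
    predict,
    data="my-rag-dataset",  # placeholder; must match your LangSmith dataset
    evaluators=[correctness, helpfulness, dopeness],
    experiment_prefix="rag-eval",
)
```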
View Results:
- Go to your LangSmith dashboard and open the relevant project/dataset to see experiment results and metrics.
## Notes
- Make sure your dataset name matches between the scripts and LangSmith (see the snippet after this list).
- You can rerun these scripts as you update your pipeline or data.
- The vectorstore is stored in `/tmp/vectorstore` by default, which is suitable for cloud environments like Hugging Face Spaces. Set `VECTORSTORE_PATH` if you want to use a different location.
- Tracing: setting `LANGCHAIN_TRACING_V2=true` is required for detailed trace logging in LangSmith. Without it, traces will not appear in your LangSmith dashboard.
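One simple way to keep the dataset name consistent across both scripts (a suggested pattern, not necessarily what the scripts do today) is to read it from a single environment variable with a shared default:

```python
import os

# Both scripts read the same variable, so the dataset name can only diverge
# from LangSmith in a single place.
DATASET_NAME = os.environ.get("LANGSMITH_DATASET", "my-rag-dataset")
```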