RAG Evaluation Suite

RAG Chatbot + GPT-4o Judge
RAG Chatbot

⚖️ LLM Judge

After receiving a response, run the GPT-4o judge to score its faithfulness to the retrieved context and its relevancy to the question.
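As a rough illustration of the judging step, the sketch below builds a judge prompt, sends it to a model, and parses two scores from the reply. Everything named here (`JUDGE_PROMPT`, `judge_response`, `call_model`, `fake_judge`, the 0 to 1 scale, the JSON reply shape) is an assumption made for the example, not this app's actual code. `call_model` is a placeholder for whatever function sends a prompt to GPT-4o and returns its text reply; a stub stands in for it so the snippet runs offline.

```python
import json
from typing import Callable, Dict

# Hypothetical prompt template; the wording and the 0-1 scale are assumptions.
JUDGE_PROMPT = """You are an impartial judge of a RAG chatbot.
Question: {question}
Retrieved context: {context}
Answer: {answer}

Score the answer from 0 to 1 on:
- faithfulness: is every claim supported by the retrieved context?
- answer_relevancy: does the answer address the question?
Reply with JSON only: {{"faithfulness": <float>, "answer_relevancy": <float>}}"""

METRICS = ("faithfulness", "answer_relevancy")


def judge_response(
    question: str,
    context: str,
    answer: str,
    call_model: Callable[[str], str],
) -> Dict[str, float]:
    """Ask the judge model for both scores and clamp them to [0, 1]."""
    prompt = JUDGE_PROMPT.format(question=question, context=context, answer=answer)
    raw = call_model(prompt)      # in the real app this would call GPT-4o
    scores = json.loads(raw)      # raises ValueError if the reply is not valid JSON
    return {m: min(1.0, max(0.0, float(scores[m]))) for m in METRICS}


def fake_judge(prompt: str) -> str:
    """Offline stand-in for the model call, used only for this demo."""
    return '{"faithfulness": 0.9, "answer_relevancy": 1.2}'


scores = judge_response(
    question="What does the judge score?",
    context="The judge scores faithfulness and answer relevancy.",
    answer="It scores faithfulness and answer relevancy.",
    call_model=fake_judge,
)
print(scores)
```

Passing the model call in as a function keeps the scoring logic testable without network access; clamping guards against a judge that returns values outside the expected range.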


Faithfulness

Answer Relevancy

QA Regression Suite

Compare live answers generated from Pinecone retrievals against your golden ground-truth dataset.
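A minimal sketch of such a regression run follows, under stated assumptions: the golden dataset is a list of question/expected-answer pairs, `answer_fn` is a placeholder for the live pipeline (Pinecone retrieval plus generation), and answers are compared with a simple token-overlap F1. That metric is swapped in for illustration only; the suite itself may compare answers differently, for example with the LLM judge. All function names and the 0.5 threshold are hypothetical.

```python
from collections import Counter
from typing import Callable, Dict, List


def token_f1(predicted: str, expected: str) -> float:
    """Token-overlap F1 between two answers (case-insensitive, whitespace split)."""
    pred = predicted.lower().split()
    gold = expected.lower().split()
    if not pred or not gold:
        return float(pred == gold)
    # Multiset intersection counts each shared token at most as often as it appears in both.
    overlap = sum((Counter(pred) & Counter(gold)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred)
    recall = overlap / len(gold)
    return 2 * precision * recall / (precision + recall)


def run_regression(
    golden: List[Dict[str, str]],
    answer_fn: Callable[[str], str],
    threshold: float = 0.5,  # assumed pass mark, tune for your data
) -> List[Dict[str, object]]:
    """Query the live pipeline for each golden question and score the answer."""
    results = []
    for case in golden:
        live = answer_fn(case["question"])
        score = token_f1(live, case["expected"])
        results.append(
            {"question": case["question"], "score": score, "passed": score >= threshold}
        )
    return results


# Illustrative data only; a real golden dataset would be loaded from a file.
golden = [
    {"question": "What is the capital of France", "expected": "Paris is the capital of France"},
    {"question": "What does Pinecone store", "expected": "Pinecone stores vectors"},
]

# Stub standing in for the live chatbot so the example runs offline.
canned = {"What is the capital of France": "The capital of France is Paris"}


def stub_answer(question: str) -> str:
    return canned.get(question, "I do not know")


results = run_regression(golden, stub_answer)
for row in results:
    print(row)
```

Cases that fall below the threshold are the regressions to inspect; lexical overlap misses paraphrases, so a semantic or judge-based comparison is often a better fit for free-form answers.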