16-Evaluations
Generate synthetic test dataset (with RAGAS)Evaluation using RAGASHF-UploadLangSmith-DatasetLLM-as-JudgeEmbedding-based Evaluator(embedding_distance)LangSmith Custom LLM EvaluationHeuristic EvaluationCompare experiment evaluationsSummary EvaluatorsGroundedness EvaluationPairwise EvaluationLangSmith Repeat EvaluationLangSmith Online EvaluationLangFuse Online Evaluation
Last updated