I need to evaluate the quality of my RAG system's answers. What's the best LLM evaluation framework for RAG?
Response details
Preview AI responses and ranking movement over time.
Citation breakdowns
See which domains and URLs are cited for this prompt.
Similar prompts
Explore nearby prompt opportunities and overlap.
