
Athina AI
Monitor LLMs and automatically detect hallucinations in prod





About | Details |
---|---|
Name: | Athina AI |
Submited By: | Johnathan Homenick |
Release Date | 1 year ago |
Website | Visit Website |
Category |
Athina helps developers monitor and evaluate LLMs in production. Get complete visibility into RAG pipeline and 40+ preset eval metrics to detect hallucinations and measure performance
This is an important area of focus for anyone seriously using generative AI. Good luck.
10 months ago
Hi Shiv, this is very interesting. I think there is great value in able to test prompts with different models.
1 year ago
Looks super useful. Congratulations on the progress! And thanks for having a free tier :D
1 year ago
This is excellent. Very helpful use case, highly recommend. Good job Shiv.
1 year ago
Detecting Digital hallucinations, who would have thought we would need to deal with this?
1 year ago
It's inspiring to see how your team identified a common challenge in taking LLMs into production and developed a solution to address it. What specific benefits do you anticipate it bringing to developers and organizations?
1 year ago
Shiv & Himanshu, Huge congrats on rolling out Athina AI! It’s evident the amount of dedication and insight that went into addressing the complexities of deploying LLMs. The detailed monitoring and robust collection of evaluation metrics Athina offers seem like it could revolutionize the way developers approach model deployment. How it sheds light on model performance and identifies hallucinations - feels impressive. By the way, what’s the story behind the name 'Athina'? Excited to watch Athina AI evolve and make a significant impact on developers’ efficiency and workflow optimization! Best of luck to you and your entire team!
1 year ago
Hey Shiv, A few questions: - can I moderate hallucinations in real time and stop my llm from sending response to user for example? Or is it just alerting? - is it possible to evaluate rag when the vector store changes ? Do you have side by side retrieval evaluation? - if my app has multiple chained llm calls, can I evaluate the entire flow ? Thx
1 year ago
Congrats on the launch Shiv and the team. Athina will drive a tangible impact. The product seems to tackle a concrete problem in an intuitive, user-friendly way. Inspiring work.
1 year ago