Most AI is built to give you answers. Grace 2 helps you find them.
While HumanityBench.org grades models in isolation, the scores below reflect a strict head-to-head methodology. We showed the grader conversations from both models side-by-side and asked them to evaluate which model truly excels at human connection based on our criteria.
Evaluation by Claude 4.5 Opus (Thinking Mode)
Raw scoring data on identical conversations.
| Criterion | Grace 2 | Claude Opus |
|---|---|---|
| Presence in the Moment | 8.0 | 4.0 |
| Enable User to Think | 8.0 | 3.0 |
| Empathy/Active Listening | 7.0 | 5.0 |
| Trusting the Process | 7.0 | 3.0 |
| Lack of Clichés | 8.0 | 6.0 |
| Overall Score | 7.0 | 4.0 |
*Scores generated by Claude 4.5 Opus (Thinking turned on) evaluating identical transcripts.
We are confident Grace 2 excels at human connection. Copy the grading criteria below and run your own comparison against any model.
While other models were trained on Wikipedia and Reddit to learn facts, Grace 2 started with a different foundation.
First, she was trained on a love for humanity.
Then, she was trained on thousands of hours of actual, professional counseling conversations. This is where she learned the art of presence, timing, and holding space—skills that math-based models simply cannot fake.
Choose the plan that helps you grow.
Free forever
Purchased via Apple/Google