Compare Claude 3, GPT-4 and Gemini Ultra, side-by-side

These powerful language models top the benchmarks. Here, you can compare how they answer the same questions to understand their qualitative differences.

Do you prefer the answers from one of the models? Do you notice an improvement compared to their smaller predecessors? Which models are more imaginative, creative, persuasive and knowledgeable?

What do you want to test the language models on?

Prompt

GPT-4 (OpenAI) - $20/mo

An app designed for urban gardeners that uses augmented reality to help visualize potential garden layouts in their available space, offering plant care tips and reminders based on local climate data.

Claude 3 (Anthropic) - $20/mo

A mobile app for busy parents that suggests quick, healthy, and kid-friendly meal ideas based on the ingredients they have on hand.

Prompt

GPT-4 (OpenAI) - $20/mo

In the realm where seconds stretch and yawn,
Where minutes meander, and hours are drawn,
There lies a path, less swift than snail,
A journey through time, where moments prevail.

Claude 3 (Anthropic) - $20/mo

In the fabric of space and time,
A traveler moves at a glacial climb.
Each step, an eon, each breath, an age,
Witnessing the universe's slow-turned page.

Gemini Ultra (Google) - $20/mo

The clock's hands creep, a snail's slow pace,
Distorting moments, leaving barely a trace.
Days melt to months, then years unseen,
A shift so subtle, a life in between.

Prompt

GPT-4 (OpenAI) - $20/mo

"Sweet Deceit: The Cupcake Bagelry"

Claude 3 (Anthropic) - $20/mo

How about "Bake-Believe Bagel Cupcakes"? This name plays on the idea that the cupcakes are designed to look like bagels, making customers do a double-take and believe they're seeing bagels at first glance.