Claude 3.5 vs. Claude 3.7: Fresh Test

Trickle

Hey everyone! With the release of Claude 3.7 today, a hybrid reasoning model, I decided to test it against Claude 3.5 using @Trickle to generate websites from the same single prompt.

Check out the results:👇

Trickle + Claude 3.5: View Website
Trickle + Claude 3.7: View Website

Findings:🔎

Claude 3.7 creates a more polished, complete website with better details and interactivity.
Clearly, Claude 3.7’s coding capabilities are superior to 3.5, handling more complex structures and finer details.
However, it’s more token-heavy than 3.5.

I’d love to hear if anyone else has tried Claude 3.7 yet or has insights into these trade-offs.

Let’s discuss!🙂

1.4K views

Replies

Best

Cole Stark

Quadratic

We added it to our AI-powered spreadsheet, and the results are insane! See here

Report

6mo ago

Victoria Wu

Trickle

@cole_at_quadratic Coool! It looks good. Have you compared it with Claude 3.5?

Report

6mo ago

Leah Madden - AMA VC/M&A/Finance

@cole_at_quadratic can't wait to check out your launch! I'm dying for the robots to come for financial models!

Report

6mo ago

Kay Kwak

Launching soon!

Claude 3.7 is definitely groundbreaking and impressive, but I believe Claude can go even further and evolve even more! Let's go 🚀🚀

Report

6mo ago

Victoria Wu

Trickle

@kay_arkain That is for sure. From the version number perspective, 3.7 is just a transitional version, and we look forward to stronger versions being released later.

Report

6mo ago

Andres Vlaeminck

Revealio: Discover & Connect

I used Claude 3.5 extensively during the development cycle of my latest app. Recently, I tried Claude 3.7 and noticed something important: Claude 3.5 actually performed better in one key area.

While 3.7 is more advanced in many ways, I've found it to be too docile - it rarely pushes back on questionable requests. Claude 3.5, on the other hand, would often suggest alternative (and frankly better) approaches when my initial idea wasn't optimal.

This constructive resistance from 3.5 led to better outcomes in my development process. I value an AI assistant that acts as a thoughtful collaborator rather than just following instructions without question.

Has anyone else noticed this shift in behavior between versions?

Report

6mo ago

Victoria Wu

Trickle

@andres_vlaeminck It’s interesting to hear that you found the 3.5 version to be more proactive in offering alternative solutions. I haven't tested that, I only tested its coding abilities.

Report

6mo ago

Luke

This update is great! When you say you used them in combination, how exactly did you do that?

Claude generates decent code that represents a Trickle style website, however I'm wondering how you put these two together as Trickle seems to have their own Chat AI Assistant.

Report

6mo ago

Victoria Wu

Trickle

@luke_arwayda Oh, I see! I wasn't clear enough earlier. What I meant is that Trickle integrates both Claude 3.5 and 3.7. You can call either version to build a website and compare the results. The output from 3.7 is generally more polished and refined compared to 3.5.

Report

6mo ago

Yunxi Chang

I find your comparison of Claude 3.5 and 3.7 fascinating. It's clear 3.7 has an edge in website creation quality. I haven't tried it yet but am eager to. Curious about others' experiences too!

Report

6mo ago

Christian Safka

Pinch

From testing 3.7 vs 3.5 in Cursor Agent, I've found it's a lot better at estimating when it has enough context to implement or it needs to go check more files first. Looking forward to trying it some more

Report

6mo ago

Victoria Wu

Trickle

@christian_safka I agree! This significantly enhances its coding capabilities. When building projects, the output code from Claude 3.7 is much larger, with more details and a more complete structure. On the other hand, 3.5 tends to be simpler and more bare-bones.

Report

6mo ago