osti 51 minutes ago [-]
Given that DeepSwe is one of the very few coding benchmarks worth looking at, this achieves a rather excellent result on it (not far from Opus 4.8).
Judging from the results and my own impression of 5.1 and other models, I think this is the best Chinese coding model by a not-insignificant margin.
LaurensBER 44 minutes ago [-]
I've been very pleased with its performance over the last few days.
It's definitely not near Opus 4.8 level but it's very impressive nonetheless and it does do design extremely well.