Home Explore Settings

Search

⚡ AI-Handwerk.de ⚡
⚡ AI-Handwerk.de ⚡ @AIHandwerk · Jan 15
Chinesisches KI-Startup Z.ai landet mit GLM-4.7 einen Coup: Erstmals 73.8% in SWE-Bench erreicht und damit neue Maßstäbe für #CodingBenchmarks gesetzt. Open-Source und Innovationsschub in einem! #GLM47 #KünstlicheIntelligenz #München #Hamburg
38
Kuro News
Kuro News @KuroNewsID · Jul 25, 2025
"AI Coding Challenge Reveals Major Gaps in Debugging Skills. A recent competition hosted by Turing Labs showed AI models struggle with complex code errors. Top systems solved only 65% of debuggin..." turtnws.blogspot.com/2025/07/ai-cod… #AIcodingchallenge #codingbenchmarks #AIperformancegap
15
Learnopoly
Learnopoly @Learnopoly_ · May 21, 2025
Whether you're building coding tools, testing AI models, or training dev teams, Swe-Polybench gives you a clearer picture of real-world coding ability. 🔍 Explore it now at: �learnopoly.com2P #AIinCoding #CodingBenchmarks #SwePolybench #TechInnovation #DeveloperToolG8
7
Kumar
Kumar @kumardeepam · Apr 6, 2025
Replying to @kumardeepam
Coding capabilities might be a weak spot for Llama 4. 🤔 Based on the benchmarks coding performance may lag behind other models. Independent benchmarks are eagerly awaited! #Llama4 #CodingBenchmarks #SoftwareDevelopmentd
1
53
Ai Toolchest
Ai Toolchest @AIToolchest · Feb 28, 2025
GPT-4.5 Performance: Outshining GPT-4 but Lacking Against Deep Research #AIperformance #codingbenchmarks #DeepResearch #GPT-4.5 #OpenAI aitoolchest.com/gpt-4-5-perfor…
25
Load more

Trends

1 · Trending
Uber Eats
2 · Trending
#ENGENES_MADE_ENHYPEN_7
Trending worldwide
3 · Trending
#ด้วงกับเธอEP9
Only on X · Trending
4 · Trending
DUANG WITH YOU EP9
Only on X · Trending
5 · Trending
エバヤン
Sports · Trending
Horizon Web