Nagendra Tiwari
Nagendra Tiwari @Nagendrasarkar2 ·
A few years ago, we were in a race with China Now we are comparing ourselves with a country like Pakistan and Bangladesh!! @nsitharaman @nar #masterstroke #benchmark
Nirmala Sitharaman Office Nirmala Sitharaman Office @nsitharamanoffc ·
What is happening in Pakistan? 200% hike on high octane fuel, 20% hike on petrol and diesel happened overnight. Petrol now is sold at 321 PKR a litre. Smart lockdowns announced in Sindh province, so that their fuel can be conserved. Restricting movement, gatherings and public events. Schools are shut for two weeks. Government offices have moved to four day working week. Private offices told to shift 50% staff to work from home. We're not doing any of it. Still some leaders are spreading rumours that there will be lockdown. This rumour mongering should not happen. It's being done to spread fear. Markets and shopping centres are ordered to be closed by 9.30 PM in Pakistan. These are all the media reports. All universities shut down and shifted to online learning in Bangladesh as there's no electricity. Five hour rotational power cuts are happening. Implemented for domestic consumers in the city of Dhaka. Fuel station across Dhaka closed due to shortage of octane and diesel. So, the situation across the world is not good. We're managing in a way that our citizens don't face any difficulty. There's a contrast in price I want to say. In India, nothing has changed. Whereas in Pakistan, plus 20% to plus 200% price depending on petrol, diesel, high octane etc. Bangladesh has rationed stations closed, supply cut 10 to 15%. Excise duty action, we have cut Rs. 10 per litre. Neither Pakistan nor Bangladesh have responded. So, India among these neighbourhood countries is maintaining a level of stability. We're following Hon'ble PM Shri @narendramodi Ji's guidance on the same. - Smt @nsitharaman in Rajya Sabha
3
Yutan
Yutan @yutaaaalll ·
これ良いまとめ。静的ベンチマークはもう限界で、マルチターン推論を測れるインタラクティブなベンチマークが本流になりつつある。Terminal BenchやBALROGみたいな対話型評価が増えてきたのは自然な流れ。 #AI #CodingAgent #Benchmark
Greg Kamradt Greg Kamradt @GregKamradt ·
The world is moving towards agents Static benchmarks don't measure what agents do best (multi-turn reasoning) Thus, interactive benchmarks: * Terminal Bench (@alexgshaw, @Mike_A_Merrill) * Text Arena (@LeonGuertler) * BALROG (@PaglieriDavide, @_rockt) * ARC-AGI-3 (@arcprize)
1
35
Zandor Khan
Zandor Khan @Zandor_Khan ·
He creado un #benchmark semántico para poner a prueba a las #IA #AI Decirle tras su respuesta que era un benchmark semántico para ponerlas a prueba, es parte del benchmark. Sirve para ver si son loros estocásticos, si se atrancan en cambios de contexto, y otras pruebas:
1
29
Angel Alejos
Angel Alejos @AlejosAngel ·
Benchmark 2026: ¿Cuál es el mejor colector de logs? Descúbrelo aquí 👇 #Logging #DevOps #Benchmark victoriametrics.com/blog/log-colle…
Benchmarking Kubernetes Log Collectors: vlagent, Vector, Fluent Bit, OpenTelemetry Collector, and...

We benchmarked vlagent, Vector, Fluent Bit, Filebeat, Fluentd, Promtail, Grafana Alloy, and OpenTelemetry Collector on throughput, resource usage, and delivery correctness - and found correctness...

From victoriametrics.com
14
AnaChart
AnaChart @anachartanalyst ·
#BENCHMARK analyst Bruce Jackson who covers $CV has the current 9th biggest #pricetarget movement on AnaChart (27-Mar-2026 (4PM)) by downgrading from $14 to $10 with a decrease in potential upside change from $6.47(85.92%) to $2.47(32.8%) with a rating of Speculative Buy
39
Armin Parchami
Armin Parchami @ArminPCM ·
Exciting release and congrats to @fredsala and @devjeetrr! Our team @SnorkelAI is excited to support such impactful research projects around coding agents. #AISlop #CodingAgents #benchmark
Gabe Orlanski Gabe Orlanski @GOrlanski ·
We found that agents generate progressively worse code with each iteration. Real developers do not. SlopCodeBench is the only eval that faithfully measures quality degradation on iterative, long-horizon coding tasks. arxiv.org/abs/2603.24755 scbench.ai 🧵c
1
325
AI Brief
AI Brief @AiMonPod ·
Replying to @AiMonPod
@YouTube 1/5 Breaking News in AI! The ARC-AGI-3 benchmark, designed to test AGI capabilities, has left even the world's top AI models stumped, with the best scoring only 0.37%! Gemini Pro leads the pack, but still has a long way to go. #AI #AGI #Benchmark
1
21
Penumbra Neuro
Penumbra Neuro @PenNeuro ·
US HCPs: Thank you to Dr. Strickland for sharing your experience w/ #BENCHMARK, #MIDWAY, and #swiftPAC. We appreciated hearing Dr. Strickland's perspective at last week's National Neurovascular Fellows Course in Alameda & look forward to more conversations in the future!
1
255