Social Pulse
i think some are still skeptical that ai can be consistently good at complex tasks...
@Andrey__HQ @mark_k i think some are still skeptical that ai can be consistently good at complex tasks though and for swe benchmarks most are still not greater than 60% by default
ResearchSocial Threadx.com