BixBench: AI Agents Hit 63% Accuracy on Bioinformatics Research Tasks, Tripling Performance in 10 Months
A benchmark of nine LLMs on computational biology tasks shows the best agents reaching 63% accuracy — three times higher than the state of the art just ten months ago.
Subscribe to unlock all stories
Get full access to The Singularity Ledger, archive included.
Cancel anytime. Payments powered by Stripe.