OpenAI's EVMbench evaluates AI agents’ ability to identify, patch, or exploit smart contract vulnerabilities. Illustration: Gwen P; Source: Shutterstock

OpenAI releases crypto security tool as Claude blamed for $2.7m Moonwell bug

2026/02/19 08:34

OpenAI and crypto venture capital firm Paradigm on Wednesday released a tool that evaluates AI agents’ ability to identify, patch, or exploit smart contract vulnerabilities.

The tool, EVMbench, draws on 120 vulnerabilities identified across 40 prior smart contract audits, as well as “vulnerability scenarios” taken from audits of Paradigm’s forthcoming Tempo blockchain.

The release comes days after a bug in AI-generated code cost users of crypto protocol Moonwell nearly $2.7 million in crypto.

One Moonwell software engineer said the code in question had passed an audit from crypto security firm Halborn.

So-called agents are instances of artificial intelligence that can complete complex tasks in the digital world. They can write software, purchase theatre tickets, and conduct research on behalf of their users.

EVMbench data shows that OpenAI’s latest agentic coding model, GPT-5.3-Codex, was more than twice as effective as an earlier model, GPT-5, at exploiting vulnerabilities in smart contract code. But its success rates in finding and fixing vulnerabilities “remain below full coverage,” OpenAI said in a news release.

“Agents perform best in the exploit setting, where the objective is explicit: continue iterating until funds are drained,” the company said.

“In contrast, performance is weaker on detect and patch tasks. In ‘detect’, agents sometimes stop after identifying a single issue rather than exhaustively auditing the codebase. In ‘patch’, maintaining full functionality while removing subtle vulnerabilities remains challenging.”

A model from Anthropic, Claude Opus 4.6, scored the highest mean result in detecting software vulnerabilities. GPT-5.3-Codex achieved the highest results in patching and exploiting smart contracts.

OpenAI cautioned that EVMbench doesn’t capture the true challenge of securing smart contracts, given the limited sample of vulnerabilities used to build the tool. And it can’t reliably determine whether agent-found vulnerabilities are, in fact, false positives.

Hacks have long bedevilled the crypto industry. Non-reversible transactions make crypto protocols’ smart contracts an attractive target for cybercriminals.

As of Wednesday evening, protocols had suffered more than $108 million in hacks and exploits in 2026, according to DefiLlama data.

Aleks Gilbert is DL News’ New York-based DeFi correspondent. You can reach him at [email protected].
