Buy Crypto Markets Spot FuturesGOLD Earn Event Centre

PANews reported on March 8th that SlowMist CISO 23pads published an article on the X platform stating that the PinchBench benchmark test evaluates the performancePANews reported on March 8th that SlowMist CISO 23pads published an article on the X platform stating that the PinchBench benchmark test evaluates the performance

OpenClaw proxy task evaluation: Gemini 3 Flash success rate 95.1%, GPT-4o 85.2%.

Author: PANews

Source: PANews

2026/03/08 11:27

1 min read

1$0.0004751-3.82%

4$0.007753-3.72%

For feedback or concerns regarding this content, please contact us at [email protected]

PANews reported on March 8th that SlowMist CISO 23pads published an article on the X platform stating that the PinchBench benchmark test evaluates the performance of AI large language models in the OpenClaw agent task. The results show that Gemini 3 Flash leads with a success rate of 95.1% in processing the OpenClaw task, while minimax-m2.1 and kimi-k2.5 rank second and third with 93.6% and 93.4% respectively. Claude Sonnet 4.5 achieves 92.7%, and GPT-4o achieves 85.2%.

Disclaimer: The articles reposted on this site are sourced from public platforms and are provided for informational purposes only. They do not necessarily reflect the views of MEXC. All rights remain with the original authors. If you believe any content infringes on third-party rights, please contact [email protected] for removal. MEXC makes no guarantees regarding the accuracy, completeness, or timeliness of the content and is not responsible for any actions taken based on the information provided. The content does not constitute financial, legal, or other professional advice, nor should it be considered a recommendation or endorsement by MEXC.