GLM-5.2, Z.ai’s open-weight model, has reached 39% F1 on Semgrep’s IDOR benchmark, beating Anthropic’s Claude Code coding assistant in the prompt-only lane. Claude Code scored 37% F1 with Opus 4.6 and ...
When an AI agent causes damage, organizations are left with a question they cannot answer: Who owns the fallout?
Zhipu’s GLM 5.2 open-source AI model now sits within a percentage point of Anthropic’s Opus 4.8 on a key agentic benchmark at ...
Some more advanced smart home users are likely to fall afoul of the rule change if they directly access the SmartThings AP ...
Master ChatGPT Codex in 2026 with our comprehensive guide. Explore local automations, custom plugins, and memory features to ...
With the advent of AI-mediated APIs, the era of manually hard-coding every integration between every microservice may be ...
As enterprises increasingly demand fail-safes against single-vendor reliance, Sakana is proving that packaging collective ...
Samsung Electronics deployed ChatGPT Enterprise and Codex to its ...
Japan’s Sakana AI launched Fugu, a multi-agent orchestration system that routes queries across multiple AI models. Fugu Ultra claims benchmark parity with Anthropic’s Fable 5 and Mythos Preview. The ...
GPT-5.6 release date remains unconfirmed as June 22 opens the primary prediction window. OpenAI’s kindle-alpha cleared ...
Both models trade word-by-word generation for parallel denoising. Only one of them does it without losing intelligence in the ...
Security researchers identified a coordinated malware campaign within the JetBrains Marketplace designed to exfiltrate ...