Normal view
Stack Overflow data reveals the hidden productivity tax of โalmost rightโ AI code

Stack Overflow survey shows that as more enterprise developers actually use AI tools, their expectations aren't being met by reality.Read More
Two major AI coding tools wiped out user data after making cascading mistakes
New types of AI coding assistants promise to let anyone build software by typing commands in plain English. But when these tools generate incorrect internal representations of what's happening on your computer, the results can be catastrophic.
Two recent incidents involving AI coding assistants put a spotlight on risks in the emerging field of "vibe coding"โusing natural language to generate and execute code through AI models without paying close attention to how the code works under the hood. In one case, Google's Gemini CLI destroyed user files while attempting to reorganize them. In another, Replit's AI coding service deleted a production database despite explicit instructions not to modify code.
The Gemini CLI incident unfolded when a product manager experimenting with Google's command-line tool watched the AI model execute file operations that destroyed data while attempting to reorganize folders. The destruction occurred through a series of move commands targeting a directory that never existed.
ยฉ Benj Edwards / Getty Images
Exhausted man defeats AI model in world coding championship
A Polish programmer running on fumes recently accomplished what may soon become impossible: beating an advanced AI model from OpenAI in a head-to-head coding competition. The 10-hour marathon left him "completely exhausted."
On Wednesday, programmer Przemysลaw Dฤbiak (known as "Psyho"), a former OpenAI employee, narrowly defeated the custom AI model in the AtCoder World Tour Finals 2025 Heuristic contest in Tokyo. AtCoder, a Japanese platform that hosts competitive programming contests and maintains global rankings, held what may be the first contest where an AI model competed directly against top human programmers in a major onsite world championship. During the event, the maker of ChatGPT participated as a sponsor and entered an AI model in a special exhibition match titled "Humans vs AI." Despite the tireless nature of silicon, the company walked away with second place.
"Humanity has prevailed (for now!)," wrote Dฤbiak on X, noting he had little sleep while competing in several competitions across three days. "I'm completely exhausted. ... I'm barely alive."
ยฉ Przemysลaw Dฤbiak
Claude Code revenue jumps 5.5x as Anthropic launches analytics dashboard

Anthropic has launched a powerful analytics dashboard for its Claude Code AI assistant, giving engineering leaders real-time insights into developer productivity, tool usage, and ROI on AI coding investments.Read More
Study finds AI tools made open source software developers 19 percent slower
When it comes to concrete use cases for large language models, AI companies love to point out the ways coders and software developers can use these models to increase their productivity and overall efficiency in creating computer code. However, a new randomized controlled trial has found that experienced open source coders became less efficient at coding-related tasks when they used current AI tools.
For their study, researchers at METR (Model Evaluation and Threat Research) recruited 16 software developers, each with multiple years of experience working on specific open source repositories. The study followed these developers across 246 individual "tasks" involved with maintaining those repos, such as "bug fixes, features, and refactors that would normally be part of their regular work." For half of those tasks, the developers used AI tools like Cursor Pro or Anthropic's Claude; for the others, the programmers were instructed not to use AI assistance. Expected time forecasts for each task (made before the groupings were assigned) were used as a proxy to balance out the overall difficulty of the tasks in each experimental group, and the time needed to fix pull requests based on reviewer feedback was included in the overall assessment.

Before performing the study, the developers in question expected the AI tools would lead to a 24 percent reduction in the time needed for their assigned tasks. Even after completing those tasks, the developers believed that the AI tools had made them 20 percent faster, on average. In reality, though, the AI-aided tasks ended up being completed 19 percent slower than those completed without AI tools.
ยฉ Getty Images
Amazon launches Kiro, its own Claude-powered challenger to Windsurf and Codex

Initial community reactions to Kiro were mixed, but developers were intrigued, praising the emphasis on specs, hooks and structure.Read More
-
VentureBeat
- Remaining Windsurf team and tech acquired by Cognition, makers of Devin: โWeโre friends with Anthropic againโ
Remaining Windsurf team and tech acquired by Cognition, makers of Devin: โWeโre friends with Anthropic againโ

Cognition CEO Scott Wu and interim Windsurf CEO Jeff Wang said they would start by integrating the AI-powered engineer Devin into Windsurf.Read More
AI coding tools may not speed up every developer, study shows
Moonshot AIโs Kimi K2 outperforms GPT-4 in key benchmarks โ and itโs free

Chinese AI startup Moonshot releases open-source Kimi K2 model that outperforms OpenAI and Anthropic on coding tasks with breakthrough agentic capabilities and competitive pricing.Read More