❌

Normal view

Received yesterday β€” 15 July 2025

Study finds AI tools made open source software developers 19 percent slower

14 July 2025 at 20:02

When it comes to concrete use cases for large language models, AI companies love to point out the ways coders and software developers can use these models to increase their productivity and overall efficiency in creating computer code. However, a new randomized controlled trial has found that experienced open source coders became less efficient at coding-related tasks when they used current AI tools.

For their study, researchers at METR (Model Evaluation and Threat Research) recruited 16 software developers, each with multiple years of experience working on specific open source repositories. The study followed these developers across 246 individual "tasks" involved with maintaining those repos, such as "bug fixes, features, and refactors that would normally be part of their regular work." For half of those tasks, the developers used AI tools like Cursor Pro or Anthropic's Claude; for the others, the programmers were instructed not to use AI assistance. Expected time forecasts for each task (made before the groupings were assigned) were used as a proxy to balance out the overall difficulty of the tasks in each experimental group, and the time needed to fix pull requests based on reviewer feedback was included in the overall assessment.

Experts and the developers themselves expected time savings that didn't materialize when AI tools were actually used. Credit: METR

Before performing the study, the developers in question expected the AI tools would lead to a 24 percent reduction in the time needed for their assigned tasks. Even after completing those tasks, the developers believed that the AI tools had made them 20 percent faster, on average. In reality, though, the AI-aided tasks ended up being completed 19 percent slower than those completed without AI tools.

Read full article

Comments

Β© Getty Images

Received before yesterday

OpenAI launches Codex, an AI coding agent, in ChatGPT

16 May 2025 at 15:00
OpenAI announced on Friday it’s launching a research preview of Codex, the company’s most capable AI coding agent yet. Codex is powered by codex-1, a version of the company’s o3 AI reasoning model optimized for software engineering tasks. OpenAI says codex-1 produces β€œcleaner” code than o3, adheres more precisely to instructions, and will iteratively run […]

Anysphere, which makes Cursor, has reportedly raised $900M at $9B valuation

5 May 2025 at 06:27
Anysphere, the maker of AI-powered coding tool Cursor, has attracted $900 million in a fresh round of funding led by Thrive Capital, The Financial Times reported, citing anonymous sources familiar with the deal. Andreessen Horowitz (a16z) and Accel are also participating in the round, which values Anysphere at about $9 billion, the report said. Cursor […]
❌