General

AI Tools Are Not Time Machines

Sam GutentagSam Gutentag
July 11, 2025

A recent study by Metr suggested that AI slows open-source development by 19%. Digging into the study findings, we can't help but wonder, really?

AI tools like Cursor aren’t built to eliminate engineering as some form of Golden Hammer but instead are intended to quickly get developers up and running on features, fixes, or entire new projects. 

When devs don’t have to handwrite every test or rewire every interface, they’re free to go deeper into correctness, edge cases, and long-term maintainability. 

The study itself admits this. One developer reflected, “If I didn’t have AI, I probably [...] not gone so ham on nailing the correct Python tooling.” Another noted, “Some of [code changes] were a little tedious, and so I am like is this going to [be worth it without AI], but with AI [I make the changes].”  

AI encourages developers to do the “nice-to-have” engineering that makes systems better.

Shifted Time, Not Wasted Time

The reported 19% slowdown sounds dramatic until you do the math. If a task takes two hours, that’s 23 extra minutes. But how often does that number even hold? Software work isn’t that deterministic. A single developer working on a single task running longer than expected say, by just three days could fully account for that 19% swing across the entire study sample.

And here’s the kicker: the study observed just 16 developers across a combined 250 hours of work. That’s roughly four days per person. For a field like software engineering where one missed spec, dependency conflict, or infra issue can nuke your velocity, that’s a statistically fragile dataset. The margin of error here is easily larger than the effect size. More bluntly: this sample is orders of magnitude too small to draw high-confidence conclusions at the granularity the authors aim for.

Mastery Comes From Reps

Like any developer tool, Cursor isn't magic out of the box and requires practice and experience to become a force multiplier. Put in the reps, and you'll get better results. The data actually backs this up.

In appendix C.2.7 of the study, researchers analyzed productivity results based on the number of hours of Cursor experience each developer had. No substantial gains were observed across the first 50 hours, but positive speedups emerged past the 50-hour mark. 

Think about that: the “upper bound” of experience in this study was just one dev who had barely more than a full work week of time with the tool, and they saw improvements. That’s not a conclusive data set, it’s a signal. With more reps, configuration, and learning, the benefits compound.

Where AI tooling helps get the grunt work out of the way, it won’t design architecture or reason through your edge cases but it gives you more time to do just that. 

The old saying that a poor craftsman blames their tools holds firm;  time in the editor isn’t a cost and a thoughtful developer optimizes their workflows to use meaningful tools and techniques. 

This isn’t less work. It’s better-spent time.

You're missing the point if you’re still measuring AI by how fast it types or how quickly it gets you through tickets. Refine your workflows. Optimize your editor. Get the reps. The future isn’t about working faster it’s about solving deeper problems sooner.

Try it yourself or
request a demo

Get started for free

Try it yourself or
Request a Demo

Free for first 5 users