Codewashing

April 30th, 2025

I have little understanding for people using large language models to generate slop; words and images that nobody asked for.

I have more understanding for people using large language models to generate code. Code isn’t the thing in the same way that words or images are; code is the thing that gets you to the thing.

And if a large language model hallucinates some code, you’ll find out soon enough:

With code you get a powerful form of fact checking for free. Run the code, see if it works.

But I want to push back on one justification I see repeatedly about using large language models to write code. Here’s Craig:

There are many moral and ethical issues with using LLMs, but building software feels like one of the few truly ethically “clean”(er) uses (trained on open source code, etc.)

That’s not how this works. Yes, the large language models are trained on lots of code (most of it open source), but they’re not only trained on that. That’s on top of everything else; all the stolen books, all the unpaid creative work of others.

Even Robin Sloan, who first says:

I think the case of code is especially clear, and, for me, basically settled.

…goes on to acknowledge:

But, again, it’s important to say: the code only works because of Everything. Take that data away, train a model using GitHub alone, and you’ll get a far less useful tool.

When large language models are trained on domain-specific data, it’s always in addition to the mahoosive amount of content they’ve already stolen. It’s that mohoosive amount of content—not the domain-specific data—that enables them to parse your instructions.

(Note that I’m being very delibarate in saying “parse”, not “understand.” Though make no mistake, I’m astonished at how good these tools are at parsing instructions. I say that as someone who tried to write natural language parsers for text-only adventure games back in the 1980s.)

So, sure, go ahead and use large language models to write code. But don’t fool yourself into thinking that it’s somehow ethical.

What I said here applies to code too:

If you’re going to use generative tools powered by large language models, don’t pretend you don’t know how your sausage is made.

« Newer Older »

Responses

jeffbridgforth.com

11 Likes

# Liked by Ethan Marcotte on Wednesday, April 30th, 2025 at 4:00pm

# Liked by Sean Gillies on Wednesday, April 30th, 2025 at 4:00pm

# Liked by Wayne Myers on Wednesday, April 30th, 2025 at 4:26pm

# Liked by Luke Dorny on Wednesday, April 30th, 2025 at 4:26pm

# Liked by Jason Neel on Wednesday, April 30th, 2025 at 4:26pm

# Liked by Royce Williams on Wednesday, April 30th, 2025 at 5:30pm

# Liked by Brett Jephson on Wednesday, April 30th, 2025 at 6:56pm

# Liked by Jeff Bridgforth on Wednesday, April 30th, 2025 at 9:55pm

# Liked by Patrick Nesbitt on Thursday, May 1st, 2025 at 6:06am

# Liked by Joe Crawford on Thursday, May 1st, 2025 at 9:52pm

# Liked by Bob on Tuesday, May 20th, 2025 at 7:21pm

Creativity

Thinking about priorities at UX Brighton.

Decision time

Balancing the ledger.

Crawlers

Pest control for your website.

Permission

You have the power, not Google.

Browser history

From a browser bug this morning, back to the birth of hypertext in 1945, with a look forward to a possible future for web browsers.

Related links

Keeping up appearances | deadSimpleTech

Looking at LLM usage and promotion as a cultural phenomenon, it has all of the markings of a status game. The material gains from the LLM (which are usually quite marginal) really aren’t why people are doing it: they’re doing it because in many spaces, using ChatGPT and being very optimistic about AI being the “future” raises their social status. It’s important not only to be using it, but to be seen using it and be seen supporting it and telling people who don’t use it that they’re stupid luddites who’ll inevitably be left behind by technology.

Tuesday, May 27th, 2025 9:25am

Tagged with ai machinelearning language models work hiring culture performative status

In 2025, venture capital can’t pretend everything is fine any more – Pivot to AI

Here is the state of venture capital in early 2025:

Venture capital is moribund except AI.

AI is moribund except OpenAI.

OpenAI is a weird scam that wants to burn money so fast it summons AI God.

Nobody can cash out.

Wednesday, May 14th, 2025 3:30pm

Tagged with ai machinelearning language models vc venture capital economics hype economy investments

Build It Yourself | Armin Ronacher’s Thoughts and Writings

We’re at a point in the most ecosystems where pulling in libraries is not just the default action, it’s seen positively: “Look how modular and composable my code is!” Actually, it might just be a symptom of never wanting to type out more than a few lines.

It always amazes me when people don’t view dependencies as liabilities. To me it feels like the coding equivalent of going to a loan shark. You are asking for technical debt.

There are entire companies who are making a living of supplying you with the tools needed to deal with your dependency mess. In the name of security, we’re pushed to having dependencies and keeping them up to date, despite most of those dependencies being the primary source of security problems.

But there is a simpler path. You write code yourself. Sure, it’s more work up front, but once it’s written, it’s done.

Sunday, March 16th, 2025 8:42am

Tagged with code coding programming dependencies thirdparty maintenance development security libraries

Declare your AIndependence: block AI bots, scrapers and crawlers with a single click

This is a great move from Cloudflare. I may start using their service.

Wednesday, July 3rd, 2024 3:21pm

Tagged with bots ai machinelearning language models crawlers useragents scraping cloudlflare

Should I remove this blog from Google Search?・The Jolly Teapot

There was life before Google search. There will be life after Google search.

Google is not a huge source of traffic and visibility. I get most of my visits from RSS readers, other people’s links including fellow bloggers, or websites like Hacker News. It’s hard to tell at this point since I don’t track anything, but that’s an educated guess.

Removing my website from Google would have very little impact, so I was wondering if I should just do it.

Thursday, June 27th, 2024 3:03pm

Tagged with google search delisting enshittification seo ai machinelearning language models crawlers

Previously on this day

1 year ago I wrote Composability in design systems

There’s probably a Pace Layer analogy in here somewhere.

5 years ago I wrote User agents

The web browser is your mutual friend.

10 years ago I wrote 100 words 039

Day thirty nine.

12 years ago I wrote Anniversary

It was twenty years ago today.

13 years ago I wrote Left to our own devices

Pop ‘round to the Clearleft office if you want to test a site on our devices.

21 years ago I wrote Songs from the web

iTunes 4.5 was released earlier this week.

23 years ago I wrote The Trash Compactor Debate

On the Implausibility of the Death Star’s Trash Compactor:

23 years ago I wrote Apple - eMac

Apple have released a new computer specifically for the education market - the eMac (the "e" is for education).