My first reaction: wow, incredible. My second reaction: still incredible, but no...

ndesaulniers · 2026-02-05T21:46:57 1770328017

> C compiler is one of the most rigorously specified pieces of software out there

/me Laughs in "unspecified behavior."

ori_b · 2026-02-05T22:03:52 1770329032

There's undefined behavior, which is quite well specified. What do you mean by unspecified behavior? Do you have an example?

ndesaulniers · 2026-02-06T07:16:48 1770362208

https://www.open-std.org/jtc1/sc22/wg14/www/docs/n3685.pdf

Read section J.1.

irishcoffee · 2026-02-05T22:30:29 1770330629

Undefined is absolutely clear in the spec.

Unspecified is whatever you want it to mean. I am also laughing, having never heard "unspecified" before.

LiamPowell · 2026-02-06T00:36:03 1770338163

Unspecified behaviour is defined in the glossary at the start of the spec and the term "unspecified" appears over a hundred times...

astrange · 2026-02-06T10:03:17 1770372197

The C spec is certainly not formal or precise.

https://www.ralfj.de/blog/2020/12/14/provenance.html

Another example is that it's unclear from the standard if you can write malloc() in C.

butterNaN · 2026-02-06T14:16:01 1770387361

Sure but the point OP is making is that it is still more spec'd than most real world problems

astrange · 2026-02-06T20:07:44 1770408464

You're welcome to try writing a C compiler and standard library doing no research other than reading the spec.

cryptonector · 2026-02-06T02:23:24 1770344604

> My second reaction:

This is the key: the more you constrain the LLM, the better it will perform. At least that's my experience with Claude. When working with existing code, the better the code to begin with, the better Claude performs, while if the code has issues then Claude can end up spinning its wheels.

softwaredoug · 2026-02-05T22:50:24 1770331824

Yes I think any codegen with a lot of tests and verification is more about “fitting” to the tests. Like fitting an ML model. It’s model training, not coding.

But a lot of programming we discover correctness as we go, one reason humans don’t completely exit the loop. We need to see and build tests as we go, giving them particular care and attention to ensure they test what matters.

uywykjdskn · 2026-02-05T23:47:53 1770335273

The agent can obviously do that