Help me understand: how is a compaction endpoint not just a prompt plus a json_dump of the message history? I would understand if the prompt were the secret sauce, but you make it sound like there is more to a compaction system than just a clever prompt.
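To be concrete, here is the naive version I have in mind (a minimal sketch; `client.complete` is a hypothetical stand-in, not a real SDK call):

```python
import json

# Hypothetical sketch of "compaction = prompt + json_dump of history".
# COMPACTION_PROMPT and client.complete are illustrative names only.
COMPACTION_PROMPT = (
    "Summarize the following conversation so an agent can resume it. "
    "Preserve open tasks, file paths, and decisions:\n\n"
)

def naive_compact(client, messages: list[dict]) -> list[dict]:
    summary = client.complete(COMPACTION_PROMPT + json.dumps(messages))
    # The entire history is replaced by a single summary message.
    return [{"role": "system", "content": summary}]
```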
Their models are specifically trained for their tools. Take the `apply_patch` tool: you would think it's just another file-editing tool, but its unique diff format is trained into their models, and it works better than the generic file-editing tools implemented in other clients. I can also confirm their compaction is best in class: I've implemented my own client using their API, and gpt-5.2 can work for hours and process millions of input tokens very effectively.
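For anyone who hasn't seen it, the `apply_patch` input looks roughly like this (from memory, so treat the markers as approximate):

```python
# A rough sketch of the apply_patch envelope as I understand it;
# section markers recalled from memory, details may be off.
PATCH = """\
*** Begin Patch
*** Update File: src/example.py
@@ def greet():
-    print("helo")
+    print("hello")
*** End Patch
"""
```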
Could they be operating entirely in latent space? It seems plausible to me that you could operate directly on the embedding of the conversation and treat compaction as an optimization/compression problem.
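A toy picture of what I mean, using low-rank approximation as the compression objective (purely illustrative; I'm not claiming OpenAI does anything like this):

```python
import numpy as np

# Toy "compaction as compression in embedding space": approximate
# N conversation-token embeddings with a rank-k representation.
# SVD solves min ||E - E_hat|| over all rank-k matrices E_hat.
rng = np.random.default_rng(0)
E = rng.standard_normal((4096, 768))   # N=4096 embeddings, dim 768

k = 64
U, S, Vt = np.linalg.svd(E, full_matrices=False)
coeffs = U[:, :k] * S[:k]              # N x k coefficients
basis = Vt[:k]                         # k basis vectors to keep
E_hat = coeffs @ basis                 # best rank-k reconstruction

err = np.linalg.norm(E - E_hat) / np.linalg.norm(E)
print(f"relative reconstruction error at k={k}: {err:.3f}")
```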
Yes, Codex compaction is in the latent space (as confirmed in the article):
> the Responses API has evolved to support a special /responses/compact endpoint [...] it returns an opaque encrypted_content item that preserves the model’s latent understanding of the original conversation
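A hedged sketch of what a call might look like (the endpoint path is from the article; the payload and response shapes below are my guesses, not a documented schema):

```python
import requests

API_KEY = "sk-..."  # your API key

# Endpoint path per the article; field names here are illustrative.
history = [
    {"role": "user", "content": "Build the project and fix any errors."},
    {"role": "assistant", "content": "Done. Three tests were failing."},
]
resp = requests.post(
    "https://api.openai.com/v1/responses/compact",
    headers={"Authorization": f"Bearer {API_KEY}"},
    json={"model": "gpt-5.2", "input": history},
)
items = resp.json().get("output", [])
# Keep the opaque blob and substitute it for the raw history next turn.
opaque = [item for item in items if "encrypted_content" in item]
```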
Is this what they mean by "encryption" - as in "no human-readable text"? Or are they actually encrypting the compaction outputs before sending them back to the client? If so, why?
"encrypted_content" is just a poorly worded variable name that indicates the content of that "item" should be treated as an opaque foreign key. No actual encryption (in the cryptographic sense) is involved.
This is not correct: encrypted content is in fact encrypted. For OpenAI to support ZDR (Zero Data Retention), there needs to be a way for you to store reasoning content client-side without being able to see the actual tokens. The tokens need to stay secret because they often contain reasoning related to safety and instruction following. So OpenAI gives the content to you encrypted and keeps the decryption keys on their side, so it can be re-rendered into tokens when handed back to the model.
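Concretely, the round-trip looks roughly like this with the Responses API (a sketch from memory of the docs; double-check the exact field names):

```python
from openai import OpenAI

client = OpenAI()

# With store=False (required for ZDR), ask the API to return the
# reasoning as an encrypted blob you can persist client-side.
first = client.responses.create(
    model="gpt-5.2",
    input=[{"role": "user", "content": "Refactor the parser."}],
    store=False,
    include=["reasoning.encrypted_content"],
)

# You never see the reasoning tokens, only ciphertext. Send the output
# items back verbatim; OpenAI decrypts them server-side for the model.
followup = client.responses.create(
    model="gpt-5.2",
    input=list(first.output)
    + [{"role": "user", "content": "Now add tests."}],
    store=False,
    include=["reasoning.encrypted_content"],
)
```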
There is also another reason: preventing attacks that involve injecting things into reasoning blocks. Anthropic has published some studies on this. By using encrypted content, OpenAI can rely on it not being modified. OpenAI and Anthropic have also started validating that you aren't removing these messages between requests in certain modes, like extended thinking, for safety and performance reasons.
Hmmm, no, I don't know this for sure. In my testing, the /compact endpoint seems to work almost too well for large/complex conversations, and it feels like the blob cannot contain the model's entire latent state, so I assumed it keeps pointers inside it (à la previous_response_id). On the other hand, OpenAI says it's stateless and compatible with Zero Data Retention, so maybe it can contain everything.
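For comparison, here is the stateful pointer mechanism I'm alluding to (`previous_response_id` is a real Responses API parameter; the rest is a sketch):

```python
from openai import OpenAI

client = OpenAI()

# Stateful chaining: the server keeps the history, you pass a pointer.
first = client.responses.create(model="gpt-5.2", input="Start a task.")
second = client.responses.create(
    model="gpt-5.2",
    input="Continue.",
    previous_response_id=first.id,
)
# If /compact worked this way internally, the blob would reference
# server-side state, which ZDR compatibility seems to rule out.
```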