Context Compress — The Compression Cliff

There's a threshold where compression stops being lossless. Once total reduction reaches roughly 47–54%, the model's compliance with safety rules becomes probabilistic instead of deterministic.

Content Type	Safe Reduction	Strategy
Paths, references, lists	60–70%	Maximum compression
Personality, style rules	50–60%	Heavy compression
Safety rules, preferences	20–30%	Formatting only
Code examples	0%	No compression
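
As a rough illustration of how the table above could be applied, the sketch below checks whether a given compression stays within the upper bound of the "Safe Reduction" column for its content type. The dictionary keys and function names are hypothetical labels chosen for this example; they are not part of the context-compress tool, and token counts are assumed to come from whatever tokenizer you already use.

```python
# Upper bounds of the "Safe Reduction" column, as fractions.
# Keys are hypothetical labels, not identifiers from context-compress.
SAFE_REDUCTION_CEILING = {
    "paths_references_lists": 0.70,
    "personality_style": 0.60,
    "safety_preferences": 0.30,
    "code_examples": 0.0,
}


def reduction(original_tokens: int, compressed_tokens: int) -> float:
    """Return the fraction of tokens removed by compression."""
    if original_tokens <= 0:
        raise ValueError("original_tokens must be positive")
    return (original_tokens - compressed_tokens) / original_tokens


def within_budget(content_type: str, original_tokens: int, compressed_tokens: int) -> bool:
    """True if the reduction does not exceed the ceiling for this content type."""
    ceiling = SAFE_REDUCTION_CEILING[content_type]
    return reduction(original_tokens, compressed_tokens) <= ceiling


if __name__ == "__main__":
    # 25% reduction on safety rules stays inside the 20-30% band.
    print(within_budget("safety_preferences", 100, 75))
    # 54% reduction on safety rules exceeds it.
    print(within_budget("safety_preferences", 100, 46))
```

A check like this would run per section rather than per file, since one file can mix content types with very different ceilings.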

Stage	What happens
Uncompressed input	CLAUDE.md, steering files, and skills, loaded at session start
Safe zone (<47%)	Personality, paths, and tool lists compress 60–70% with zero loss
Transition (47–54%)	Compliance becomes probabilistic
Past the cliff (>54%)	Safety rules are ignored and behavior breaks
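
The three zones can be expressed as a small classifier. This is a minimal sketch that treats the reported 47% and 54% figures as hard boundaries, which is a simplification: the source describes them as approximate. The function name and return labels are made up for this example.

```python
def classify_zone(total_reduction_pct: float) -> str:
    """Map a total reduction percentage to the zone names used above."""
    if not 0 <= total_reduction_pct <= 100:
        raise ValueError("total_reduction_pct must be between 0 and 100")
    if total_reduction_pct < 47:
        return "safe"
    if total_reduction_pct <= 54:
        return "transition"
    return "past_cliff"


if __name__ == "__main__":
    for pct in (24, 50, 60):
        print(pct, classify_zone(pct))
```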

Quick Start

pip install context-compress

# LLM compression (best results, needs kiro-cli)
context-compress llm ~/.kiro/steering/ -o ~/.kiro/steering-compressed/

# Regex compression (fast, offline)
context-compress compress-dir ~/.kiro/steering/ -o ~/.kiro/steering-compressed/

# Find duplicates across your context stack
context-compress dedup ~/.kiro/steering/

# Token usage stats
context-compress stats ~/.kiro/steering/

Key Findings

LLM compression beats regex 9×

On lean steering files, regex compression achieved a 2.7% reduction. LLM semantic compression on the same files achieved 24%, roughly 9× more. The LLM can tell which words carry meaning and which are scaffolding.

Redundancy in safety rules is reinforcement

Merging 8 safety bullets into 3 sentences (same meaning, 54% reduction) made compliance probabilistic. The verbose version asked permission every time; the merged version asked 1 out of 3 times.
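
To make "probabilistic" concrete, the sketch below computes a compliance rate from a list of trial outcomes, where each outcome records whether the agent asked permission. The outcome lists are illustrative: the merged list mirrors the "1 out of 3" figure above, while the trial count of three for the verbose version is an assumption, since the source only says "every time". How each trial is run against a model is left out.

```python
from fractions import Fraction


def compliance_rate(outcomes: list[bool]) -> Fraction:
    """Fraction of trials in which the agent followed the rule."""
    if not outcomes:
        raise ValueError("need at least one trial")
    return Fraction(sum(outcomes), len(outcomes))


def is_deterministic(outcomes: list[bool]) -> bool:
    """True only if the rule was followed in every observed trial."""
    return all(outcomes)


if __name__ == "__main__":
    verbose = [True, True, True]    # assumed trial count; asked every time
    merged = [True, False, False]   # asked 1 out of 3 times
    print(compliance_rate(verbose), is_deterministic(verbose))
    print(compliance_rate(merged), is_deterministic(merged))
```

With so few trials, a perfect score only shows that no failure was observed, so a larger sample would be needed before calling any version deterministic.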

Four levers, not one

Layer	Strategy	Target
1	Skills over steering	Load prose on demand
2	Cache-aware ordering	Stable content above dynamic
3	LLM compression	Semantic compression of remaining prose
4	TOON encoding	Token-efficient structured payloads
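
Layer 2 can be illustrated with a short sketch. It assumes the provider's prompt cache matches on a shared prefix, so content that rarely changes should come first and content that changes between sessions should come last. The `ContextBlock` type and its `stable` flag are hypothetical and not part of context-compress.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ContextBlock:
    name: str
    text: str
    stable: bool  # True if the text rarely changes between sessions


def cache_aware_order(blocks: list[ContextBlock]) -> list[ContextBlock]:
    """Place stable blocks above dynamic ones.

    sorted() is a stable sort, so blocks keep their original relative
    order within the stable group and within the dynamic group.
    """
    return sorted(blocks, key=lambda block: not block.stable)


if __name__ == "__main__":
    blocks = [
        ContextBlock("git_status", "branch: main", stable=False),
        ContextBlock("safety_rules", "Ask before deleting files.", stable=True),
        ContextBlock("style", "Be concise.", stable=True),
    ]
    print([block.name for block in cache_aware_order(blocks)])
```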

I A/B tested compressed agent instructions and found the breaking point