Technique 4: Compress old chat history.
After 30 messages, your AI is rereading the entire conversation every single turn.
Instead, summarize older messages into one paragraph and keep only the last 5 messages in full.
Same context with 10x fewer tokens per turn.