Lags to a halt on long chats

A lot of people complained about this before, maybe have a percentage of the chat unloaded while its out of frame to solve this? The models do have 1M context length after all, we just cant use it because of the lag.