Does context caching not improve response speed?

My understanding was that it should improve both cost and response speed - is that correct?

I tried using it, and it doesn’t seem to improve the response speed at all.

I didn’t see any info about performance. It doesn’t affect neither rate limits and TPM, so I don’t think it improves response speed