PUBLICATION: FINCH: Prompt-guided key-value cache compression for large language models MIT Press Document