Skip to content

Question: Details of KV cache compression ratios and settings for all models #36

@bg51717

Description

@bg51717

Hello, thank you for sharing this great work!

I have some detailed questions about the KV cache compression experiments:

  1. How exactly are the KV cache compression ratios calculated?

  2. For all reported models and all compression ratios in the paper/experiments, could you share the specific parameter settings?

    • qk_rope_head_dim
    • kv_lora_rank

Thanks a lot for your time and support!

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions