[Roadmap] vLLM Roadmap Q4 2024 #9006

simon-mo · 2024-10-01T17:39:50Z

IsaacRe · 2024-10-02T19:33:07Z

Support for KV cache compression

ksjadeja · 2024-10-04T17:01:03Z

Are there plans to support #5540? We have a production-level use case and would really appreciate it if someone could look into it in Q4 or later.

simon-mo changed the title ~~[Roadmap]: vLLM Roadmap Q4 2024~~ [Roadmap] vLLM Roadmap Q4 2024 Oct 1, 2024

simon-mo mentioned this issue Oct 1, 2024

[Roadmap] vLLM Roadmap Q3 2024 #5805

Closed

46 tasks

simon-mo pinned this issue Oct 1, 2024

amd-abhikulk mentioned this issue Oct 4, 2024

[Misc]: Need to understand support for torch.compile in Q4 roadmap #9072

Open

1 task
