mirror of https://github.com/vllm-project/vllm
[Performance] Enable chunked prefill and prefix caching together (#8120)
Co-authored-by: Tao He <sighingnow@gmail.com>
Co-authored-by: Juelianqvq <Juelianqvq@noreply.github.com>
parent ec266536b7
commit bd852f2a8b