Commit Graph

74 Commits

Author SHA1 Message Date
Luka Govedič 71c60491f2
[Kernel] Build flash-attn from source (#8245) 2024-09-20 23:27:10 -07:00
Aaron Pham 9d104b5beb
[CI/Build] Update Ruff version (#8469)
Signed-off-by: Aaron Pham <contact@aarnphm.xyz>
Co-authored-by: Cyrus Leung <cyrus.tl.leung@gmail.com>
2024-09-18 11:00:56 +00:00
Cody Yu a65cb16067
[MISC] Dump model runner inputs when crashing (#8305) 2024-09-12 01:12:25 +00:00
bnellnm 73202dbe77
[Kernel][Misc] register ops to prevent graph breaks (#6917)
Co-authored-by: Sage Moore <sage@neuralmagic.com>
2024-09-11 12:52:19 -07:00
Simon Mo dc0b6066ab
[CI] Change PR remainder to avoid at-mentions (#8134) 2024-09-03 14:11:42 -07:00
Kevin H. Luu f1575dc99f
[ci] Fix GHA workflow (#8129)
Signed-off-by: kevin <kevin@anyscale.com>
2024-09-03 13:25:09 -07:00
Kevin H. Luu 95a178f861
[CI] Only PR reviewers/committers can trigger CI on PR (#8124)
Signed-off-by: kevin <kevin@anyscale.com>
2024-09-03 11:32:27 -07:00
Jonathan Berkhahn 9c71c97ae2
[mypy] Enable mypy type checking for `vllm/core` (#7229) 2024-08-28 07:11:14 +08:00
youkaichao 35ee2ad6b9
[github][misc] promote asking llm first (#7809) 2024-08-23 09:38:50 -07:00
Cyrus Leung baaedfdb2d
[mypy] Enable following imports for entrypoints (#7248)
Co-authored-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>
Co-authored-by: Fei <dfdfcai4@gmail.com>
2024-08-20 23:28:21 -07:00
Kevin H. Luu 5469146bcc
[ci] Remove fast check cancel workflow (#7455) 2024-08-12 21:19:51 -07:00
Kevin H. Luu 1137f343aa
[ci] Cancel fastcheck when PR is ready (#7433)
Signed-off-by: kevin <kevin@anyscale.com>
2024-08-12 10:59:14 -07:00
Kevin H. Luu 9b3e2edd30
[ci] Cancel fastcheck run when PR is marked ready (#7427)
Signed-off-by: kevin <kevin@anyscale.com>
2024-08-12 10:56:52 -07:00
Cyrus Leung 7eb4a51c5f
[Core] Support serving encoder/decoder models (#7258) 2024-08-09 10:39:41 +08:00
Michael Goin e14fb22e59
[Doc] Put collect_env issue output in a <detail> block (#7310) 2024-08-08 11:22:49 -07:00
Isotr0py 360bd67cf0
[Core] Support loading GGUF model (#5191)
Co-authored-by: Michael Goin <michael@neuralmagic.com>
2024-08-05 17:54:23 -06:00
Jee Jee Li f80ab3521c
Clean up remaining Punica C information (#7027) 2024-08-04 15:37:08 -07:00
Michael Goin b482b9a5b1
[CI/Build] Add support for Python 3.12 (#7035) 2024-08-02 13:51:22 -07:00
Cyrus Leung 3bb4b1e4cd
[mypy] Speed up mypy checking (#7056) 2024-08-01 19:49:43 -07:00
Sage Moore 7e0861bd0b
[CI/Build] Update PyTorch to 2.4.0 (#6951)
Co-authored-by: Michael Goin <michael@neuralmagic.com>
2024-08-01 11:11:24 -07:00
Jee Jee Li 7ecee34321
[Kernel][RFC] Refactor the punica kernel based on Triton (#5036) 2024-07-31 17:12:24 -07:00
Cyrus Leung da1f7cc12a
[mypy] Enable following imports for some directories (#6681) 2024-07-31 10:38:03 +08:00
Kevin H. Luu 4fbf4aa128
[ci] GHA workflow to remove ready label upon "/notready" comment (#6921)
Signed-off-by: kevin <kevin@anyscale.com>
2024-07-29 17:03:45 -07:00
Kevin H. Luu 2d4733ba2d
Fix PR comment bot (#6554)
Signed-off-by: kevin <kevin@anyscale.com>
2024-07-18 14:48:29 -07:00
Kevin H. Luu ecdb462c24
[ci] Reword Github bot comment (#6534) 2024-07-18 08:01:45 -07:00
Simon Mo 9bfece89fd
Add FUNDING.yml (#6435) 2024-07-14 20:36:16 -07:00
Kevin H. Luu 16ff6bd58c
[ci] Fix wording for GH bot (#6398)
Signed-off-by: kevin <kevin@anyscale.com>
2024-07-12 16:34:37 -07:00
Cody Yu aa48e502fb
[MISC] Upgrade dependency to PyTorch 2.3.1 (#5327) 2024-07-12 12:04:26 -07:00
Kevin H. Luu 4dbebd03cc
[ci] Add GHA workflows to enable full CI run (#6381)
Signed-off-by: kevin <kevin@anyscale.com>
2024-07-12 11:36:26 -07:00
Benjamin Muskalla b422d4961a
[CI/Build] Enable mypy typing for remaining folders (#6268) 2024-07-10 22:15:55 +08:00
Cyrus Leung 0e9164b40a
[mypy] Enable type checking for test directory (#5017) 2024-06-15 04:45:31 +00:00
SangBin Cho 847cdcca1c
[CI] Upgrade codespell version. (#5381) 2024-06-12 10:06:14 -07:00
Cyrus Leung 7a64d24aad
[Core] Support image processor (#4197) 2024-06-02 22:56:41 -07:00
youkaichao 325c119961
[Misc] add logging level env var (#5045) 2024-05-24 23:49:49 -07:00
Michael Goin 5f6d10c14c
[CI/Build] Enforce style for C++ and CUDA code with `clang-format` (#4722) 2024-05-22 07:18:41 +00:00
Simon Mo e186d37cb1
[CI] use ccache actions properly in release workflow (#4629) 2024-05-06 22:23:36 +00:00
Simon Mo c7f2cf2b7f
[CI] Reduce wheel size by not shipping debug symbols (#4602) 2024-05-04 21:28:58 -07:00
Simon Mo 8d8357c8ed
bump version to v0.4.2 (#4600) 2024-05-04 17:09:49 -07:00
SangBin Cho fb087af52e
[mypy][7/N] Cover all directories (#4555) 2024-05-02 10:47:41 -07:00
SangBin Cho cf8cac8c70
[mypy][6/N] Fix all the core subdirectory typing (#4450)
Co-authored-by: Cade Daniel <edacih@gmail.com>
2024-05-02 03:01:00 +00:00
Michael Goin d627a3d837
[Misc] Upgrade to `torch==2.3.0` (#4454) 2024-04-29 20:05:47 -04:00
SangBin Cho df29793dc7
[mypy][5/N] Support all typing on model executor (#4427) 2024-04-28 19:01:26 -07:00
youkaichao aba47be3fe
[Misc] add RFC issue template (#4401)
Co-authored-by: Simon Mo <simon.mo@hey.com>
2024-04-26 15:47:45 -07:00
SangBin Cho b5b4a398a7
[Mypy] Typing lora folder (#4337) 2024-04-25 19:13:50 +00:00
Simon Mo 62b5166bd4
[CI] Add ccache for wheel builds job (#4281) 2024-04-23 09:51:41 -07:00
SangBin Cho 0ae11f78ab
[Mypy] Part 3 fix typing for nested directories for most of directory (#4161) 2024-04-22 21:32:44 -07:00
youkaichao 15b86408a8
[Misc] add nccl in collect env (#4211) 2024-04-19 19:44:51 +00:00
youkaichao 8a7a3e4436
[Core] add an option to log every function call to for debugging hang/crash in distributed inference (#4079)
Co-authored-by: Simon Mo <simon.mo@hey.com>
2024-04-18 16:15:12 -07:00
SangBin Cho 533d2a1f39
[Typing] Mypy typing part 2 (#4043)
Co-authored-by: SangBin Cho <sangcho@sangcho-LT93GQWG9C.local>
2024-04-17 17:28:43 -07:00
youkaichao 546e721168
[CI/Test] expand ruff and yapf for all supported python version (#4037) 2024-04-13 01:43:37 +00:00