Author | Commit | Message | Date
Luka Govedič | 71c60491f2 | [Kernel] Build flash-attn from source (#8245) | 2024-09-20 23:27:10 -07:00
Aaron Pham | 9d104b5beb | [CI/Build] Update Ruff version (#8469); Signed-off-by: Aaron Pham <contact@aarnphm.xyz>; Co-authored-by: Cyrus Leung <cyrus.tl.leung@gmail.com> | 2024-09-18 11:00:56 +00:00
Cody Yu | a65cb16067 | [MISC] Dump model runner inputs when crashing (#8305) | 2024-09-12 01:12:25 +00:00
bnellnm | 73202dbe77 | [Kernel][Misc] register ops to prevent graph breaks (#6917); Co-authored-by: Sage Moore <sage@neuralmagic.com> | 2024-09-11 12:52:19 -07:00
Simon Mo | dc0b6066ab | [CI] Change PR remainder to avoid at-mentions (#8134) | 2024-09-03 14:11:42 -07:00
Kevin H. Luu | f1575dc99f | [ci] Fix GHA workflow (#8129); Signed-off-by: kevin <kevin@anyscale.com> | 2024-09-03 13:25:09 -07:00
Kevin H. Luu | 95a178f861 | [CI] Only PR reviewers/committers can trigger CI on PR (#8124); Signed-off-by: kevin <kevin@anyscale.com> | 2024-09-03 11:32:27 -07:00
Jonathan Berkhahn | 9c71c97ae2 | [mypy] Enable mypy type checking for `vllm/core` (#7229) | 2024-08-28 07:11:14 +08:00
youkaichao | 35ee2ad6b9 | [github][misc] promote asking llm first (#7809) | 2024-08-23 09:38:50 -07:00
Cyrus Leung | baaedfdb2d | [mypy] Enable following imports for entrypoints (#7248); Co-authored-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>; Co-authored-by: Fei <dfdfcai4@gmail.com> | 2024-08-20 23:28:21 -07:00
Kevin H. Luu | 5469146bcc | [ci] Remove fast check cancel workflow (#7455) | 2024-08-12 21:19:51 -07:00
Kevin H. Luu | 1137f343aa | [ci] Cancel fastcheck when PR is ready (#7433); Signed-off-by: kevin <kevin@anyscale.com> | 2024-08-12 10:59:14 -07:00
Kevin H. Luu | 9b3e2edd30 | [ci] Cancel fastcheck run when PR is marked ready (#7427); Signed-off-by: kevin <kevin@anyscale.com> | 2024-08-12 10:56:52 -07:00
Cyrus Leung | 7eb4a51c5f | [Core] Support serving encoder/decoder models (#7258) | 2024-08-09 10:39:41 +08:00
Michael Goin | e14fb22e59 | [Doc] Put collect_env issue output in a <detail> block (#7310) | 2024-08-08 11:22:49 -07:00
Isotr0py | 360bd67cf0 | [Core] Support loading GGUF model (#5191); Co-authored-by: Michael Goin <michael@neuralmagic.com> | 2024-08-05 17:54:23 -06:00
Jee Jee Li | f80ab3521c | Clean up remaining Punica C information (#7027) | 2024-08-04 15:37:08 -07:00
Michael Goin | b482b9a5b1 | [CI/Build] Add support for Python 3.12 (#7035) | 2024-08-02 13:51:22 -07:00
Cyrus Leung | 3bb4b1e4cd | [mypy] Speed up mypy checking (#7056) | 2024-08-01 19:49:43 -07:00
Sage Moore | 7e0861bd0b | [CI/Build] Update PyTorch to 2.4.0 (#6951); Co-authored-by: Michael Goin <michael@neuralmagic.com> | 2024-08-01 11:11:24 -07:00
Jee Jee Li | 7ecee34321 | [Kernel][RFC] Refactor the punica kernel based on Triton (#5036) | 2024-07-31 17:12:24 -07:00
Cyrus Leung | da1f7cc12a | [mypy] Enable following imports for some directories (#6681) | 2024-07-31 10:38:03 +08:00
Kevin H. Luu | 4fbf4aa128 | [ci] GHA workflow to remove ready label upon "/notready" comment (#6921); Signed-off-by: kevin <kevin@anyscale.com> | 2024-07-29 17:03:45 -07:00
Kevin H. Luu | 2d4733ba2d | Fix PR comment bot (#6554); Signed-off-by: kevin <kevin@anyscale.com> | 2024-07-18 14:48:29 -07:00
Kevin H. Luu | ecdb462c24 | [ci] Reword Github bot comment (#6534) | 2024-07-18 08:01:45 -07:00
Simon Mo | 9bfece89fd | Add FUNDING.yml (#6435) | 2024-07-14 20:36:16 -07:00
Kevin H. Luu | 16ff6bd58c | [ci] Fix wording for GH bot (#6398); Signed-off-by: kevin <kevin@anyscale.com> | 2024-07-12 16:34:37 -07:00
Cody Yu | aa48e502fb | [MISC] Upgrade dependency to PyTorch 2.3.1 (#5327) | 2024-07-12 12:04:26 -07:00
Kevin H. Luu | 4dbebd03cc | [ci] Add GHA workflows to enable full CI run (#6381); Signed-off-by: kevin <kevin@anyscale.com> | 2024-07-12 11:36:26 -07:00
Benjamin Muskalla | b422d4961a | [CI/Build] Enable mypy typing for remaining folders (#6268) | 2024-07-10 22:15:55 +08:00
Cyrus Leung | 0e9164b40a | [mypy] Enable type checking for test directory (#5017) | 2024-06-15 04:45:31 +00:00
SangBin Cho | 847cdcca1c | [CI] Upgrade codespell version. (#5381) | 2024-06-12 10:06:14 -07:00
Cyrus Leung | 7a64d24aad | [Core] Support image processor (#4197) | 2024-06-02 22:56:41 -07:00
youkaichao | 325c119961 | [Misc] add logging level env var (#5045) | 2024-05-24 23:49:49 -07:00
Michael Goin | 5f6d10c14c | [CI/Build] Enforce style for C++ and CUDA code with `clang-format` (#4722) | 2024-05-22 07:18:41 +00:00
Simon Mo | e186d37cb1 | [CI] use ccache actions properly in release workflow (#4629) | 2024-05-06 22:23:36 +00:00
Simon Mo | c7f2cf2b7f | [CI] Reduce wheel size by not shipping debug symbols (#4602) | 2024-05-04 21:28:58 -07:00
Simon Mo | 8d8357c8ed | bump version to v0.4.2 (#4600) | 2024-05-04 17:09:49 -07:00
SangBin Cho | fb087af52e | [mypy][7/N] Cover all directories (#4555) | 2024-05-02 10:47:41 -07:00
SangBin Cho | cf8cac8c70 | [mypy][6/N] Fix all the core subdirectory typing (#4450); Co-authored-by: Cade Daniel <edacih@gmail.com> | 2024-05-02 03:01:00 +00:00
Michael Goin | d627a3d837 | [Misc] Upgrade to `torch==2.3.0` (#4454) | 2024-04-29 20:05:47 -04:00
SangBin Cho | df29793dc7 | [mypy][5/N] Support all typing on model executor (#4427) | 2024-04-28 19:01:26 -07:00
youkaichao | aba47be3fe | [Misc] add RFC issue template (#4401); Co-authored-by: Simon Mo <simon.mo@hey.com> | 2024-04-26 15:47:45 -07:00
SangBin Cho | b5b4a398a7 | [Mypy] Typing lora folder (#4337) | 2024-04-25 19:13:50 +00:00
Simon Mo | 62b5166bd4 | [CI] Add ccache for wheel builds job (#4281) | 2024-04-23 09:51:41 -07:00
SangBin Cho | 0ae11f78ab | [Mypy] Part 3 fix typing for nested directories for most of directory (#4161) | 2024-04-22 21:32:44 -07:00
youkaichao | 15b86408a8 | [Misc] add nccl in collect env (#4211) | 2024-04-19 19:44:51 +00:00
youkaichao | 8a7a3e4436 | [Core] add an option to log every function call to for debugging hang/crash in distributed inference (#4079); Co-authored-by: Simon Mo <simon.mo@hey.com> | 2024-04-18 16:15:12 -07:00
SangBin Cho | 533d2a1f39 | [Typing] Mypy typing part 2 (#4043); Co-authored-by: SangBin Cho <sangcho@sangcho-LT93GQWG9C.local> | 2024-04-17 17:28:43 -07:00
youkaichao | 546e721168 | [CI/Test] expand ruff and yapf for all supported python version (#4037) | 2024-04-13 01:43:37 +00:00