Commit Graph

74 Commits

Author SHA1 Message Date
SangBin Cho 09473ee41c
[mypy] Add mypy type annotation part 1 (#4006) 2024-04-12 14:35:50 -07:00
Woosuk Kwon cfaf49a167
[Misc] Define common requirements (#3841) 2024-04-05 00:39:17 -07:00
youkaichao ca81ff5196
[Core] manage nccl via a pypi package & upgrade to pt 2.2.1 (#3805) 2024-04-04 10:26:19 -07:00
youkaichao 205b94942e
[CI/Build] fix TORCH_CUDA_ARCH_LIST in wheel build (#3801) 2024-04-02 11:54:33 -07:00
SangBin Cho 01bfb22b41
[CI] Try introducing isort. (#3495) 2024-03-25 07:59:47 -07:00
Zhuohan Li c0c17d4896
[Misc] Fix PR Template (#3478) 2024-03-18 15:00:31 -07:00
Simon Mo 8e67598aa6
[Misc] fix line length for entire codebase (#3444) 2024-03-16 00:36:29 -07:00
simon-mo ad50bf4b25 fix lint 2024-03-15 22:23:38 -07:00
youkaichao 413366e9a2
[Misc] PR templates (#3413)
Co-authored-by: Zhuohan Li <zhuohan123@gmail.com>
2024-03-15 18:25:51 -07:00
Harry Mellor a7af4538ca
Fix issue templates (#3436) 2024-03-15 21:26:00 +00:00
youkaichao dfc77408bd
[issue templates] add some issue templates (#3412) 2024-03-14 13:16:00 -07:00
Massimiliano Pronesti 93dc5a2870
chore(vllm): codespell for spell checking (#2820) 2024-02-21 18:56:01 -08:00
Philipp Moritz 390b495ff3
Don't build punica kernels by default (#2605) 2024-01-26 15:19:19 -08:00
Simon Mo 1e4277d2d1
lint: format all python file instead of just source code (#2567) 2024-01-23 15:53:06 -08:00
Woosuk Kwon b0a1d667b0
Pin PyTorch & xformers versions (#2155) 2023-12-17 01:46:54 -08:00
Woosuk Kwon f3e024bece
[CI/CD] Upgrade PyTorch version to v2.1.1 (#2045) 2023-12-11 17:48:11 -08:00
Simon Mo 5ffc0d13a2
Migrate linter from `pylint` to `ruff` (#1665) 2023-11-20 11:58:01 -08:00
Woosuk Kwon fd58b73a40
Build CUDA11.8 wheels for release (#1596) 2023-11-09 03:52:29 -08:00
Zhuohan Li 06458a0b42
Upgrade to CUDA 12 (#1527)
Co-authored-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>
2023-11-08 14:17:49 -08:00
Woosuk Kwon e8ef4c0820
Fix PyTorch index URL in workflow (#1378) 2023-10-16 12:37:56 -07:00
Woosuk Kwon 348897af31
Fix PyTorch version to 2.0.1 in workflow (#1377) 2023-10-16 11:27:17 -07:00
Zhuohan Li ba0bfd40e2
TP/quantization/weight loading refactor part 1 - Simplify parallel linear logic (#1181) 2023-10-02 15:36:09 -07:00
Daniel c393af6cd7
[Feature | CI] Added a github action to build wheels (#746) 2023-08-21 16:59:15 +09:00
Zhuohan Li 42e0c1df78
[Quality] Add CI for formatting (#343) 2023-07-03 14:50:56 -07:00