Default Branch

78ac0f591d · [CI/Build] fix uv caching in Dockerfile (#13611) · Updated 2025-02-23 00:25:20 +08:00

Branches

614d9c24e7 · Update hipify.py · Updated 2025-02-03 14:01:40 +08:00

349
1

0a02744dc8 · fix TP · Updated 2025-01-31 09:18:56 +08:00

392
12

0405645a6c · initial · Updated 2025-01-31 08:55:49 +08:00

390
1

68543e17aa · wip · Updated 2025-01-29 18:04:07 +08:00

399
4

39c4a4cdb5 · review comments · Updated 2025-01-29 07:08:50 +08:00

464
7

34d8e885b5 · Update benchmark-pipeline.yaml · Updated 2025-01-23 04:13:37 +08:00

495
4

a7ca0cc47f · Merge branch 'main' into moondream2 · Updated 2025-01-20 16:10:52 +08:00

540
2

1aa5adb1f7 · cuda · Updated 2025-01-17 03:15:23 +08:00

580
1

7097f31955 · test · Updated 2025-01-15 19:22:32 +08:00

779
22

e68f63ef83 · Simplify · Updated 2025-01-15 18:31:16 +08:00

601
2

c1d1875ba3 · Updates docs with correction about default cuda version · Updated 2025-01-08 06:29:07 +08:00

710
1

ab8e962352 · [Ignore] Test multi-modal models extended · Updated 2025-01-04 03:31:48 +08:00

763
1

efbce85f4d · [misc] Layerwise profile updates (#10242) · Updated 2024-12-17 02:14:57 +08:00

930
0
Included

d35febb802 · Add image repeat to the benchmark_serving.py to test hit/miss of MM cache · Updated 2024-12-14 00:33:18 +08:00

980
2

99b267c647 · encoder cache · Updated 2024-12-04 22:40:24 +08:00

1093
3

5dd94302d1 · Update test-pipeline.yaml · Updated 2024-12-04 09:55:23 +08:00

1088
2

f65626809b · Merge branch 'main' into ray-backend-v1 · Updated 2024-11-26 06:46:41 +08:00

1170
2

6e5c165e1e · [V1] VLM prefix caching: Add hashing of images · Updated 2024-11-21 01:00:18 +08:00

1260
1

4092be2925 · stash branch for brittany · Updated 2024-10-24 06:05:02 +08:00

1697
29

3794ef084d · sync · Updated 2024-10-21 22:48:37 +08:00

1758
2