vllm/models at 0e2b026ece1627dac51e2050858dbd0213936ed6 - vllm

youkaichao dc5ce861bf [torch.compile] remove compilation_context and simplify code (#10838) Signed-off-by: youkaichao <youkaichao@gmail.com>		2024-12-03 06:19:02 +00:00
decoder_only	[torch.compile] remove compilation_context and simplify code (#10838)	2024-12-03 06:19:02 +00:00
embedding	[Model] Replace embedding models with pooling adapter (#10769)	2024-12-01 08:02:54 +08:00
encoder_decoder	[Model] Support Qwen2 embeddings and use tags to select model tests (#10184)	2024-11-14 20:23:09 -08:00
fixtures	[CI/Build] Update pixtral tests to use JSON (#8436)	2024-09-13 03:47:52 +00:00
__init__.py	[CI/Build] Move `test_utils.py` to `tests/utils.py` (#4425)	2024-05-13 23:50:09 +09:00
registry.py	[Model] Added GLM-4 series hf format model support vllm==0.6.4 (#10561)	2024-11-28 14:53:31 +00:00
test_initialization.py	[Model] Added GLM-4 series hf format model support vllm==0.6.4 (#10561)	2024-11-28 14:53:31 +00:00
test_oot_registration.py	[Model] Explicit interface for vLLM models and support OOT embedding models (#9108)	2024-10-07 06:10:35 +00:00
test_registry.py	[Misc] Rename embedding classes to pooling (#10801)	2024-12-01 14:36:51 +08:00
utils.py	[CI/Build] Update CPU tests to include all "standard" tests (#5481)	2024-11-08 23:30:04 +08:00