vllm/tests/models
youkaichao dc5ce861bf
[torch.compile] remove compilation_context and simplify code (#10838)
Signed-off-by: youkaichao <youkaichao@gmail.com>
2024-12-03 06:19:02 +00:00
decoder_only [torch.compile] remove compilation_context and simplify code (#10838) 2024-12-03 06:19:02 +00:00
embedding [Model] Replace embedding models with pooling adapter (#10769) 2024-12-01 08:02:54 +08:00
encoder_decoder [Model] Support Qwen2 embeddings and use tags to select model tests (#10184) 2024-11-14 20:23:09 -08:00
fixtures [CI/Build] Update pixtral tests to use JSON (#8436) 2024-09-13 03:47:52 +00:00
__init__.py [CI/Build] Move `test_utils.py` to `tests/utils.py` (#4425) 2024-05-13 23:50:09 +09:00
registry.py [Model] Added GLM-4 series hf format model support vllm==0.6.4 (#10561) 2024-11-28 14:53:31 +00:00
test_initialization.py [Model] Added GLM-4 series hf format model support vllm==0.6.4 (#10561) 2024-11-28 14:53:31 +00:00
test_oot_registration.py [Model] Explicit interface for vLLM models and support OOT embedding models (#9108) 2024-10-07 06:10:35 +00:00
test_registry.py [Misc] Rename embedding classes to pooling (#10801) 2024-12-01 14:36:51 +08:00
utils.py [CI/Build] Update CPU tests to include all "standard" tests (#5481) 2024-11-08 23:30:04 +08:00