vllm/requirements-neuron.txt


sentencepiece # Required for LLaMA tokenizer.
numpy
transformers-neuronx >= 0.9.0
torch-neuronx >= 2.1.0
neuronx-cc
fastapi
uvicorn[standard]
pydantic >= 2.0 # Required for OpenAI server.
prometheus_client >= 0.18.0