vllm/examples/online_serving
John Zheng 900edbfa48
fix typo of grafana dashboard, with correct datasource (#13668)
Signed-off-by: John Zheng <john.zheng@hp.com>
2025-02-21 18:21:05 +00:00
..
chart-helm [CI/Build] Auto-fix Markdown files (#12941) 2025-02-08 04:25:15 -08:00
opentelemetry [CI/Build] Auto-fix Markdown files (#12941) 2025-02-08 04:25:15 -08:00
prometheus_grafana fix typo of grafana dashboard, with correct datasource (#13668) 2025-02-21 18:21:05 +00:00
api_client.py [Misc] Add SPDX-License-Identifier headers to python source files (#12628) 2025-02-02 11:58:18 -08:00
cohere_rerank_client.py [Misc] Add SPDX-License-Identifier headers to python source files (#12628) 2025-02-02 11:58:18 -08:00
disaggregated_prefill.sh [Bugfix] Fix a path bug in disaggregated prefill example script. (#12121) 2025-01-17 11:12:41 +08:00
gradio_openai_chatbot_webserver.py [Misc] Add SPDX-License-Identifier headers to python source files (#12628) 2025-02-02 11:58:18 -08:00
gradio_webserver.py [Misc] Add SPDX-License-Identifier headers to python source files (#12628) 2025-02-02 11:58:18 -08:00
jinaai_rerank_client.py [Misc] Add SPDX-License-Identifier headers to python source files (#12628) 2025-02-02 11:58:18 -08:00
multi-node-serving.sh [Misc] Adding script to setup ray for multi-node vllm deployments (#12913) 2025-02-20 21:16:40 -08:00
openai_chat_completion_client.py [Misc] Add SPDX-License-Identifier headers to python source files (#12628) 2025-02-02 11:58:18 -08:00
openai_chat_completion_client_for_multimodal.py [Model] Ultravox Model: Support v0.5 Release (#12912) 2025-02-10 22:02:48 +00:00
openai_chat_completion_client_with_tools.py [Misc] Add SPDX-License-Identifier headers to python source files (#12628) 2025-02-02 11:58:18 -08:00
openai_chat_completion_structured_outputs.py [Frontend] Add backend-specific options for guided decoding (#13505) 2025-02-20 15:07:58 -05:00
openai_chat_completion_with_reasoning.py [Bugfix]: Reasoning output bug according to the chat template change (#13025) 2025-02-11 15:49:03 +08:00
openai_chat_completion_with_reasoning_streaming.py [Misc] Add SPDX-License-Identifier headers to python source files (#12628) 2025-02-02 11:58:18 -08:00
openai_chat_embedding_client_for_multimodal.py [Misc] Fix typo in the example file (#12896) 2025-02-08 06:56:43 +00:00
openai_completion_client.py [Misc] Add SPDX-License-Identifier headers to python source files (#12628) 2025-02-02 11:58:18 -08:00
openai_cross_encoder_score.py [Misc] Add SPDX-License-Identifier headers to python source files (#12628) 2025-02-02 11:58:18 -08:00
openai_embedding_client.py [Misc] Add SPDX-License-Identifier headers to python source files (#12628) 2025-02-02 11:58:18 -08:00
openai_pooling_client.py [Misc] Add SPDX-License-Identifier headers to python source files (#12628) 2025-02-02 11:58:18 -08:00
openai_transcription_client.py [Frontend] Add `/v1/audio/transcriptions` OpenAI API endpoint (#12909) 2025-02-13 07:23:45 -08:00
run_cluster.sh [Doc] Move examples into categories (#11840) 2025-01-08 13:09:53 +00:00
sagemaker-entrypoint.sh [Doc] Move examples into categories (#11840) 2025-01-08 13:09:53 +00:00