vllm.benchmarks.lib.ready_checker
Utilities for checking endpoint readiness.
wait_for_endpoint async
wait_for_endpoint(
    request_func: RequestFunc,
    test_input: RequestFuncInput,
    session: ClientSession,
    timeout_seconds: int = 600,
    retry_interval: int = 5,
) -> RequestFuncOutput
Wait for an endpoint to become available before starting benchmarks.
Parameters:
| Name | Type | Description | Default |
|---|---|---|---|
| request_func | RequestFunc | The async request function to call | required |
| test_input | RequestFuncInput | The RequestFuncInput to test with | required |
| timeout_seconds | int | Maximum time to wait, in seconds (default: 10 minutes) | 600 |
| retry_interval | int | Time between retries, in seconds | 5 |
Returns:
| Name | Type | Description |
|---|---|---|
| RequestFuncOutput | RequestFuncOutput | The successful response |
Raises:
| Type | Description |
|---|---|
| ValueError | If the endpoint doesn't become available within the timeout |
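
The behavior documented above (retry at a fixed interval, give up after a timeout, raise `ValueError`) can be illustrated with a small stand-alone polling loop. This is only a sketch of the general pattern, not the actual vLLM implementation: `wait_until_ready`, `probe`, and `flaky_probe` are hypothetical names introduced for illustration, and the probe is a plain async callable rather than a `RequestFunc`.

```python
import asyncio
import time
from typing import Awaitable, Callable


async def wait_until_ready(
    probe: Callable[[], Awaitable[bool]],  # hypothetical stand-in for a request call
    timeout_seconds: float = 600,
    retry_interval: float = 5,
) -> int:
    """Poll `probe` until it returns True and return the number of attempts.

    Raises ValueError if the deadline passes first, mirroring the
    documented contract of wait_for_endpoint (an assumption about the
    pattern, not a copy of its code).
    """
    deadline = time.monotonic() + timeout_seconds
    attempts = 0
    while True:
        attempts += 1
        try:
            if await probe():
                return attempts
        except Exception:
            # Treat any error (e.g. connection refused) as "not ready yet".
            pass
        if time.monotonic() >= deadline:
            raise ValueError(
                f"Endpoint did not become available within {timeout_seconds} seconds"
            )
        await asyncio.sleep(retry_interval)


async def _demo() -> None:
    calls = 0

    async def flaky_probe() -> bool:
        # Hypothetical probe: reports "not ready" twice, then succeeds.
        nonlocal calls
        calls += 1
        return calls >= 3

    attempts = await wait_until_ready(
        flaky_probe, timeout_seconds=5, retry_interval=0.01
    )
    print(f"ready after {attempts} attempts")


if __name__ == "__main__":
    asyncio.run(_demo())
```

The sketch uses `time.monotonic()` for the deadline so that wall-clock adjustments do not shorten or extend the wait, and it sleeps with `asyncio.sleep` so that other tasks on the event loop keep running between retries.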