Endpoint
Example Request
cURL
Response
A list of model objects representing every LLM currently available on the network.
Example response
Model availability changes dynamically as GPU nodes join and leave the network. Always query
GET /v1/models before relying on a specific model being available, and handle 503 Service Unavailable responses gracefully in case availability drops between your check and your inference request.