Co-authored-by: Roniece Ricardo <33437850+RonRicardo@users.noreply.github.com> Co-authored-by: Daniel Garman <garman@github.com> Co-authored-by: SiaraMist <siaramist@github.com> Co-authored-by: Evan Bonsignori <ebonsignori@github.com>
901 B
901 B
title, shortTitle, intro, versions, topics, autogenerated, allowTitleToDifferFromFilename
| title | shortTitle | intro | versions | topics | autogenerated | allowTitleToDifferFromFilename | |||
|---|---|---|---|---|---|---|---|---|---|
| REST API endpoints for models inference | Inference | Use the REST API to submit a chat completion request to a specified model, with or without organizational attribution. |
|
|
rest | true |
About {% data variables.product.prodname_github_models %} inference
You can use the REST API to run inference requests using the {% data variables.product.prodname_github_models %} platform.
The API supports:
- Accessing top models from OpenAI, DeepSeek, Microsoft, Llama, and more.
- Running chat-based inference requests with full control over sampling and response parameters.
- Streaming or non-streaming completions.
- Organizational attribution and usage tracking.