docs/content/rest/models/inference.md at b0e01d5bfb400a10fc656102b75219db6ad02c66

mirror of synced 2026-01-04 00:06:20 -05:00

Files

Kevin Heis f7a477b87b GitHub models REST API docs (#55288 )

Co-authored-by: Roniece Ricardo <33437850+RonRicardo@users.noreply.github.com>
Co-authored-by: Daniel Garman <garman@github.com>
Co-authored-by: SiaraMist <siaramist@github.com>
Co-authored-by: Evan Bonsignori <ebonsignori@github.com>

2025-05-15 18:56:40 +00:00

901 B

Raw Blame History

title, shortTitle, intro, versions, topics, autogenerated, allowTitleToDifferFromFilename

title

shortTitle

intro

versions

topics

autogenerated

allowTitleToDifferFromFilename

REST API endpoints for models inference

Inference

Use the REST API to submit a chat completion request to a specified model, with or without organizational attribution.

fpt
*

API

rest

true

About {% data variables.product.prodname_github_models %} inference

You can use the REST API to run inference requests using the {% data variables.product.prodname_github_models %} platform.

The API supports:

Accessing top models from OpenAI, DeepSeek, Microsoft, Llama, and more.
Running chat-based inference requests with full control over sampling and response parameters.
Streaming or non-streaming completions.
Organizational attribution and usage tracking.

901 B Raw Blame History

About {% data variables.product.prodname_github_models %} inference

901 B

Raw Blame History