1
0
mirror of synced 2026-01-04 00:06:20 -05:00
Files
docs/content/rest/models/inference.md
Kevin Heis f7a477b87b GitHub models REST API docs (#55288)
Co-authored-by: Roniece Ricardo <33437850+RonRicardo@users.noreply.github.com>
Co-authored-by: Daniel Garman <garman@github.com>
Co-authored-by: SiaraMist <siaramist@github.com>
Co-authored-by: Evan Bonsignori <ebonsignori@github.com>
2025-05-15 18:56:40 +00:00

901 B

title, shortTitle, intro, versions, topics, autogenerated, allowTitleToDifferFromFilename
title shortTitle intro versions topics autogenerated allowTitleToDifferFromFilename
REST API endpoints for models inference Inference Use the REST API to submit a chat completion request to a specified model, with or without organizational attribution.
fpt
*
API
rest true

About {% data variables.product.prodname_github_models %} inference

You can use the REST API to run inference requests using the {% data variables.product.prodname_github_models %} platform.

The API supports:

  • Accessing top models from OpenAI, DeepSeek, Microsoft, Llama, and more.
  • Running chat-based inference requests with full control over sampling and response parameters.
  • Streaming or non-streaming completions.
  • Organizational attribution and usage tracking.