API Documentation

Quick Start

InferNest is fully compatible with the OpenAI SDK. Change one line:

Python

from openai import OpenAI

client = OpenAI(
    base_url="https://infernest.xyz/v1",
    api_key="sk-your-key-here"
)

response = client.chat.completions.create(
    model="deepseek-v4-flash",
    messages=[{"role": "user", "content": "Hello!"}]
)

Node.js

import OpenAI from "openai";

const client = new OpenAI({
    baseURL: "https://infernest.xyz/v1",
    apiKey: "sk-your-key-here"
});

const response = await client.chat.completions.create({
    model: "deepseek-v4-flash",
    messages: [{ role: "user", content: "Hello!" }]
});

curl

curl https://infernest.xyz/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer sk-your-key-here" \
  -d '{"model":"deepseek-v4-flash","messages":[{"role":"user","content":"Hello!"}]}'

Authentication

Include your API key in the Authorization header:

Authorization: Bearer sk-your-key-here

Find your API key in the Dashboard.

Endpoints

EndpointDescription
POST /v1/chat/completionsChat completions
POST /v1/embeddingsText embeddings
GET /v1/modelsList available models

Model IDs

deepseek-v4-flashqwen-3.6-27bglm-5.2doubao-pro-256k

Rate Limits

Default limits per API key:

  • • 60 requests per minute
  • • 100,000 tokens per minute

Need higher limits? Contact us after signing up.