Melious
Inference

Models

List and explore available AI models

Models

List and retrieve information about available AI models. Melious provides access to 45+ open-source models from leading AI research labs, hosted on European infrastructure for privacy and compliance.


List Models

GET /v1/models

Returns all available models in OpenAI format.

from openai import OpenAI

client = OpenAI(
    api_key="sk-mel-your-api-key-here",
    base_url="https://api.melious.ai/v1"
)

models = client.models.list()

for model in models.data:
    print(f"{model.id} - {model.owned_by}")
import OpenAI from 'openai';

const client = new OpenAI({
  apiKey: 'sk-mel-your-api-key-here',
  baseURL: 'https://api.melious.ai/v1'
});

const models = await client.models.list();

for (const model of models.data) {
  console.log(`${model.id} - ${model.owned_by}`);
}
curl https://api.melious.ai/v1/models \
  -H "Authorization: Bearer sk-mel-your-api-key-here"

Query Parameters

ParameterTypeDefaultDescription
include_metabooleanfalseInclude Melious-specific metadata

Response

{
  "object": "list",
  "data": [
    {
      "id": "gpt-oss-120b",
      "object": "model",
      "created": 1699999999,
      "owned_by": "Meta"
    },
    {
      "id": "qwen3-235b-a22b-instruct",
      "object": "model",
      "created": 1699999999,
      "owned_by": "Qwen"
    }
  ]
}

With Metadata

curl "https://api.melious.ai/v1/models?include_meta=true" \
  -H "Authorization: Bearer sk-mel-your-api-key-here"
{
  "object": "list",
  "data": [
    {
      "id": "gpt-oss-120b",
      "object": "model",
      "created": 1699999999,
      "owned_by": "Meta",
      "_meta": {
        "type": "chat",
        "input_modalities": ["text"],
        "output_modalities": ["text"],
        "capabilities": {
          "streaming": true,
          "function_calling": true
        },
        "context_length": 131072
      }
    }
  ]
}

Retrieve Model

GET /v1/models/{model_id}

Get details for a specific model.

model = client.models.retrieve("gpt-oss-120b")
print(f"Model: {model.id}")
print(f"Owner: {model.owned_by}")
const model = await client.models.retrieve('gpt-oss-120b');
console.log(`Model: ${model.id}`);
console.log(`Owner: ${model.owned_by}`);
curl https://api.melious.ai/v1/models/gpt-oss-120b \
  -H "Authorization: Bearer sk-mel-your-api-key-here"

Available Models

Chat Models

ModelBrandContextVisionToolsReasoning
gpt-oss-120bMeta131KNoYesNo
llama-3.1-8b-instructMeta131KNoNoNo
qwen3-235b-a22b-instructQwen268KNoYesNo
qwen3-235b-a22b-thinkingQwen262KNoYesYes
qwen3-coder-480b-a35b-instructQwen262KNoYesNo
qwen3-coder-30b-a3b-instructQwen262KNoYesNo
qwen3-30b-a3b-instructQwen262KNoYesNo
qwen3-30b-a3b-thinkingQwen262KNoYesYes
qwen3-32bQwen40KNoYesHybrid
qwen3-4b-instructQwen262KNoYesNo
qwen3-4b-thinkingQwen262KNoYesYes
deepseek-r1-0528DeepSeek131KNoYesYes
kimi-k2-instructMoonshot134KNoYesNo
kimi-k2-thinkingMoonshot134KNoYesYes
hermes-4-405bNousResearch131KNoYesHybrid
hermes-4-70bNousResearch131KNoYesHybrid
gpt-oss-120bOpenAI (OSS)128KNoYesYes
gpt-oss-20bOpenAI (OSS)128KNoYesYes
gemma-3-27bGoogle131KYesYesNo
mistral-small-3.2-24b-instructMistral131KYesYesNo
devstral-small-2505Mistral131KNoYesNo
glm-4.5ZAI131KNoYesHybrid
glm-4.5-airZAI131KNoYesHybrid
glm-4.6ZAI131KNoYesHybrid
deepseek-ocrDeepSeek32KYesYesNo
nemotron-nano-v2-12bNVIDIA131KNoYesNo
intellect-3PrimeIntellect131KNoYesNo

Audio-Input Models

ModelBrandContextDescription
voxtral-small-24b-2507Mistral131KAccepts audio input for chat

Embedding Models

ModelBrandContextDescription
bge-m3BAAI8KMultilingual, high performance
bge-multilingual-gemma2BAAI8KGemma-based multilingual
bge-en-iclBAAI32KLong context English
bge-large-en-v1.5BAAI512English, high quality
bge-base-en-v1.5BAAI512English, balanced
qwen3-embedding-8bQwen32KLong context
e5-mistral-7b-instructintfloat32KInstruction-tuned
paraphrase-multilingual-mpnetSentence Transformers512Multilingual paraphrasing

Reranking Models

ModelBrandContextDescription
bge-reranker-v2-m3BAAI8KMultilingual semantic reranking

Image Models

ModelBrandDescription
flux-devBlack Forest LabsHigh-quality generation
flux-schnellBlack Forest LabsFast generation
sdxl-base-1.0Stability AISDXL base model
sdxl-base-v10Stability AISDXL base v10
sdxl-lightning-4stepByteDanceFast 4-step generation
sdxl-lightning-8stepByteDanceQuality 8-step generation

Audio Models (Speech-to-Text)

ModelBrandDescription
whisper-large-v3OpenAIHigh-accuracy transcription
whisper-large-v3-turboOpenAIFast transcription

All models are open-source and hosted on European infrastructure. No data is sent to closed-source API providers.


Model Capabilities

Vision Support

Models with vision can process images alongside text:

{
  "input_modalities": ["text", "image"]
}

Vision-capable models:

  • gemma-3-27b (Google)
  • mistral-small-3.2-24b-instruct (Mistral)
  • deepseek-ocr (DeepSeek) - OCR and document understanding

Audio Input

For audio input with chat models:

{
  "input_modalities": ["text", "audio"]
}

Audio-input models:

  • voxtral-small-24b-2507 (Mistral)

Function Calling

Most models support function calling (tools):

{
  "capabilities": {
    "function_calling": true
  }
}

Reasoning Models

Some models are optimized for complex reasoning tasks:

Reasoning models:

  • deepseek-r1-0528 - DeepSeek reasoning model
  • kimi-k2-thinking - Moonshot thinking model
  • qwen3-235b-a22b-thinking - Qwen thinking variant
  • qwen3-30b-a3b-thinking - Qwen thinking variant
  • qwen3-4b-thinking - Qwen thinking variant
  • gpt-oss-120b / gpt-oss-20b - OpenAI open-source reasoning

Hybrid models (can operate in both modes):

  • hermes-4-405b, hermes-4-70b - NousResearch
  • glm-4.5, glm-4.5-air, glm-4.6 - ZAI
  • qwen3-32b - Qwen

Reasoning models may take longer but produce higher-quality responses for complex tasks like math, coding, and multi-step analysis.


Model Selection Guide

ModelStrengthBest For
gpt-oss-120bBest efficiency/performance, fastestGeneral use, speed priority
kimi-k2-thinkingBest reasoningComplex analysis, math, coding problems
qwen3-235b-a22b-instructBest non-reasoning qualityHigh-quality general chat, long context

By Use Case

Use CaseRecommended Models
General chatgpt-oss-120b, qwen3-235b-a22b-instruct
Complex reasoningkimi-k2-thinking, deepseek-r1-0528
Role-playinghermes-4-70b, kimi-k2-instruct
Vision/imagesmistral-small-3.2-24b-instruct, gemma-3-27b
Codingkimi-k2-thinking, qwen3-coder-480b-a35b-instruct
Long contextqwen3-235b-a22b-instruct (268K), qwen3-30b-a3b-instruct (262K)
Low latencygpt-oss-120b, qwen3-4b-instruct
Embeddingsqwen3-embedding-8b, bge-m3
Reranking/RAGbge-reranker-v2-m3
Document OCRdeepseek-ocr
Image generationflux-schnell (fast), flux-dev (quality)
Transcriptionwhisper-large-v3-turbo (fast), whisper-large-v3 (quality)

Error Codes

CodeDescription
RESOURCE_NOT_FOUNDModel doesn't exist
AUTH_INVALID_API_KEYInvalid API key

See Also

On this page