GET /api/v1/ollama/show-model
Example Request

curl --request GET \
  --url 'http://localhost:3000/api/v1/ollama/show-model?modelName=llama2&verbose=true' \
  --header 'Authorization: Bearer <token>'

Example Response

{
  "success": true,
  "capabilities": ["chat", "completion", "generate"],
  "details": {
    "parent_model": "",
    "format": "gguf",
    "family": "llama",
    "families": ["llama"],
    "parameter_size": "7B",
    "quantization_level": "Q4_0"
  }
}

Query Parameters

modelName (string, required)
  Name of the model to inspect (e.g., "llama2", "mistral:7b")

verbose (boolean, default: false)
  Include verbose model information

baseUrl (string, optional)
  Ollama server URL (defaults to http://localhost:11434)
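
Query values that are themselves URLs, such as baseUrl, are easiest to pass with curl's --get and --data-urlencode flags, which percent-encode them automatically. A minimal sketch (the remote host below is a placeholder):

curl --get \
  --url 'http://localhost:3000/api/v1/ollama/show-model' \
  --header 'Authorization: Bearer <token>' \
  --data-urlencode 'modelName=mistral:7b' \
  --data-urlencode 'baseUrl=http://192.168.1.50:11434'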

Response

success (boolean)
  Indicates if the request was successful

capabilities (array)
  Model capabilities and features (e.g., ["chat", "completion", "embedding"])

details (object)
  Detailed model information (format, family, parameter_size, quantization_level, etc.)
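
As a quick sketch, individual fields can be pulled out of the response with jq (assuming jq is installed; field names as documented above):

# Print just the capabilities array
curl --silent \
  --url 'http://localhost:3000/api/v1/ollama/show-model?modelName=llama2' \
  --header 'Authorization: Bearer <token>' \
  | jq '.capabilities'
# ["chat", "completion", "generate"]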

Notes

  • This endpoint only works for installed models (use List Models to see what’s installed)
  • The capabilities array indicates what the model can do (chat, embeddings, etc.); see the sketch after these notes
  • quantization_level affects model size and performance
  • Use verbose=true for additional technical details
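
For example, a small shell sketch that gates a workflow on a reported capability; jq's --exit-status flag makes the pipeline's exit code reflect whether the value was found ("embedding" here matches the capability named above):

# Exit 0 only if the model reports the "embedding" capability
if curl --silent \
     --url 'http://localhost:3000/api/v1/ollama/show-model?modelName=llama2' \
     --header 'Authorization: Bearer <token>' \
   | jq --exit-status '.capabilities | index("embedding")' > /dev/null
then
  echo "llama2 supports embeddings"
else
  echo "llama2 does not support embeddings"
fi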

Common Quantization Levels

  • Q4_0: 4-bit quantization, smallest size, lower quality
  • Q5_K_M: 5-bit quantization, medium size and quality
  • Q8_0: 8-bit quantization, larger size, higher quality
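
Along the same lines, an installed model's quantization level can be read straight from the details object (a sketch; jq assumed available):

curl --silent \
  --url 'http://localhost:3000/api/v1/ollama/show-model?modelName=llama2' \
  --header 'Authorization: Bearer <token>' \
  | jq --raw-output '.details.quantization_level'
# Q4_0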