Ollama
Show Model Information
Retrieves detailed information about an installed Ollama model including capabilities and parameters
GET
Query Parameters
Name of the model to inspect (e.g., “llama2”, “mistral:7b”)
Include verbose model information
Optional Ollama server URL (defaults to http://localhost:11434)
Response
Indicates if the request was successful
Model capabilities and features (e.g., [“chat”, “completion”, “embedding”])
Detailed model information
Notes
- This endpoint only works for installed models (use List Models to see what’s installed)
- The
capabilitiesarray indicates what the model can do (chat, embeddings, etc.) quantization_levelaffects model size and performance- Use
verbose=truefor additional technical details
Common Quantization Levels
- Q4_0: 4-bit quantization, smallest size, lower quality
- Q5_K_M: 5-bit medium quality
- Q8_0: 8-bit quantization, larger size, higher quality

