Meta: Llama 3.2 11B Vision Instruct

meta-llama/llama-3.2-11b-vision-instruct

Description

Llama 3.2 11B Vision is a multimodal model with 11 billion parameters, designed to handle tasks combining visual and textual data. It excels in tasks such as image captioning and...

API Usage Examples

OpenAI Compatible Endpoint

Use this endpoint with any OpenAI-compatible library. Model: Meta: Llama 3.2 11B Vision Instruct (meta-llama/llama-3.2-11b-vision-instruct)

curl https://api.ridvay.com/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -d '{
    "model": "meta-llama/llama-3.2-11b-vision-instruct",
    "messages": [
      {
        "role": "user",
        "content": "Explain the capabilities of the Meta: Llama 3.2 11B Vision Instruct model"
      }
    ],
    "temperature": 0.7,
    "max_tokens": 1024
  }'
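
The request above sends text only. Since the model also lists Images as a supported modality, the following is a minimal Python sketch of how an image might be attached. It assumes the endpoint accepts OpenAI-style content parts (a "text" part plus an "image_url" part carrying a base64 data URL), which this page does not confirm. The helper names build_vision_payload and send are hypothetical, and only the Python standard library is used.

```python
import base64
import json
import urllib.request

# URL and model ID are taken from the curl example above.
API_URL = "https://api.ridvay.com/v1/chat/completions"
MODEL_ID = "meta-llama/llama-3.2-11b-vision-instruct"


def build_vision_payload(prompt, image_bytes, mime_type="image/png", max_tokens=1024):
    """Build a chat-completions body with one text part and one image part.

    ASSUMPTION: the endpoint accepts OpenAI-style multimodal content parts.
    """
    encoded = base64.b64encode(image_bytes).decode("ascii")
    data_url = f"data:{mime_type};base64,{encoded}"
    return {
        "model": MODEL_ID,
        "messages": [
            {
                "role": "user",
                "content": [
                    {"type": "text", "text": prompt},
                    {"type": "image_url", "image_url": {"url": data_url}},
                ],
            }
        ],
        "temperature": 0.7,
        "max_tokens": max_tokens,
    }


def send(payload, api_key):
    """POST the payload with the same headers as the curl example and return parsed JSON."""
    request = urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",
        },
        method="POST",
    )
    with urllib.request.urlopen(request) as response:
        return json.load(response)
```

To try it, read an image with open("photo.png", "rb").read(), pass the bytes to build_vision_payload, and hand the result to send together with your API key. The file name here is only a placeholder.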

Supported Modalities

  • Text
  • Images

API Pricing

  • Input: $0.049 / 1M tokens
  • Output: $0.049 / 1M tokens
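
As a rough illustration of what these rates mean per request, the sketch below estimates cost from token counts. It assumes billing is strictly linear in tokens at the listed prices, with no minimum charge or rounding, which this page does not state. The function name estimate_cost_usd is hypothetical.

```python
from decimal import Decimal

# Listed prices in USD per 1M tokens (from the pricing list above).
INPUT_PRICE_PER_MILLION = Decimal("0.049")
OUTPUT_PRICE_PER_MILLION = Decimal("0.049")
TOKENS_PER_MILLION = Decimal(1_000_000)


def estimate_cost_usd(input_tokens, output_tokens):
    """Estimate the USD cost of one request.

    ASSUMPTION: cost is linear in token count, with no minimums or rounding.
    """
    input_cost = Decimal(input_tokens) * INPUT_PRICE_PER_MILLION / TOKENS_PER_MILLION
    output_cost = Decimal(output_tokens) * OUTPUT_PRICE_PER_MILLION / TOKENS_PER_MILLION
    return input_cost + output_cost
```

Decimal is used rather than float so that very small per-request amounts are not distorted by binary rounding.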

Token Limits

  • Max Output: 16,384 tokens
  • Max Context: 131,072 tokens
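
One way to apply these limits is to cap max_tokens before sending a request. The sketch below assumes that the context limit covers prompt and completion tokens together, a common convention that this page does not confirm. It also assumes the caller already knows the prompt's token count. The function name clamp_max_tokens is hypothetical.

```python
MAX_OUTPUT_TOKENS = 16_384    # "Max Output" from the list above
MAX_CONTEXT_TOKENS = 131_072  # "Max Context" from the list above


def clamp_max_tokens(requested, prompt_tokens):
    """Return the largest max_tokens value that respects both limits.

    ASSUMPTION: prompt and completion tokens share the same context window.
    """
    remaining_context = max(MAX_CONTEXT_TOKENS - prompt_tokens, 0)
    return max(min(requested, MAX_OUTPUT_TOKENS, remaining_context), 0)
```

A result of 0 means the prompt alone already fills the context window under this assumption, so the prompt would need to be shortened before sending.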

Subscription Tiers

  • free
  • pro
  • ultimate