Skip to main content
This endpoint is not yet available. It is planned for a future release.
Understand images alongside text — document digitization, ID card reading, invoice OCR. Powered by models like Qwen-VL.

Planned usage

response = client.chat.completions.create(
    model="forii/qwen2.5-vl-72b",
    messages=[{
        "role": "user",
        "content": [
            {"type": "text", "text": "Extract all details from this Aadhaar card"},
            {"type": "image_url", "image_url": {
                "url": "data:image/jpeg;base64,/9j/4AAQ...",
                "detail": "high"
            }}
        ]
    }],
)

India use cases

  • Aadhaar / PAN card extraction — Parse ID documents into structured data
  • GST invoice parsing — Extract line items, totals, GSTIN from invoices
  • Handwritten form digitization — Convert Hindi handwritten forms to structured JSON
  • Screenshot understanding — Debug UI issues from screenshots