epithre-omni

The flagship. Default choice for most chat workloads.

Capabilities

Capability Notes
Tier Flagship
Context window 49,152 tokens
Max output 16,384 tokens
Modalities Text, image (vision)
Tool use Yes; standard tools + tool_choice schema
Extended thinking Yes; opt-in via chat_template_kwargs={"enable_thinking": true}
Structured output Yes; response_format json_object + json_schema strict
Prompt caching Yes; cache_control markers
Streaming Yes; SSE
Indonesian fluency Native (tuned on Indonesian corpora)

When to use

When NOT to use

Pricing

Performance benchmarks

Task Score (vs.)
Indonesian general chat Strong (native fluency)
Indonesian legal QA (with RAG) ~95% accuracy on 200-question gold set
Indonesian sentiment classification ~92% F1 (3-class)
Indonesian vision QA (invoice extraction) ~94% field-level F1
English chat Strong on everyday and technical English

Latency

Caveats

See also