ML Wiki

Tag: vision-language-models

2 items with this tag.

  • May 09, 2026

    Qwen2.5-VL Technical Report

    • source
    • vision
    • vision-language-models
    • multimodal
    • document-understanding
    • agents
  • Apr 24, 2026

    Consensus Entropy: Harnessing Multi-VLM Agreement for Self-Verifying and Self-Improving OCR

    • source
    • ocr
    • vision-language-models
    • uncertainty-estimation
    • ensemble-methods
    • multimodal