ML Wiki

Tag: counting

1 item with this tag.

  • Apr 10, 2026

    NUMINA: When Numbers Speak — Aligning Textual Numerals and Visual Instances in Text-to-Video Diffusion Models

    • source
    • vision
    • video-generation
    • diffusion
    • attention
    • counting
    • multimodal