Popular repositories

  1. VITA Public

    ✨✨VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction

    Python 2.2k 164

  2. Freeze-Omni Public

    ✨✨Freeze-Omni: A Smart and Low Latency Speech-to-speech Dialogue Model with Frozen LLM

    Python 286 19

  3. Long-VITA Public

    ✨✨Long-VITA: Scaling Large Multi-modal Models to 1 Million Tokens with Leading Short-Context Accuracy

    Python 250 28

  4. LUCY Public

    LUCY: Linguistic Understanding and Control Yielding Early Stage of Her

    Python 27 3

  5. Sparrow Public

    Sparrow: Data-Efficient Video-LLM with Text-to-Image Augmentation

    Jupyter Notebook 23

Repositories

Showing 5 of 5 repositories
  • Long-VITA Public

    ✨✨Long-VITA: Scaling Large Multi-modal Models to 1 Million Tokens with Leading Short-Context Accuracy

    Python 250 28 5 1 Updated Mar 12, 2025
  • Sparrow Public

    Sparrow: Data-Efficient Video-LLM with Text-to-Image Augmentation

    Jupyter Notebook 23 Apache-2.0 0 0 0 Updated Mar 11, 2025
  • LUCY Public

    LUCY: Linguistic Understanding and Control Yielding Early Stage of Her

    Python 27 3 2 0 Updated Mar 10, 2025
  • VITA Public

    ✨✨VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction

    Python 2,150 164 45 1 Updated Feb 13, 2025
  • Freeze-Omni Public

    ✨✨Freeze-Omni: A Smart and Low Latency Speech-to-speech Dialogue Model with Frozen LLM

    Python 286 19 8 2 Updated Jan 2, 2025
