Skip to content
@Vision-CAIR

Vision CAIR Research Group, KAUST

Vision CAIR Group, KAUST, supported by Mohamed Elhoseiny

Popular repositories Loading

  1. MiniGPT-4 MiniGPT-4 Public

    Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)

    Python 25.8k 2.9k

  2. MiniGPT4-video MiniGPT4-video Public

    Official code for Goldfish model for long video understanding and MiniGPT4-video for short video understanding

    Python 641 71

  3. ChatCaptioner ChatCaptioner Public

    Official Repository of ChatCaptioner

    Jupyter Notebook 469 28

  4. LongVU LongVU Public

    [ICML 2025] Official PyTorch implementation of LongVU

    Python 423 35

  5. VisualGPT VisualGPT Public

    VisualGPT, CVPR 2022 Proceeding, GPT as a decoder for vision-language models

    Python 341 54

  6. MiniGPT-Med MiniGPT-Med Public

    Open-sourced code of miniGPT-Med

    Python 139 19

Repositories

Showing 10 of 37 repositories
  • iMotion-LLM Public
    Vision-CAIR/iMotion-LLM’s past year of commit activity
    0 0 1 0 Updated Dec 3, 2025
  • Infinibench Public

    Official InfiniBench: A Benchmark for Large Multi-Modal Models in Long-Form Movies and TV Shows

    Vision-CAIR/Infinibench’s past year of commit activity
    Python 19 BSD-3-Clause 1 1 0 Updated Nov 4, 2025
  • 3DCoMPaT-v2 Public

    3DCoMPaT++: An improved large-scale 3D vision dataset for compositional recognition

    Vision-CAIR/3DCoMPaT-v2’s past year of commit activity
    Python 94 BSD-3-Clause 8 1 0 Updated Oct 14, 2025
  • LongVU Public

    [ICML 2025] Official PyTorch implementation of LongVU

    Vision-CAIR/LongVU’s past year of commit activity
    Python 423 Apache-2.0 35 31 2 Updated May 8, 2025
  • MammalNet Public
    Vision-CAIR/MammalNet’s past year of commit activity
    Python 45 MIT 4 5 0 Updated Feb 9, 2025
  • dochaystacks Public

    Document Haystacks: Vision-Language Reasoning Over Piles of 1000+ Documents, CVPR 2025

    Vision-CAIR/dochaystacks’s past year of commit activity
    Python 25 1 2 0 Updated Jan 25, 2025
  • Vision-CAIR/Goldfish_website’s past year of commit activity
    JavaScript 0 0 0 0 Updated Dec 10, 2024
  • MiniGPT4-video Public

    Official code for Goldfish model for long video understanding and MiniGPT4-video for short video understanding

    Vision-CAIR/MiniGPT4-video’s past year of commit activity
    Python 641 BSD-3-Clause 71 38 1 Updated Dec 10, 2024
  • MiniGPT-Med Public

    Open-sourced code of miniGPT-Med

    Vision-CAIR/MiniGPT-Med’s past year of commit activity
    Python 139 Apache-2.0 19 8 0 Updated Sep 3, 2024
  • MiniGPT-4 Public

    Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)

    Vision-CAIR/MiniGPT-4’s past year of commit activity
    Python 25,759 BSD-3-Clause 2,922 359 18 Updated Sep 2, 2024

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Top languages

Loading…