@dvlab-research

DV Lab

Deep Vision Lab

Pinned

  1. LISA Public

    Project Page for "LISA: Reasoning Segmentation via Large Language Model"

    Python 1.5k 105

  2. LongLoRA Public

    Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)

    Python 2.5k 256

  3. MGM Public

    Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"

    Python 3k 273

  4. LLaMA-VID Public

    Official Implementation for LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models

    Python 607 38

  5. Video-P2P Public

    Video-P2P: Video Editing with Cross-attention Control

    Python 340 23

  6. LLMGA Public

    Official implementation of 'LLMGA: Multimodal Large Language Model based Generation Assistant'

    Python 261 17

Repositories

Showing 10 of 63 repositories
  • LongLoRA Public

    Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)

    Python 2,506 Apache-2.0 256 43 2 Updated Jun 2, 2024
  • MR-GSM8K Public

    Challenge LLMs to Reason About Reasoning: A Benchmark to Unveil Cognitive Depth in LLMs

    Python 34 0 2 0 Updated Jun 1, 2024
  • MGM Public

    Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"

    Python 3,044 Apache-2.0 273 49 2 Updated May 4, 2024
  • LISA Public

    Project Page for "LISA: Reasoning Segmentation via Large Language Model"

    Python 1,524 Apache-2.0 105 56 1 Updated Apr 8, 2024
  • GroupContrast Public

    [CVPR 2024] GroupContrast: Semantic-aware Self-supervised Representation Learning for 3D Understanding

    39 MIT 1 2 0 Updated Mar 15, 2024
  • Video-P2P Public

    Video-P2P: Video Editing with Cross-attention Control

    Python 340 23 5 0 Updated Mar 12, 2024
  • Parametric-Contrastive-Learning Public

    Parametric Contrastive Learning (ICCV2021) & GPaCo (TPAMI 2023)

    Python 232 MIT 29 5 0 Updated Feb 29, 2024
  • Prompt-Highlighter Public

    [CVPR 2024] Prompt Highlighter: Interactive Control for Multi-Modal LLMs

    Python 104 MIT 2 2 0 Updated Jan 25, 2024
  • LLMGA Public

    Official implementation of 'LLMGA: Multimodal Large Language Model based Generation Assistant'

    Python 261 Apache-2.0 17 3 0 Updated Jan 22, 2024
  • LLaMA-VID Public

    Official Implementation for LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models

    Python 607 Apache-2.0 38 29 0 Updated Jan 10, 2024
