Large Multimodal Models and Their Applications

images/LMM_Talk_MSRA.png

Abstract

In this talk, I introduced recent progresses in large multimodal models, such as LLaVA, BLIP2, InstructBLIP, and LLaMA-Adapter.

Date
Dec 12, 2023 2:00 PM — 3:30 PM
Location
Microsoft Research Asia
Building 2, No. 5 Dan Ling Street, Beijing, Beijing 100080
Click on the Slides button above to view the built-in slides feature.

Slides can be added in a few ways:

  • Create slides using Hugo Blox Builder’s Slides feature and link using slides parameter in the front matter of the talk file
  • Upload an existing slide deck to static/ and link using url_slides parameter in the front matter of the talk file
  • Embed your slides (e.g. Google Slides) or presentation video on this page using shortcodes.

Further event details, including page elements such as image galleries, can be added to the body of this page.