Multimodal
David Liu
May 28, 2025
Types
Input and output modalities differ
Multimodal input
Multimodal output
Components
Encoder: one encoder per modality
Align Strategy: how the different modalities are aligned or fused
LLM (Optional): a network built around a large language model
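The three components above can be sketched as a tiny pipeline. This is only an illustration under stated assumptions: `run_pipeline`, `project`, the toy encoders, and the projection matrices are all hypothetical names and values invented for the example, and a plain linear projection stands in for whatever alignment strategy a real model uses.

```python
from typing import Callable, Dict, List, Optional

Vector = List[float]


def project(vec: Vector, weights: List[List[float]]) -> Vector:
    """Linear projection of a modality-specific feature into a shared space."""
    return [sum(w * x for w, x in zip(row, vec)) for row in weights]


def run_pipeline(
    inputs: Dict[str, Vector],
    encoders: Dict[str, Callable[[Vector], Vector]],
    projections: Dict[str, List[List[float]]],
    llm: Optional[Callable[[List[Vector]], str]] = None,
):
    # 1. Encoder: every modality is encoded by its own encoder.
    features = {name: encoders[name](x) for name, x in inputs.items()}
    # 2. Align strategy (assumed here: one linear projection per modality
    #    into a common dimension, in a fixed modality order).
    aligned = [project(features[name], projections[name]) for name in sorted(features)]
    # 3. LLM (optional): consume the aligned vectors as a token sequence.
    if llm is None:
        return aligned
    return llm(aligned)


# Toy stand-ins for real encoders (hypothetical, for illustration only).
encoders = {
    "image": lambda pixels: [sum(pixels) / len(pixels), max(pixels)],
    "text": lambda ids: [float(len(ids)), float(sum(ids)), float(ids[0])],
}
projections = {
    "image": [[1.0, 0.0], [0.0, 1.0]],            # 2-d feature -> 2-d shared space
    "text": [[1.0, 0.0, 0.0], [0.0, 0.0, 1.0]],   # 3-d feature -> 2-d shared space
}
inputs = {"image": [0.0, 0.5, 1.0], "text": [3.0, 1.0, 4.0]}

aligned = run_pipeline(inputs, encoders, projections)
summary = run_pipeline(
    inputs, encoders, projections,
    llm=lambda tokens: f"{len(tokens)} aligned tokens",  # stub in place of a real LLM
)
```

Leaving `llm` as `None` mirrors the "Optional" label: the pipeline then simply returns the aligned features.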
Categories
Dual-Encoder (two-tower)
Fusion
GLIP
CoCa
SAM
FLAVA
Encoder-Decoder
Adapted LLM
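As a rough illustration of the Dual-Encoder (two-tower) category listed above: each modality is embedded by its own tower, and the two sides interact only through a similarity score. The sketch below assumes cosine similarity and uses hand-written embeddings in place of real encoder outputs; `l2_normalize`, `similarity_matrix`, and `best_match` are hypothetical helper names.

```python
import math
from typing import List

Vector = List[float]


def l2_normalize(v: Vector) -> Vector:
    """Scale a non-zero vector to unit length."""
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v]


def similarity_matrix(image_embs: List[Vector], text_embs: List[Vector]) -> List[List[float]]:
    """Cosine similarity between every image embedding and every text embedding."""
    imgs = [l2_normalize(v) for v in image_embs]
    txts = [l2_normalize(v) for v in text_embs]
    return [[sum(a * b for a, b in zip(i, t)) for t in txts] for i in imgs]


def best_match(sim: List[List[float]]) -> List[int]:
    """For each image (row), the index of the most similar text (column)."""
    return [max(range(len(row)), key=row.__getitem__) for row in sim]


# Pretend outputs of the image tower and the text tower (made-up values).
image_embs = [[1.0, 0.0], [0.0, 2.0]]
text_embs = [[0.0, 3.0], [4.0, 0.0]]

sim = similarity_matrix(image_embs, text_embs)
matches = best_match(sim)  # image 0 pairs with text 1, image 1 with text 0
```

Because the towers never see each other's input, embeddings for one side can be computed ahead of time and reused, which is the usual reason for choosing this layout for retrieval.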