Skip to main content
David's Blog
About
Algorithm
Framework
Data Structure
Linear
Tree
Graph
Set
Search
DFS
BFS
FOR
Optimization
Decrease & Conquer
Dynamic Programming
Math
FAQ
AI
Recommender System
Retrieval
Pre-Ranking
Ranking
Re-Ranking
Metrics
Generative Models
Computer Science
Core
Network
Operating System
Design
Design Pattern
Object-Oriented Design
System Design
Engineering
Database
MySQL
Redis
Lang
Java
DevOps
Unix
Tools
Tests
TOEFL
LISTENING
READING
SPEAKING
WRITING
GRE
QUANTITATIVE
VERBAL
Token Pruning
David Liu
4/3/26
Less than 1 minute
Token Pruning
好处:
即插即拔,无需训练
显著降低成本(Flops),保持准确性
fastV=
https://github.com/pkunlp-icler/FastV/
dart=
https://github.com/ZichenWen1/DART
FastV
注意力机制平方复杂度