Slowfast-llava: a strong training-free baseline for video large language models.
Mingze Xu, Mingfei Gao, Zhe Gan, Hong-You Chen, Zhengfeng Lai, Haiming Gang, Kai Kang, Afshin Dehghan Apple 2024 https://huggingface.co/papers/2407.
- SlowFast-LLaVA (SF-LLaVA), a training-free video large…
Join the discussion on this paper page.
Leave a reply