-
MA-LMM Public
(2024CVPR) MA-LMM: Memory-Augmented Large Multimodal Model for Long-Term Video Understanding
-
-
D-NeRV Public
The official implementation of 'Towards Scalable Neural Representation for Diverse Videos' (CVPR 2023)
-
A2Summ Public
The official implementation of 'Align and Attend: Multimodal Summarization with Dual Contrastive Losses' (CVPR 2023)
-
ASM-Loc Public
(CVPR2022) ASM-Loc: Action-aware Segment Modeling for Weakly-Supervised Temporal Action Localization
-
DeepLearning-500-questions Public
Forked from scutan90/DeepLearning-500-questions深度学习500问,以问答形式对常用的概率知识、线性代数、机器学习、深度学习、计算机视觉等热点问题进行阐述,以帮助自己及有需要的读者。 全书分为18个章节,50余万字。由于水平有限,书中不妥之处恳请广大读者批评指正。 未完待续............ 如有意合作,联系[email protected] 版权所有,违权必究 Tan 2018.06
-