Stars
GRID: Generative Recommendation with Semantic IDs
Code for 'LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders'
🎓Automatically Update Recommendation Papers Daily using GitHub Actions (Updates Every 12 Hours)
The official implementation of Autoregressive Image Generation using Residual Quantization (CVPR '22)
A collection of research and survey papers of real-time bidding (RTB) based display advertising techniques.
A Toolbox for MultiModal Recommendation. Integrating 10+ Models...
A configurable, tunable, and reproducible library for CTR prediction https://fuxictr.github.io
Multimodal Pretraining, Adaptation, and Generation for Recommendation: A Survey
Awesome Deep Learning papers for industrial Search, Recommendation and Advertisement. They focus on Embedding, Matching, Pre-Ranking, Ranking (CTR/CVR prediction), Post Ranking, Relevance, LLM, Rei…
Paper list about multimodal and large language models, only used to record papers I read in the daily arxiv for personal needs.
Collecting awesome papers of RAG for AIGC. We propose a taxonomy of RAG foundations, enhancements, and applications in paper "Retrieval-Augmented Generation for AI-Generated Content: A Survey".
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal dialogue model with performance approaching GPT-4o.
Open-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarks
Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
✨✨Latest Advances on Multimodal Large Language Models
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.
Easy-to-use, modular, and extensible package of deep-learning-based CTR models.
A collection of industry classics and cutting-edge papers in the field of recommendation/advertising/search.
Retrieval and Retrieval-augmented LLMs
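A collection of industry practice articles on search, recommendation, advertising, user growth, and related areas (sources: Zhihu, Datafuntalk, technical WeChat official accounts)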
LAVIS - A One-stop Library for Language-Vision Intelligence
An application for Mac OS X that lets you use controller inputs like a mouse or keyboard
Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.