Publications
Please see Google Scholar for more recent works and arXiv papers.
2025
WACV
Retrieval Augmented Recipe GenerationIEEE/CVF Winter Conference on Applications of Computer Vision (WACV) , 2025
2024
TOMM
CVLP-NaVD: Contrastive Visual-Language Pre-training Models for Non-annotated Visual DescriptionACM Transactions on Multimedia Computing, Communications, and Applications (TOMM)MM Asia
Active Object Segmentation: A New Modality for Egocentric Action RecognitionACM Multimedia Asia (MM Asia), 2024TOMM
Text-driven Video PredictionACM Transactions on Multimedia Computing, Communications, and Applications (TOMM)TMM
From Canteen Food to Daily Meals: Generalizing Food Recognition to More Practical ScenariosIEEE Transactions on Multimedia (TMM)TMM
Efficient Unsupervised Video Hashing with Contextual Modeling and Structural ControllingIEEE Transactions on Multimedia (TMM)