International Conference
[#357] Causal Mode Multiplexer: A Novel Framework for Unbiased Multispectral Pedestrian Detection
Taeheon Kim*, Sebin Shin*, Youngjoon Yu, Hak Gu Kim, and Yong Man Ro (* equal contributor)
CVPR 2024
Jeongsoo Choi*, Se Jin Park*, Minsu Kim*, and Yong Man Ro (* equal contributor)
CVPR 2024
Pai Chet Ng, Zhixiang Chi, Malcolm Low, Juwei Lu, Konstantinos Plataniotis, Nikolaos Boulgouris, Thirimachos Bourlai, Yong Man Ro
ICASSP 2024 Special Session
[#354] Towards Practical and Efficient Image-To-Speech Captioning With Vision-Language Pre-Training and Multi-Modal Tokens
Minsu Kim, Jeongsoo Choi, Soumi Maiti, Jeong Hun Yeo, Shinji Watanabe, and Yong Man Ro
ICASSP 2024
[#353] Visual Speech Recognition for Languages with Limited Labeled Data using Automatic Labels from Whisper
Jeong Hun Yeo*, Minsu Kim*, Shinji Watanabe, and Yong Man Ro
ICASSP 2024
[#352] Persona Extraction through Semantic Similarity for Emotional Support Conversation Generation
Seunghee Han, Se Jin Park, Chae Won Kim, and Yong Man Ro
ICASSP 2024
[#351] Text-driven Talking Face Synthesis by Reprogramming Audio-driven Models
Jeongsoo Choi, Minsu Kim, Se Jin Park, and Yong Man Ro
ICASSP 2024
[#350] Exploring Phonetic Context-aware Lip-Sync for Talking Face Generation
Se Jin Park, Minsu Kim, Jeongsoo Choi, and Yong Man Ro
ICASSP 2024
[#349] OSR via Visual Prompts from Common-Sense Knowledge
Seongyeop Kim, Hyung-Il Kim, and Yong Man Ro
AAAI 2024