International Conference

[#357] Causal Mode Multiplexer: A Novel Framework for Unbiased Multispectral Pedestrian Detection

Taeheon Kim*, Sebin Shin*, Youngjoon Yu, Hak Gu Kim, and Yong Man Ro (* equal contributor)

CVPR 2024

Jeongsoo Choi*, Se Jin Park*, Minsu Kim*, and Yong Man Ro (* equal contributor)

CVPR 2024

Pai Chet Ng, Zhixiang Chi, Malcolm Low, Juwei Lu, Konstantinos Plataniotis, Nikolaos Boulgouris, Thirimachos Bourlai, and Yong Man Ro

ICASSP 2024 Special Session

[#354] Towards Practical and Efficient Image-To-Speech Captioning With Vision-Language Pre-Training and Multi-Modal Tokens

Minsu Kim, Jeongsoo Choi, Soumi Maiti, Jeong Hun Yeo, Shinji Watanabe, and Yong Man Ro

ICASSP 2024

[#353] Visual Speech Recognition for Languages with Limited Labeled Data using Automatic Labels from Whisper

Jeong Hun Yeo*, Minsu Kim*, Shinji Watanabe, and Yong Man Ro (* equal contributor)

ICASSP 2024

[#352] Persona Extraction through Semantic Similarity for Emotional Support Conversation Generation

Seunghee Han, Se Jin Park, Chae Won Kim, and Yong Man Ro

ICASSP 2024

[#351] Text-driven Talking Face Synthesis by Reprogramming Audio-driven Models

Jeongsoo Choi, Minsu Kim, Se Jin Park, and Yong Man Ro

ICASSP 2024

[#350] Exploring Phonetic Context-aware Lip-Sync for Talking Face Generation

Se Jin Park, Minsu Kim, Jeongsoo Choi, and Yong Man Ro

ICASSP 2024

[#349] OSR via Visual Prompts from Common-Sense Knowledge

Seongyeop Kim, Hyung-Il Kim, and Yong Man Ro

AAAI 2024