International Conference

[#348] Intuitive Multilingual Audio-Visual Speech Recognition with a Single-Trained Model

Joanna Hong, Se Jin Park, and Yong Man Ro

EMNLP 2023

[#347] Lip Reading for Low-resource Languages by Learning and Combining General Speech Knowledge and Language-specific Knowledge 

Minsu Kim*, Jeong Hun Yeo*, Jeongsoo Choi, and Yong Man Ro (* equal contribution) 

ICCV 2023

Byung-Kwan Lee*, Junho Kim*, and Yong Man Ro (* equally contributed)

ICCV 2023

[#345] DiffV2S: Diffusion-based Video-to-Speech Synthesis with Vision-guided Speaker Embedding

Jeongsoo Choi*, Joanna Hong*, and Yong Man Ro (* equally contributed)
ICCV 2023

[#344] Mitigating Dataset Bias in Image Captioning through CLIP Confounder-free Captioning Network

YeonJu Kim, Junho Kim, Byung-Kwan Lee, Sebin Shin, and Yong Man Ro
ICIP 2023
Sungjune Park, Jung Uk Kim, Jin Mo Song, and Yong Man Ro
ICIP 2023
Jeongsoo Choi, Minsu Kim, and Yong Man Ro
INTERSPEECH 2023
Joanna Hong*, Minsu Kim*, Jeongsoo Choi, and Yong Man Ro (* equally contributed)
CVPR 2023
Minsu Kim*, Joanna Hong*, and Yong Man Ro (* equally contributed)
ICASSP 2023
Jeong Hun Yeo, Minsu Kim, and Yong Man Ro
ICASSP 2023
Minsu Kim,  Chae Won Kim, and Yong Man Ro
AAAI 2023