Search this site
Embedded Files
Skip to main content
Skip to navigation
IVYLab & IVLLab
IVYLab & IVLLab
LLM Multimodal Highlights
People
Professor
Members
Research Collaborators
Alumni
Research
Lab Overview
Research Fields
Research Demo
Publications
International Conference
International Journal
International Standards
Patents
Domestic Papers
Gallery
Board
Database
Contact
IVYLab & IVLLab
IVYLab & IVLLab
LLM Multimodal Highlights
People
Professor
Members
Research Collaborators
Alumni
Research
Lab Overview
Research Fields
Research Demo
Publications
International Conference
International Journal
International Standards
Patents
Domestic Papers
Gallery
Board
Database
Contact
More
IVYLab & IVLLab
LLM Multimodal Highlights
People
Professor
Members
Research Collaborators
Alumni
Research
Lab Overview
Research Fields
Research Demo
Publications
International Conference
International Journal
International Standards
Patents
Domestic Papers
Gallery
Board
Database
Contact
International Journal
[#160] TMT: Tri-Modal Translation between Speech, Image, and Text by Processing Different Modalities as Different Languages
Minsu Kim, Jee-weon Jung, Hyeongseop Rha, Soumi Maiti, Siddhant Arora, Xuankai Chang, Shinji Watanabe, Yong Man Ro
IEEE Transactions on Multimedia
[#159] MSCoTDet: Language-driven Multi-modal Fusion for Improved Multispectral Pedestrian Detection
Taeheon Kim, Sangyun Chung, Damin Yeom, Youngjoon Yu, Hak Gu Kim, Yong Man Ro
IEEE Transactions on Circuits and Systems for Video Technology
[#158] Prompt Tuning of Deep Neural Networks for Speaker-adaptive Visual Speech Recognition
Minsu Kim, Hyeong-Il Kim, Yong Man Ro
IEEE Transactions on Pattern Analysis and Machine Intelligence
[#157] Advancing Causal Intervention in Image Captioning with Causal Prompt
Youngjoon Yu, Yeonju Kim, Yong Man Ro
IEEE Transactions on Neural Networks and Learning Systems
[#156] Textless Unit-to-Unit training for Many-to-Many Multilingual Speech-to-Speech Translation
Minsu Kim, Jeongsoo Choi, Dahun Kim, Yong Man Ro
IEEE Transactions on Audio, Speech, and Language Processing, vol. 32, pp. 3934-3946, 2024. /
Demo
[#155]
Text-Guided Distillation Learning to Diversify Video Embeddings for Text-Video Retrieval
Sangmin Lee, Hyung-Il Kim, Yong Man Ro
Pattern Recognition, vol. 156, no. 3, pp. 110754, 2024.
[#154] Robust Pedestrian Detection via Constructing Versatile Pedestrian Knowledge Bank
Sungjune Park*, Hyunjun Kim*, Yong Man Ro (* equal contributor)
Pattern Recognition, vol. 153, no. 4, pp. 110539, 2024.
[#153]
Integrating Language-Derived Appearance Elements with Visual Cues in Pedestrian Detection
Sungjune Park*, Hyunjun Kim*, Yong Man Ro (* equal contributor)
IEEE Transactions on Circuits and Systems for Video Technology, pp. 1-1, 2024.
[#152]
AKVSR: Audio Knowledge Empowered Visual Speech Recognition by Compressing Audio Knowledge of a Pretrained Model
Jeong Hun Yeo, Minsu Kim, Jeongsoo Choi, Dae Hoe Kim, and Yong Man Ro
IEEE Transactions on Multimedia, vol. 26, pp. 6462-6474, 2024.
[#151]
Defending Video Recognition Model against Adversarial Perturbations via Defense Patterns
Hong Joo Lee and Yong Man Ro
IEEE Transactions on Dependable and Secure Computing, vol. 21, no. 04, pp. 4110-4121, 2024.
[#150]
Enabling Visual Object Detection with Object Sounds via Visual Modality Recalling Memory
Jung Uk Kim and Yong Man Ro
IEEE Transactions on Neural Networks and Learning Systems, pp. 1-13, 2023.
[#149]
Adversarial anchor-guided feature refinement for adversarial defense
Hakmin Lee and Yong Man Ro
Image and Vision Computing, vol. 136, pp. 104722, 2023.
[#148]
Robust Proxy: Improving Adversarial Robustness by Robust Proxy Learning
Hong Joo Lee and Yong Man Ro
IEEE Transactions on Information Forensics & Security, vol. 18, pp. 4021-4033, 2023
[#147]
Deep learning-based classification system of bacterial keratitis and fungal keratitis using anterior segment images
Yeo Kyoung Won*, Hyebin Lee*, Youngjun Kim, Gyule Han, Tae-Young Chung, Yong Man Ro and Dong Hui Lim (* equal contributor)
Frontiers in Medicine, vol. 10, pp. 1162124, 2023.
[#146]
Stereoscopic Vision Recalling Memory for Monocular 3D Object Detection
Jung Uk Kim, Hyung-Il Kim, and Yong Man Ro
IEEE Transactions on Image Processing, vol. 32, pp. 2749-2760, 2023.
[#145]
Advancing Adversarial Training by Injecting Booster Signal
Hong Joo Lee, Youngjoon Yu, and Yong Man Ro
IEEE Transactions on Neural Networks and Learning Systems, vol. 35, no. 9, pp. 12665-12677, Sept. 2024
[#144]
Defending Person Detection Against Adversarial Patch Attack by using Universal Defensive Frame
Youngjoon Yu, Hong Joo Lee, Hakmin Lee, and Yong Man RoHakmin Lee and Yong Man Ro
IEEE Transactions on Image Processing, vol. 31, pp. 6976-6990, 2022.
[#143]
Face Shape-Guided Deep Feature Alignment for Face Recognition Robust to Face Misalignment
Hyung-Il Kim, Kimin Yun, and Yong Man Ro
IEEE Transactions on Biometrics, Behavior, and Identity Science, vol. 4, no. 4, pp. 556-569, 2022.
[#142]
CroMM-VSR: Cross-Modal Memory Augmented Visual Speech Recognition
Minsu Kim, Joanna Hong, Sejin Park, Yong Man Ro
IEEE Transactions on Multimedia, vol. 24, pp. 4342-4355, 2022.
[#141]
Assessing Individual VR Sickness Through Deep Feature Fusion of VR Video and Physiological Response
Sangmin Lee, Seongyeop Kim, Hak Gu Kim, and Yong Man Ro
IEEE Transactions on Circuits and Systems for Video Technology, vol. 136, pp. 2895-2907, 2022.
[#140]
Uncertainty-Guided Cross-Modal Learning for Robust Multispectral Pedestrian Detection
Jung Uk Kim, Sungjune Park, Yong Man Ro
IEEE Transactions on Circuits and Systems for Video Technology, vol. 32, no. 3, pp. 1510-1523, 2022.
[#139]
On-the-Fly Facial Expression Prediction Using LSTM Encoded Appearance-Suppressed Dynamics
Wissam J. Baddar, Sangmin Lee, and Yong Man Ro
IEEE Transactions on Affective Computing, vol. 13, no. 1, pp. 159-174, 2022.
[#138]
Robust Perturbation for Visual Explanation:Cross-checking Mask Optimization to Avoid Class Distortion
Junho Kim, Seongyeop Kim, Seong Tae Kim, and Yong Man Ro
IEEE Transactions on Image Processing, vol. 31, pp. 301-313, 2022.
[#137]
Speech Reconstruction with Reminiscent Sound via Visual Voice Memory
Joanna Hong, Minsu Kim, Se Jin Park, Yong Man Ro
IEEE Transactions on Audio Speech and Language Processing, vol. 29, pp. 3654-3667, 2021.
[#136]
CUA Loss: Class Uncertainty-Aware Gradient Modulation for Robust Object Detection
Jung Uk Kim, Seong Tae Kim, Hong Joo Lee, Sangmin Lee, and Yong Man Ro
IEEE Transactions on Circuits and Systems for Video Technology, vol. 31, no. 9, pp. 3529-3543, 2021.
[#135]
Adversarially Robust Hyperspectral Image Classification via Random Spectral Sampling and Spectral Shape Encoding
Sungjune Park, Hong Joo Lee, Yong Man Ro
IEEE Access, vol. 9, pp. 66791-66804, 2021.
[#134]
Robust Video Frame Interpolation with Exceptional Motion Map
Minho Park, Hak Gu Kim, Sangmin Lee, and Yong Man Ro
IEEE Transactions on Circuits and Systems for Video Technology, vol. 31, no. 2, pp. 754-764, 2021.
[#133]
Multimodal Faical Biometrics Recognition: Dual-stream Convolutional Neural Networks with Multi-feature Fusion Layers
Leslie Ching Ow Tiong, Seong Tae Kim, and Yong Man Ro
Image and Vision Computing (Elsevier), vol. 102, pp. 103977, 2020.
[#132]
Dual-branch structured de-striping convolution network using parametric noise model
Jongho Lee and Yong Man Ro
IEEE Access , vol. 8, pp. 155519-155528, 2020.
[#131]
Deep Virtual Reality Image Quality Assessment with Human Perception Guider for Omnidirectional Image
Hak Gu Kim, Heoun-taek Lim, and Yong Man Ro
IEEE Transactions on Circuits and Systems for Video Technologyk, vol. 30, no. 4, pp. 917-928, 2019.
[#130]
BBC Net: Bounding-Box Critic Network for Occlusion-Robust Object Detection
Jung Uk Kim, Jungsu Kwon, Hak Gu Kim, and Yong Man Ro
IEEE Transactions on Circuits and Systems for Video Technology, vol. 30, no. 4, pp. 1037-1050, 2019.
[#129]
Encoding Features Robust to Unseen Modes of Variation with Attentive Long Short-Term Memory
Wissam J. Baddar and Yong Man Ro
Pattern Recognition, vol. 100, 107159, 2020.
[#128]
Lightweight and Effective Facial Landmark Detection using Adversarial Learning with Face Geometric Map Generative Network
Hong Joo Lee, Seong Tae Kim, Hakmin Lee and Yong Man Ro
IEEE Transactions on Circuits and Systems for Video Technology, vol. 30, no. 3, pp. 771-780, 2019.
[#127]
MCSIP Net: Multi-Channel Satellite Image Prediction via Deep Neural Network
Jae-Hyeok Lee, Sangmin S. Lee, Hak Gu Kim, Sa-kwang Song, Seongchan Kim, and Yong Man Ro
IEEE Transactions on Geoscience and Remote Sensing (TGRS), vol. 58, no. 3, pp. 2212-2224, 2019.
[#126]
BMAN: Bidirectional Multi-scale Aggregation Networks for Abnormal Event Detection
Sangmin Lee, Hak Gu Kim, and Yong Man Ro
IEEE Transactions on Image Processing, vol. 29, pp. 2395-2408, 2019.
[#125]
Multi-Objective Based Spatio-Temporal Feature Representation Learning Robust to Expression Intensity Variations for Facial Expression Recognition
Dae Hoe Kim, Wissam J. Baddar, Jinhyeok Jang and Yong Man Ro
IEEE Transactions on Affective Computing, vol. 10, no. 2, pp. 223-236, 2017.
[#124]
Endometrium Segmentation on TVUS Image Using Key-point Discriminator
Hong Joo Lee, Hyenok Park, Hak Gu Kim, Dongkuk Shin, Sa Ra Lee, Sung Hoon Kim, Mikyung Kong and Yong Man Ro
Medical Physics, vol. 46. no. 9, pp. 3974-3984, 2019.
[#123]
Implementation of Multimodal Biometric Recognition via Multi-feature Deep Learning Networks and Feature Fusion
Leslie Ching Ow Tiong, Seong Tae Kim and Yong Man Ro
Multimedia Tools and Applications, vol. 78, pp. 22743-22772, 2019.
[#122]
Binocular Fusion Net: Deep Learning Visual Comfort Assessment for Stereoscopic 3D
Hak Gu Kim, Hyunwook Jeong (equally contributed), Heoun-taek Lim and Yong Man Ro
IEEE Transactions on Circuits and Systems for Video Technology, vol. 29, no. 4, pp. 956-967, 2018.
[#121]
VRSA Net: VR Sickness Assessment considering Exceptional Motion for 360-degree VR Video
Hak Gu Kim, Heoun-taek Lim, Sangmin Lee and Yong Man Ro
IEEE Transactions on Image Processing, vol. 28, no. 4, pp. 1646-1660, 2018.
[#120]
Attended Relation Feature Representation of Facial Dynamics for Facial Authentication
Seong Tae Kim and Yong Man Ro
IEEE Transactions on Information Forensics & Security, vol. 14, no. 7, pp. 1768-1778, 2018.
[#119]
Visually Interpretable Deep Network for Diagnosis of Breast Masses on Mammograms
Seong Tae Kim, Jae-Hyeok Lee, Hakmin Lee and Yong Man Ro
Physics in Medicine & Biology, vol. 63, no. 23, 235025, 2018.
[#118]
Ultrafast Layer Based Computer-Generated Hologram Calculation with Sparse Template Holographic Fringe Pattern for 3-D Object
Hak Gu Kim and Yong Man Ro
Optics Express, vol. 25, no. 24, pp. 30418 - 30427, 2017.
[#117
]
Multi-View Stereoscopic Video Hole Filling Considering Spatio-Temporal Consistency and Binocular Symmetry for Synthesized 3D Video
Hak Gu Kim and Yong Man Ro
IEEE Transactions on Circuits and Systems for Video Technology, vol. 27, no. 7, pp. 1435-1449, 2016.
[#116]
Multi-Objective based Spatio-Temporal Feature Representation Learning Robust to Expression Intensity Variations for Facial Expression Recognition
Dae Hoe Kim, Wisam J. Baddar and Yong Man Ro
IEEE Transactions on Affective Computing, vol. 10, no. 2, pp. 223-236, 2017.
[#115
] E
ffective and Efficient Human Action Recognition Using Dynamic Frame Skipping and Trajectory Refection
Jeong-Jik Seo, Hyung-Il Kim, Wesley De Neve and Yong Man Ro
Image and Vision Computing, vol. 58, pp. 76-85, 2017.
[#114]
Latent Feature Representation with Depth Directional Long-Term Recurrent Learning for Breast Masses in Digital Breast Tomosynthesis
Dae Hoe Kim, Seong Tae Kim, Jung Min Chang and Yong Man Ro
Physics in Medicine & Biology, vol. 62, no. 3, pp. 1009 - 1031, 2017.
[#113]
Experimental Investigation of Facial Expressions Associated with Visual Discomfort: Feasibilty Study Towards an Objective Measurement of Visual Discomfort Based on Facial Expression
Seong-Il Lee, Seung Ho Lee, Konstantinos N. Plataniotis and Yong Man Ro
IEEE/OSA Journal of Display Technology, vol. 12, no. 12, pp. 1785-1797, 2016.
[#112]
Acceleration of Calculation Speed of Computer-Generated Holograms Using the Sparsity of the Holographic Fringe Pattern for 3D Object
Hak Gu Kim, Hyunwook Jeong and Yong Man Ro
Optics Express, vol. 24, no. 22, pp. 25317 - 25328, 2016.
[#111]
Experimental Investigation of the Effect of Binocular Disparity on the Visibility Threshold of Asymmetric Noise in Stereoscopic Viewing
Hak Gu Kim, Seong-Il Lee (equally contributed) and Yong Man Ro
Optics Express, vol. 24, no. 17, pp. 19607 - 19615, 2016.
[#110]
Critical Binocular Asymmetry Measure for Perceptual Quality Assessemnt of Synthesized Stero 3D Images in View Synthesis
Yong Ju Jung, Hak Gu Kim and Yong Man Ro
IEEE Transactions on Circuits and Systems for Video Technology, vol. 26, no. 7, pp. 1201 - 1214, 2015.
[#109]
Collaborative Expression Representation Using Peak Expression and Intra Variation Face Images for Practical Subject-Independent Emotion Recognition in Videos
Seung Ho Lee, Wissam J. Baddar and Yong Man Ro
Pattern Recognition, vol. 54, pp. 52 - 67, 2016.
[#108]
Feature Scalability for a Low Complexity Face Recognition with Unconstrained Spatial Resolution
Hyung-Il Kim, Seung Ho Lee, Jae-Young Choi and Yong Man Ro
Multimedia Tools and Applications, vol. 75, no. 12, pp. 6887 - 6908, 2016.
[#107]
Classifier Ensemble Generation and Selection with Multiple Feature Representations for Classification Applications in Computer-Aided Detection and Diagnosis on Mammography
Jae Young Choi, Dae Hoe Kim, Konstantinos N. Plataniotis and Yong Man RO
Expert Systems with Applications, vol. 46, pp. 106 - 121, 2016.
[#106]
Partial Matching of Facial Expression Sequence Using Over-complete Transition Dictionary for Emotion Recognition
Seung Ho Lee and Yong Man Ro
IEEE Transactions on Affective Computing, vol. 7, no. 4, pp. 389 - 408, 2015.
[#105]
Detection of Masses in Digital Breast Tomosynthesis Using Complementary Information of Simulated Projection
Seong Tae Kim, Dae Hoe Kim and Yong Man Ro
Medical Physics, vol. 42, no. 12, pp. 7043 - 7058, 2015.
[#104] I
mproving Mass Detection Using Combined Feature Representations from Projection Views and Reconstructed Volume of DBT and Boosting Based Classification with Feature Selection
Dae Hoe Kim, Seong Tae Kim and Yong Man Ro
Physics in Medicine and Biology, vol. 60, no. 22, pp. 8809 - 8832, 2015.
[#103]
I
mage-Based Coin Recognition Using Rotation-Invariant Region Binary Patterms Based on Gradient Magnitudes
Semin Kim, Seung Ho Lee and Yong Man Ro
Journal of Visual Communication and Image Representation, vol. 32, pp. 217 - 223, 2015.
[#102] Towards a Physiology-based Measure of Visual Discomfort: Brain Activity Measurement While Viewing Stereoscopic Images with Different Screen Disparities
Yong Ju Jung, Dongchan Kim, Hosik Sohn, Seong-il Lee, Hyun Wook Park, and Yong Man Ro
IEEE/OSA Journal of Display Technology, vol. 11, no. 9, pp. 730-743, 2015.
[#101] Region Based Stellate Features Combined with Variable Selection Using AdaBoost Learning in Mammographic Computer-aided Detection
Dae Hoe Kim, Jae Young Choi, and Yong Man Ro
Computers in Biology and Medicine, vol. 63, pp. 238-250, 2015.
[#100] Breast mass detection using slice conspicuity in 3D reconstructed digital breast volumes
Seong Tae Kim, Dae Hoe Kim, and Yong Man Ro
Physics in Medicine and Biology, vol. 59, no. 17, pp. 5003-5023, 2014.
1
2
3
Report abuse
Page details
Page updated
Report abuse