Yujun Cai's Homepage

Publications [Google Scholar][DBLP]

2025

HAIF-GS: Hierarchical and Induced Flow-Guided Gaussian Splatting for Dynamic Scene

Jianing Chen, Zehao Li, Yujun Cai, Hao Jiang, Chengxuan Qian, Juyuan Kang, Shuqin Gao, Honglong Zhao, Tianlu Mao, Yucheng Zhang.

NeurIPS, 2025.

PDF Code
VistaWise: Building Cost-Effective Agent with Cross-Modal Knowledge Graph for Minecraft

Honghao Fu, Junlong Ren, Qi Chai, Deheng Ye, Yujun Cai, Hao Wang.

EMNLP, 2025.

PDF Code
MRFD: Multi-Region Fusion Decoding with Self-Consistency for Mitigating Hallucinations in LVLMs

Haonan Ge, Yiwei Wang, Ming-Hsuan Yang, Yujun Cai.

EMNLP, 2025.

PDF Code
Making Every Step Effective: Jailbreaking Large Vision-Language Models through Hierarchical KV Equalization

Shuyang Hao, Yiwei Wang, Bryan Hooi, Jun Liu, Muhao Chen, Zi Huang, Yujun Cai.

EMNLP, 2025.

PDF Code
SemVink: Advancing VLMs' Semantic Understanding of Optical Illusions via Visual Global Thinking

Sifan Li, Yujun Cai, Yiwei Wang.

EMNLP, 2025.

PDF Code
Understanding GUI Agent Localization Biases through Logit Sharpness

Xingjian Tao, Yiwei Wang, Yujun Cai, Zhicheng Yang, Jing Tang.

EMNLP, 2025.

PDF Code
DiMo-GUI: Advancing Test-time Scaling in GUI Grounding via Modality-Aware Visual Reasoning

Hang Wu, Hongkai Chen, Yujun Cai, Chang Liu, Qingwen Ye, Ming-Hsuan Yang, Yiwei Wang.

EMNLP, 2025.

PDF Code
Mapping the Minds of LLMs: A Graph-Based Analysis of Reasoning LLM

Zhen Xiong, Yujun Cai, Zhecheng Li, Yiwei Wang.

EMNLP, 2025.

PDF Code
Exploring Visual Vulnerabilities via Multi-Loss Adversarial Search for Jailbreaking Vision-Language Models

Shuyang Hao, Bryan Hooi, Jun Liu, Kai-Wei Chang, Zi Huang, Yujun Cai.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2025.

PDF Code
LatentHOI: On the Generalizable Hand Object Motion Generation with Latent Hand Diffusion

Muchen Li, Sammy Christen, Chengde Wan, Yujun Cai, Renjie Liao, Leonid Sigal, Shugao Ma.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2025.

PDF Code
DRS: Deep Question Reformulation With Structured Output

Zhecheng Li, Yiwei Wang, Bryan Hooi, Yujun Cai, Nanyun Peng, Kai-Wei Chang.

ACL, 2025.

PDF Code
Vulnerability of LLMs to Vertically Aligned Text Manipulations

Zhecheng Li, Yiwei Wang, Bryan Hooi, Yujun Cai, Zhen Xiong, Nanyun Peng, Kai-Wei Chang.

ACL, 2025.

PDF Code
Texture or Semantics? Vision-Language Models Get Lost in Font Recognition

Zhecheng Li, Guoxian Song, Yujun Cai, Zhen Xiong, Junsong Yuan, Yiwei Wang.

COLM, 2025.

PDF Code
Text Speaks Louder than Vision: ASCII Art Reveals Textual Biases in Vision-Language Models

Zhaochen Wang, Bryan Hooi, Yiwei Wang, Ming-Hsuan Yang, Zi Huang, Yujun Cai.

COLM, 2025.

PDF Code
How does Watermarking Affect Visual Language Models in Document Understanding?

Chunxue Xu, Yiwei Wang, Bryan Hooi, Yujun Cai, Songze Li.

COLM, 2025.

PDF Code
Tricking Retrievers with Influential Tokens: An Efficient Black-Box Corpus Poisoning Attack

Cheng Wang, Yiwei Wang, Yujun Cai, Bryan Hooi.

Proceedings of the 2025 Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), 2025.

PDF Code
Con-ReCall: Detecting Pre-training Data in LLMs via Contrastive Decoding

Cheng Wang, Yiwei Wang, Bryan Hooi, Yujun Cai, Nanyun Peng, Kai-Wei Chang.

International Conference on Computational Linguistics (COLING), 2025.

PDF Code
Enhancing LLM Character-Level Manipulation via Divide and Conquer

Zhen Xiong, Yujun Cai, Bryan Hooi, Nanyun Peng, Kai-Wei Chang, Zhecheng Li, Yiwei Wang.

arXiv preprint arXiv:2502.08180, 2025.

PDF Code
Lost in Edits? A λ-Compass for AIGC Provenance

Wenhao You, Bryan Hooi, Yiwei Wang, Euijin Choo, Ming-Hsuan Yang, Junsong Yuan, Zi Huang, Yujun Cai.

arXiv preprint arXiv:2502.04364, 2025.

PDF Code

2024

DisC-GS: Discontinuity-aware Gaussian Splatting

Haoxuan Qu, Zhuoling Li, Hossein Rahmani, Yujun Cai, Jun Liu.

Advances in Neural Information Processing Systems (NeurIPS), 2024.

PDF Code
emg2pose: A Large and Diverse Benchmark for Surface Electromyographic Hand Pose Estimation

Sasha Salter, Richard Warren, Collin Schlager, Adrian Spurr, Shangchen Han, Rohin Bhasin, Yujun Cai, Peter Walkington, Anuoluwapo Bolarinwa, Robert Wang, others.

Advances in Neural Information Processing Systems Datasets and Benchmarks Track (NeurIPS), 2024.

PDF Code
Energy-Calibrated VAE with Test Time Free Lunch

Yihong Luo, Siya Qiu, Xingjian Tao, Yujun Cai, Jing Tang.

European Conference on Computer Vision (ECCV), 2024.

PDF Code
LLMs are Good Action Recognizers

Haoxuan Qu, Yujun Cai, Jun Liu.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024.

PDF Code
6D-Diff: A Keypoint Diffusion Framework for 6D Object Pose Estimation

Li Xu, Haoxuan Qu, Yujun Cai, Jun Liu.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024.

PDF Code
STMG: A Machine Learning Microgesture Recognition System for Supporting Thumb-Based VR/AR Input

Kenrick Kin, Chengde Wan, Ken Koh, Andrei Marin, Necati Cihan Camgoz, Yubo Zhang, Yujun Cai, Fedor Kovalev, Moshe Ben-Zacharia, Shannon Hoople, others.

Proceedings of the CHI Conference on Human Factors in Computing Systems (CHI), 2024.

PDF Code
GradiSeg: Gradient-Guided Gaussian Segmentation with Enhanced 3D Boundary Precision

Zehao Li, Wenwei Han, Yujun Cai, Hao Jiang, Baolong Bi, Shuqin Gao, Honglong Zhao, Zhaoqi Wang.

arXiv preprint arXiv:2412.00392, 2024.

PDF Code
Think Carefully and Check Again! Meta-Generation Unlocking LLMs for Low-Resource Cross-Lingual Summarization

Zhecheng Li, Yiwei Wang, Bryan Hooi, Yujun Cai, Naifan Cheung, Nanyun Peng, Kai-Wei Chang.

arXiv preprint arXiv:2410.20021, 2024.

PDF Code
Off-the-shelf ChatGPT is a Good Few-shot Human Motion Predictor

Haoxuan Qu, Zhaoyang He, Zeyu Hu, Yujun Cai, Jun Liu.

arXiv preprint arXiv:2405.15267, 2024.

PDF Code
Are LLMs Really Not Knowledgable? Mining the Submerged Knowledge in LLMs' Memory

Xingjian Tao, Yiwei Wang, Yujun Cai, Zhicheng Yang, Jing Tang.

arXiv preprint arXiv:2412.20846, 2024.

PDF Code

2023

Primacy Effect of ChatGPT

Yiwei Wang, Yujun Cai, Muhao Chen, Yuxuan Liang, Bryan Hooi.

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023.

PDF Code
A Characteristic Function-Based Method for Bottom-Up Human Pose Estimation

Haoxuan Qu, Yujun Cai, Lin Geng Foo, Ajay Kumar, Jun Liu.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023.

PDF Code
Social diffusion: Long-term multiple human motion anticipation

Julian Tanke, Linguang Zhang, Amy Zhao, Chengcheng Tang, Yujun Cai, Lezi Wang, Po-Chen Wu, Juergen Gall, Cem Keskin.

Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2023.

PDF Code
How Fragile is Relation Extraction under Entity Replacements?

Yiwei Wang, Bryan Hooi, Fei Wang, Yujun Cai, Yuxuan Liang, Wenxuan Zhou, Jing Tang, Manjuan Duan, Muhao Chen.

CoNLL, 2023.

PDF Code

2022

Heatmap distribution matching for human pose estimation

Haoxuan Qu, Li Xu, Yujun Cai, Lin Geng Foo, Jun Liu.

Advances in Neural Information Processing Systems, 2022.

PDF Code
Geometry-guided progressive NeRF for generalizable and efficient neural human rendering

Mingfei Chen, Jianfeng Zhang, Xiangyu Xu, Lijuan Liu, Yujun Cai, Jiashi Feng, Shuicheng Yan.

European Conference on Computer Vision, 2022.

PDF Code
Graphcache: Message passing as caching for sentence-level relation extraction

Yiwei Wang, Muhao Chen, Wenxuan Zhou, Yujun Cai, Yuxuan Liang, Bryan Hooi.

Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), 2022.

PDF Code
Should we rely on entity mentions for relation extraction? debiasing relation extraction with counterfactual analysis

Yiwei Wang, Muhao Chen, Wenxuan Zhou, Yujun Cai, Yuxuan Liang, Dayiheng Liu, Baosong Yang, Juncheng Liu, Bryan Hooi.

Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), 2022.

PDF Code
MonaGO: a novel gene ontology enrichment analysis visualisation system

Ziyin Xin, Yujun Cai, Louis T Dang, Hannah MS Burke, Jerico Revote, Natalie Charitakis, Denis Bienroth, Hieu T Nim, Yuan-Fang Li, Mirana Ramialison.

BMC Bioinformatics, 2022.

PDF Code
DeepEMD: Differentiable earth mover's distance for few-shot learning

Chi Zhang, Yujun Cai, Guosheng Lin, Chunhua Shen.

IEEE Transactions on Pattern Analysis and Machine Intelligence, 2022.

PDF Code
UmeTrack: Unified multi-view end-to-end hand tracking for VR

Shangchen Han, Po-Chen Wu, Yubo Zhang, Beibei Liu, Linguang Zhang, Zheng Wang, Weiguang Si, Peizhao Zhang, Yujun Cai, Tomas Hodan, others.

SIGGRAPH Asia 2022 conference papers, 2022.

PDF Code
Time-Aware Neighbor Sampling on Temporal Graphs

Yiwei Wang, Yujun Cai, Yuxuan Liang, Henghui Ding, Changhu Wang, Bryan Hooi.

2022 International Joint Conference on Neural Networks (IJCNN), 2022.

PDF Code

2021

Adaptive data augmentation on temporal graphs

Yiwei Wang, Yujun Cai, Yuxuan Liang, Henghui Ding, Changhu Wang, Siddharth Bhatia, Bryan Hooi.

Advances in Neural Information Processing Systems, 2021.

PDF Code
Direct multi-view multi-person 3D pose estimation

Jianfeng Zhang, Yujun Cai, Shuicheng Yan, Jiashi Feng, others.

Advances in Neural Information Processing Systems, 2021.

PDF Code
A unified 3D human motion synthesis model via conditional variational auto-encoder

Yujun Cai, Yiwei Wang, Yiheng Zhu, Tat-Jen Cham, Jianfei Cai, Junsong Yuan, Jun Liu, Chuanxia Zheng, Sijie Yan, Henghui Ding, others.

Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2021.

PDF Code
Structure-aware label smoothing for graph neural networks

Yiwei Wang, Yujun Cai, Yuxuan Liang, Wei Wang, Henghui Ding, Muhao Chen, Jing Tang, Bryan Hooi.

arXiv preprint arXiv:2112.00499, 2021.

PDF Code
Time-aware neighbor sampling for temporal graph networks

Yiwei Wang, Yujun Cai, Yuxuan Liang, Henghui Ding, Changhu Wang, Bryan Hooi.

The 2022 International Joint Conference on Neural Networks (IJCNN), 2021.

PDF Code
Curgraph: Curriculum learning for graph classification

Yiwei Wang, Wei Wang, Yuxuan Liang, Yujun Cai, Bryan Hooi.

Proceedings of the Web Conference 2021, 2021.

PDF Code
Mixup for node and graph classification

Yiwei Wang, Wei Wang, Yuxuan Liang, Yujun Cai, Bryan Hooi.

Proceedings of the Web Conference 2021, 2021.

PDF Code
Progressive supervision for node classification

Yiwei Wang, Wei Wang, Yuxuan Liang, Yujun Cai, Bryan Hooi.

Machine Learning and Knowledge Discovery in Databases: European Conference, ECML PKDD 2020, Ghent, Belgium, September 14–18, 2020, Proceedings, Part I, 2021.

PDF Code

2020

Learning progressive joint propagation for human motion prediction

Yujun Cai, Lin Huang, Yiwei Wang, Tat-Jen Cham, Jianfei Cai, Junsong Yuan, Jun Liu, Xu Yang, Yiheng Zhu, Xiaohui Shen, others.

Computer Vision – ECCV 2020: 16th European Conference, Glasgow, UK, August 23–28, 2020, Proceedings, Part VII, 2020.

PDF Code
3D hand pose estimation using synthetic data and weakly labeled RGB images

Yujun Cai, Liuhao Ge, Jianfei Cai, Nadia Magnenat Thalmann, Junsong Yuan.

IEEE Transactions on Pattern Analysis and Machine Intelligence, 2020.

PDF Code
Graphcrop: Subgraph cropping for graph classification

Yiwei Wang, Wei Wang, Yuxuan Liang, Yujun Cai, Bryan Hooi.

arXiv preprint arXiv:2009.10564, 2020.

PDF Code
Detecting implementation bugs in graph convolutional network based node classifiers

Yiwei Wang, Wei Wang, Yujun Cai, Bryan Hooi, Beng Chin Ooi.

2020 IEEE 31st International Symposium on Software Reliability Engineering (ISSRE), 2020.

PDF Code
Nodeaug: Semi-supervised node classification with data augmentation

Yiwei Wang, Wei Wang, Yuxuan Liang, Yujun Cai, Juncheng Liu, Bryan Hooi.

Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2020.

PDF Code
DeepEMD: Few-shot image classification with differentiable earth mover's distance and structured classifiers

Chi Zhang, Yujun Cai, Guosheng Lin, Chunhua Shen.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2020.

PDF Code

2019

Exploiting spatial-temporal relationships for 3D pose estimation via graph convolutional networks

Yujun Cai, Liuhao Ge, Jun Liu, Jianfei Cai, Tat-Jen Cham, Junsong Yuan, Nadia Magnenat Thalmann.

Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2019.

PDF Code

2018

Weakly-supervised 3D hand pose estimation from monocular RGB images

Yujun Cai, Liuhao Ge, Jianfei Cai, Junsong Yuan.

Proceedings of the European Conference on Computer Vision (ECCV), 2018.

PDF Code
Hand PointNet: 3D hand pose estimation using point sets

Liuhao Ge, Yujun Cai, Junwu Weng, Junsong Yuan.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2018.

PDF Code
