Homepage
Publications
Services
Teaching
Yujun Cai
Lecturer (Assistant Professor)
The University of Queensland
vanora.caiyj@gmail.com / yujun.cai@uq.edu.au
Publications
[
Google Scholar
]
[
DBLP
]
2025
HAIF-GS: Hierarchical and Induced Flow-Guided Gaussian Splatting for Dynamic Scene
Jianing Chen, Zehao Li, Yujun Cai, Hao Jiang, Chengxuan Qian, Juyuan Kang, Shuqin Gao, Honglong Zhao, Tianlu Mao, Yucheng Zhang
.
NeurIPS, 2025.
PDF
Code
VistaWise: Building Cost-Effective Agent with Cross-Modal Knowledge Graph for Minecraft
Honghao Fu, Junlong Ren, Qi Chai, Deheng Ye, Yujun Cai, Hao Wang
.
EMNLP, 2025.
PDF
Code
MRFD: Multi-Region Fusion Decoding with Self-Consistency for Mitigating Hallucinations in LVLMs
Haonan Ge, Yiwei Wang, Ming-Hsuan Yang, Yujun Cai
.
EMNLP, 2025.
PDF
Code
Making every step effective: Jailbreaking large vision-language models through hierarchical kv equalization
Shuyang Hao, Yiwei Wang, Bryan Hooi, Jun Liu, Muhao Chen, Zi Huang, Yujun Cai
.
EMNLP, 2025.
PDF
Code
SemVink: Advancing VLMs' Semantic Understanding of Optical Illusions via Visual Global Thinking
Sifan Li, Yujun Cai, Yiwei Wang
.
EMNLP, 2025.
PDF
Code
Understanding GUI Agent Localization Biases through Logit Sharpness
Xingjian Tao, Yiwei Wang, Yujun Cai, Zhicheng Yang, Jing Tang
.
EMNLP, 2025.
PDF
Code
DiMo-GUI: Advancing Test-time Scaling in GUI Grounding via Modality-Aware Visual Reasoning
Hang Wu, Hongkai Chen, Yujun Cai, Chang Liu, Qingwen Ye, Ming-Hsuan Yang, Yiwei Wang
.
EMNLP, 2025.
PDF
Code
Mapping the Minds of LLMs: A Graph-Based Analysis of Reasoning LLM
Zhen Xiong, Yujun Cai, Zhecheng Li, Yiwei Wang
.
EMNLP, 2025.
PDF
Code
Exploring Visual Vulnerabilities via Multi-Loss Adversarial Search for Jailbreaking Vision-Language Models
Shuyang Hao, Bryan Hooi, Jun Liu, Kai-Wei Chang, Zi Huang, Yujun Cai
.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2025.
PDF
Code
LatentHOI: On the Generalizable Hand Object Motion Generation with Latent Hand Diffusion
Muchen Li, Sammy Christen, Chengde Wan, Yujun Cai, Renjie Liao, Leonid Sigal, Shugao Ma
.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2025.
PDF
Code
DRS: Deep Question Reformulation With Structured Output
Zhecheng Li, Yiwei Wang, Bryan Hooi, Yujun Cai, Nanyun Peng, Kai-Wei Chang
.
ACL, 2025.
PDF
Code
Vulnerability of LLMs to Vertically Aligned Text Manipulations
Zhecheng Li, Yiwei Wang, Bryan Hooi, Yujun Cai, Zhen Xiong, Nanyun Peng, Kai-wei Chang
.
ACL, 2025.
PDF
Code
Texture or Semantics? Vision-Language Models Get Lost in Font Recognition
Zhecheng Li, Guoxian Song, Yujun Cai, Zhen Xiong, Junsong Yuan, Yiwei Wang
.
COLM, 2025.
PDF
Code
Text Speaks Louder than Vision: ASCII Art Reveals Textual Biases in Vision-Language Models
Zhaochen Wang, Bryan Hooi, Yiwei Wang, Ming-Hsuan Yang, Zi Huang, Yujun Cai
.
COLM, 2025.
PDF
Code
How does Watermarking Affect Visual Language Models in Document Understanding?
Chunxue Xu, Yiwei Wang, Bryan Hooi, Yujun Cai, Songze Li
.
COLM, 2025.
PDF
Code
Tricking Retrievers with Influential Tokens: An Efficient Black-Box Corpus Poisoning Attack
Cheng Wang, Yiwei Wang, Yujun Cai, Bryan Hooi
.
Proceedings of the 2025 Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), 2025.
PDF
Code
Con-ReCall: Detecting Pre-training Data in LLMs via Contrastive Decoding
Cheng Wang, Yiwei Wang, Bryan Hooi, Yujun Cai, Nanyun Peng, Kai-Wei Chang
.
International Conference on Computational Linguistics (COLING), 2025.
PDF
Code
Enhancing LLM Character-Level Manipulation via Divide and Conquer
Zhen Xiong, Yujun Cai, Bryan Hooi, Nanyun Peng, Kai-Wei Chang, Zhecheng Li, Yiwei Wang
.
arXiv preprint arXiv:2502.08180, 2025.
PDF
Code
Lost in Edits? A \lambda -Compass for AIGC Provenance
Wenhao You, Bryan Hooi, Yiwei Wang, Euijin Choo, Ming-Hsuan Yang, Junsong Yuan, Zi Huang, Yujun Cai
.
arXiv preprint arXiv:2502.04364, 2025.
PDF
Code
2024
NeurIPS
DisC-GS: Discontinuity-aware Gaussian Splatting
Haoxuan Qu, Zhuoling Li, Hossein Rahmani, Yujun Cai, Jun Liu
.
Advances in Neural Information Processing Systems (NeurIPS), 2024.
PDF
Code
emg2pose: A Large and Diverse Benchmark for Surface Electromyographic Hand Pose Estimation
Sasha Salter, Richard Warren, Collin Schlager, Adrian Spurr, Shangchen Han, Rohin Bhasin, Yujun Cai, Peter Walkington, Anuoluwapo Bolarinwa, Robert Wang, others
.
Advances in Neural Information Processing Systems Dataset and Benmark Track (NeurIPS), 2024.
PDF
Code
Energy-Calibrated VAE with Test Time Free Lunch
Yihong Luo, Siya Qiu, Xingjian Tao, Yujun Cai, Jing Tang
.
European Conference on Computer Vision (ECCV), 2024.
PDF
Code
CVPR
Llms are good action recognizers
Haoxuan Qu, Yujun Cai, Jun Liu
.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024.
PDF
Code
6d-diff: A keypoint diffusion framework for 6d object pose estimation
Li Xu, Haoxuan Qu, Yujun Cai, Jun Liu
.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024.
PDF
Code
Vulnerability of LLMs to Vertically Aligned Text Manipulations
Zhecheng Li, Yiwei Wang, Bryan Hooi, Yujun Cai, Zhen Xiong, Nanyun Peng, Kai-Wei Chang
.
ACL, 2024.
PDF
Code
STMG: A Machine Learning Microgesture Recognition System for Supporting Thumb-Based VR/AR Input
Kenrick Kin, Chengde Wan, Ken Koh, Andrei Marin, Necati Cihan Camgoz, Yubo Zhang, Yujun Cai, Fedor Kovalev, Moshe Ben-Zacharia, Shannon Hoople, others
.
Proceedings of the CHI Conference on Human Factors in Computing Systems (CHI), 2024.
PDF
Code
GradiSeg: Gradient-Guided Gaussian Segmentation with Enhanced 3D Boundary Precision
Zehao Li, Wenwei Han, Yujun Cai, Hao Jiang, Baolong Bi, Shuqin Gao, Honglong Zhao, Zhaoqi Wang
.
arXiv preprint arXiv:2412.00392, 2024.
PDF
Code
Think Carefully and Check Again! Meta-Generation Unlocking LLMs for Low-Resource Cross-Lingual Summarization
Zhecheng Li, Yiwei Wang, Bryan Hooi, Yujun Cai, Naifan Cheung, Nanyun Peng, Kai-wei Chang
.
arXiv preprint arXiv:2410.20021, 2024.
PDF
Code
Off-the-shelf ChatGPT is a Good Few-shot Human Motion Predictor
Haoxuan Qu, Zhaoyang He, Zeyu Hu, Yujun Cai, Jun Liu
.
arXiv preprint arXiv:2405.15267, 2024.
PDF
Code
Are LLMs Really Not Knowledgable? Mining the Submerged Knowledge in LLMs' Memory
Xingjian Tao, Yiwei Wang, Yujun Cai, Zhicheng Yang, Jing Tang
.
arXiv preprint arXiv:2412.20846, 2024.
PDF
Code
2023
Primacy effect of chatgpt
Yiwei Wang, Yujun Cai, Muhao Chen, Yuxuan Liang, Bryan Hooi
.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023.
PDF
Code
A Characteristic Function-Based Method for Bottom-Up Human Pose Estimation
Haoxuan Qu, Yujun Cai, Lin Geng Foo, Ajay Kumar, Jun Liu
.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023.
PDF
Code
Social diffusion: Long-term multiple human motion anticipation
Julian Tanke, Linguang Zhang, Amy Zhao, Chengcheng Tang, Yujun Cai, Lezi Wang, Po-Chen Wu, Juergen Gall, Cem Keskin
.
Proceedings of the IEEE/CVF International Conference on Computer Vision (CVPR), 2023.
PDF
Code
How Fragile is Relation Extraction under Entity Replacements?
Yiwei Wang, Bryan Hooi, Fei Wang, Yujun Cai, Yuxuan Liang, Wenxuan Zhou, Jing Tang, Manjuan Duan, Muhao Chen
.
CONLL, 2023.
PDF
Code
2022
Heatmap distribution matching for human pose estimation
Haoxuan Qu, Li Xu, Yujun Cai, Lin Geng Foo, Jun Liu
.
Advances in Neural Information Processing Systems, 2022.
PDF
Code
Geometry-guided progressive nerf for generalizable and efficient neural human rendering
Mingfei Chen, Jianfeng Zhang, Xiangyu Xu, Lijuan Liu, Yujun Cai, Jiashi Feng, Shuicheng Yan
.
European Conference on Computer Vision, 2022.
PDF
Code
Graphcache: Message passing as caching for sentence-level relation extraction
Yiwei Wang, Muhao Chen, Wenxuan Zhou, Yujun Cai, Yuxuan Liang, Bryan Hooi
.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), 2022.
PDF
Code
Should we rely on entity mentions for relation extraction? debiasing relation extraction with counterfactual analysis
Yiwei Wang, Muhao Chen, Wenxuan Zhou, Yujun Cai, Yuxuan Liang, Dayiheng Liu, Baosong Yang, Juncheng Liu, Bryan Hooi
.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), 2022.
PDF
Code
MonaGO: a novel gene ontology enrichment analysis visualisation system
Ziyin Xin, Yujun Cai, Louis T Dang, Hannah MS Burke, Jerico Revote, Natalie Charitakis, Denis Bienroth, Hieu T Nim, Yuan-Fang Li, Mirana Ramialison
.
BMC bioinformatics, 2022.
PDF
Code
Deepemd: Differentiable earth mover' s distance for few-shot learning
Chi Zhang, Yujun Cai, Guosheng Lin, Chunhua Shen
.
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2022.
PDF
Code
UmeTrack: Unified multi-view end-to-end hand tracking for VR
Shangchen Han, Po-chen Wu, Yubo Zhang, Beibei Liu, Linguang Zhang, Zheng Wang, Weiguang Si, Peizhao Zhang, Yujun Cai, Tomas Hodan, others
.
SIGGRAPH Asia 2022 conference papers, 2022.
PDF
Code
Time-Aware Neighbor Sampling on Temporal Graphs
Yiwei Wang, Yujun Cai, Yuxuan Liang, Henghui Ding, Changhu Wang, Bryan Hooi
.
2022 International Joint Conference on Neural Networks (IJCNN), 2022.
PDF
Code
2021
Adaptive data augmentation on temporal graphs
Yiwei Wang, Yujun Cai, Yuxuan Liang, Henghui Ding, Changhu Wang, Siddharth Bhatia, Bryan Hooi
.
Advances in Neural Information Processing Systems, 2021.
PDF
Code
Direct multi-view multi-person 3d pose estimation
Jianfeng Zhang, Yujun Cai, Shuicheng Yan, Jiashi Feng, others
.
Advances in Neural Information Processing Systems, 2021.
PDF
Code
A unified 3d human motion synthesis model via conditional variational auto-encoder
Yujun Cai, Yiwei Wang, Yiheng Zhu, Tat-Jen Cham, Jianfei Cai, Junsong Yuan, Jun Liu, Chuanxia Zheng, Sijie Yan, Henghui Ding, others
.
Proceedings of the IEEE/CVF International Conference on Computer Vision (CVPR), 2021.
PDF
Code
Structure-aware label smoothing for graph neural networks
Yiwei Wang, Yujun Cai, Yuxuan Liang, Wei Wang, Henghui Ding, Muhao Chen, Jing Tang, Bryan Hooi
.
arXiv preprint arXiv:2112.00499, 2021.
PDF
Code
Time-aware neighbor sampling for temporal graph networks
Yiwei Wang, Yujun Cai, Yuxuan Liang, Henghui Ding, Changhu Wang, Bryan Hooi
.
The 2022 International Joint Conference on Neural Networks (IJCNN), 2021.
PDF
Code
Curgraph: Curriculum learning for graph classification
Yiwei Wang, Wei Wang, Yuxuan Liang, Yujun Cai, Bryan Hooi
.
Proceedings of the Web Conference 2021, 2021.
PDF
Code
Mixup for node and graph classification
Yiwei Wang, Wei Wang, Yuxuan Liang, Yujun Cai, Bryan Hooi
.
Proceedings of the Web Conference 2021, 2021.
PDF
Code
Progressive supervision for node classification
Yiwei Wang, Wei Wang, Yuxuan Liang, Yujun Cai, Bryan Hooi
.
Machine Learning and Knowledge Discovery in Databases: European Conference, ECML PKDD 2020, Ghent, Belgium, September 14--18, 2020, Proceedings, Part I, 2021.
PDF
Code
2020
Learning progressive joint propagation for human motion prediction
Yujun Cai, Lin Huang, Yiwei Wang, Tat-Jen Cham, Jianfei Cai, Junsong Yuan, Jun Liu, Xu Yang, Yiheng Zhu, Xiaohui Shen, others
.
Computer Vision--ECCV 2020: 16th European Conference, Glasgow, UK, August 23--28, 2020, Proceedings, Part VII 16, 2020.
PDF
Code
3D hand pose estimation using synthetic data and weakly labeled RGB images
Yujun Cai, Liuhao Ge, Jianfei Cai, Nadia Magnenat Thalmann, Junsong Yuan
.
IEEE transactions on pattern analysis and machine intelligence, 2020.
PDF
Code
Graphcrop: Subgraph cropping for graph classification
Yiwei Wang, Wei Wang, Yuxuan Liang, Yujun Cai, Bryan Hooi
.
arXiv preprint arXiv:2009.10564, 2020.
PDF
Code
Detecting implementation bugs in graph convolutional network based node classifiers
Yiwei Wang, Wei Wang, Yujun Ca, Bryan Hooi, Beng Chin Ooi
.
2020 IEEE 31st International Symposium on Software Reliability Engineering (ISSRE), 2020.
PDF
Code
Nodeaug: Semi-supervised node classification with data augmentation
Yiwei Wang, Wei Wang, Yuxuan Liang, Yujun Cai, Juncheng Liu, Bryan Hooi
.
Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2020.
PDF
Code
Deepemd: Few-shot image classification with differentiable earth mover' s distance and structured classifiers
Chi Zhang, Yujun Cai, Guosheng Lin, Chunhua Shen
.
Proceedings of the IEEE/CVF conference on computer vision and pattern recognition (CVPR), 2020.
PDF
Code
2019
Exploiting spatial-temporal relationships for 3d pose estimation via graph convolutional networks
Yujun Cai, Liuhao Ge, Jun Liu, Jianfei Cai, Tat-Jen Cham, Junsong Yuan, Nadia Magnenat Thalmann
.
Proceedings of the IEEE/CVF International Conference on Computer Vision (CVPR), 2019.
PDF
Code
2018
Weakly-supervised 3d hand pose estimation from monocular rgb images
Yujun Cai, Liuhao Ge, Jianfei Cai, Junsong Yuan
.
Proceedings of the European conference on computer vision (ECCV), 2018.
PDF
Code
Hand pointnet: 3d hand pose estimation using point sets
Liuhao Ge, Yujun Cai, Junwu Weng, Junsong Yuan
.
Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR), 2018.
PDF
Code
Last Update: