I am a PhD candidate in Computer Science at Case Western Reserve University (CWRU), where I am advised by Prof. Yu Yin.

Prior to that, I was a visiting student at ShanghaiTech University, supervised by Prof. Dinggang Shen. I received my M.S. in Information Science from University of Pittsburgh (Pitt) in 2022, supervised by Prof. Yu-Ru Lin. I received my B.S. in Computing and Information Science from Guangdong University of Technology, supervised by Prof. Weihua He.

I have broad research interests in Computer Vision and Vision-Language Models, with a particular focus on advancing spatial intelligence in the next generation of AI systems. Google Scholar citations

πŸ“š Selected Publications

2025.10
sym

Spatial Intelligence in Vision-Language Models: A Comprehensive Survey

Disheng Liu, Tuo Liang, Zhe Hu, Jierui Peng, Yiren Lu, Yi Xu, Yun Fu, Yu Yin; GitHub

  • Vision-Language Models (VLMs) have achieved great success but still lack spatial intelligence, and this survey provides the first unified overview of recent advances, taxonomies, and evaluations toward building spatially intelligent AI.
2025.06
sym

Balancing Fidelity and Diversity: Synthetic data could stand on the shoulder of the real in visual recognition

Disheng Liu, Tuo Liang, Yu Yin; Github

  • With the rapid progress of generative models, synthetic data has become a common solution to data scarcity in AI. However, is using it directly without curation ideal for visual recognition? We systematically study how data fidelity and diversity affect recognition performance and show that balancing these factors significantly improves results through a training-free curation pipeline.
2025.03
sym

CAUSAL3D: A Comprehensive Benchmark for Causal Learning from Visual Data

Disheng Liu*, Yiran Qiao*, Wuche Liu, Yiren Lu, Yunlai Zhou, Tuo Liang, Yu Yin, Jing Ma; *Equal contribution; Datasets

  • True intelligence relies on understanding hidden causal relations, yet current AI and vision models lack benchmarks to assess this ability. We introduce Causal3D, a comprehensive 19-dataset benchmark linking structured and visual data to evaluate causal reasoning, revealing that performance drops sharply as causal complexity increases.

πŸ’» Working Experience

πŸ“ Servicing

Reviewer for

ICLR’26, CVPR’26

πŸŽ“ Teaching

Teaching Assistant

β€’ Fall 2025 β€” CSDS 465: Computer Vision (Instructor: Yu Yin)

β€’ Spring 2025 β€” CSDS 425: Computer Networks (Instructor: An Wang)

β€’ Fall 2024 β€” CSDS 425: Computer Networks (Instructor: Mark Allman)