Teaching Information
Teaching Assistant of 'Algorithmic Thinking' (Fall 2024, Spring 2025) at University of Pittsburgh. Guest Lecturer and Co-Designer of 'Neural Signal Modeling and Analysis' (Fall 2023, Fall 2024) at University of Pittsburgh
Education
- B.S, Electrical Engineering, Wuhan University, 2019
- M.S., Computer Science, Wuhan University, 2022
- Ph.D., Electrical and Computer Engineering, University of Pittsburgh, 2025
Awards and Honors
- MICCAI STAR Award, 2023
Current Projects
Trustworthy Large Language Models
This project develops uncertainty estimation and calibration methods that make large language models reliable enough for high-stakes deployment. The aim is for models to know what they don't know — producing calibrated confidence, flagging unreliable generations, and abstaining rather than returning confidently wrong answers.
Vision-Language-Action (VLA) Models for Robotic-Arm Manipulation
This project extends uncertainty-aware machine learning to vision-language-action (VLA) models for robotic-arm manipulation, where overconfident predictions translate into physical risk. The focus is on quantifying uncertainty over predicted actions and conditioning execution on model confidence, improving the safety and robustness of language-conditioned control.
Conditional Factuality Controlled LLMs with Generalization Certificates via Conformal Sampling
2026Large language models need reliable test-time control of hallucinations. We propose Conditional Factuality Control (CFC), a post-hoc conformal framework that uses feature-conditional thresholds to produce set-valued outputs with conditional coverage guarantees. Compared with marginal conformal methods, CFC adapts to prompt difficulty, improving coverage on hard prompts while reducing prediction-set size. We also introduce CFC-PAC, which provides finite-sample risk certificates. Experiments on synthetic, QA/reasoning, and VLM benchmarks show near-target coverage with smaller sets than baselines.
Uncertainty Regularized Evidential Regression
2024The Evidential Regression Network (ERN) combines deep learning with Dempster-Shafer theory to jointly predict a target and quantify its uncertainty. However, the non-negativity that the theory enforces through specific activation functions prevents the model from learning effectively over part of the sample space, which limits performance. This paper characterizes that limited-learning region, analyzes the ERN to explain why the constraint arises, and introduces a regularization term that lets the network learn from the entire training set. Extensive experiments support the theoretical analysis and demonstrate the effectiveness of the proposed method.
Bidirectional Mapping with Contrastive Learning on Multimodal Neuroimaging Data
2023Deep learning models of the interaction between brain structure and function have proven effective at identifying biomarkers for clinical phenotypes and brain diseases, but most prior work maps in only one direction—structure to function or function to structure—ignoring the intrinsic unity of the two modalities and introducing directional bias. To address this, we propose Bidirectional Mapping with Contrastive Learning (BMCL), which reduces the bias between the two one-way mappings through ROI-level contrastive learning. Evaluated on the HCP and OASIS datasets for clinical phenotype and neurodegenerative disease prediction, BMCL outperforms several state-of-the-art methods.