Research Areas
Current Projects
Intelligent Cyberinfrastructure (ICICLE)
Co-leading the NSF-funded ICICLE institute to advance 'AI-as-a-Service' through a plug-and-play cyberinfrastructure that spans the edge-cloud-HPC continuum to democratize AI.
Trustworthy AI & Speech Recognition
Evaluating the reliability and human-likeness of AI-generated voice clones and ASR scoring methods to ensure robustness in clinical and hearing research applications.
Medical Imaging with Generative AI
Developing unsupervised methods for fully automated segmentation of knee lesions using conditional diffusion models and anomaly detection to eliminate annotator bias in osteoarthritis prognosis.
Materials Data Science
Applying computer vision and deep learning to automate phase identification in synchrotron X-ray diffraction patterns for advanced materials characterization.
Trust the Typical
2026: Current approaches to LLM safety rely on a brittle pattern of identifying and blocking known threats via guardrails. This paper introduces Trust The Typical (T3), a framework that reframes safety as an out-of-distribution detection problem, learning the distribution of acceptable prompts in a semantic space and flagging significant deviations as potential threats. Unlike prior methods, T3 requires no training on harmful examples yet achieves state-of-the-art performance across 18 benchmarks spanning toxicity, jailbreaking, multilingual harms, and over-refusal—reducing false positive rates by up to 40× relative to specialized safety models. A single model trained on safe English text transfers effectively to over 14 languages without retraining.
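The core idea can be sketched with a stand-in semantic space; the Mahalanobis scorer, the random "safe" embeddings, and the 99th-percentile threshold below are illustrative assumptions, not the paper's actual detector:

```python
import numpy as np

rng = np.random.default_rng(0)
# Stand-in embeddings of "safe" prompts (the paper uses a learned semantic space)
safe = rng.normal(0.0, 1.0, size=(500, 8))

mu = safe.mean(axis=0)
cov = np.cov(safe, rowvar=False) + 1e-6 * np.eye(8)
cov_inv = np.linalg.inv(cov)

def typicality_score(x):
    """Squared Mahalanobis distance to the safe distribution;
    large values mean the prompt is atypical."""
    d = x - mu
    return float(d @ cov_inv @ d)

# Flag prompts scoring beyond, e.g., the 99th percentile of safe-data scores
threshold = np.percentile([typicality_score(s) for s in safe], 99)

far_out = typicality_score(np.full(8, 10.0))  # a clearly atypical point
```

Note that nothing here is trained on harmful examples: the detector only models what "typical" looks like, which is the reframing the paper argues for.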
Real-Time Online Learning Trajectory Prediction via Efficient Latent Predictor
2026: Presents an efficient latent predictor for real-time online trajectory prediction in autonomous vehicles, achieving high accuracy with reduced computational overhead.
Using AI to Increase Efficiency of Multilingual Test Materials: Spanish BEL Sentences
2026: This work-in-progress explores how AI can improve the efficiency of creating multilingual auditory test materials, with a focus on Spanish BEL sentences. The project investigates workflow acceleration and quality support for multilingual assessment design. It sits at the intersection of language technology, hearing research, and educational test development. The aim is to reduce manual burden while preserving the validity of test materials.
Scorio.jl: A Julia package for ranking stochastic responses
2026: Scorio.jl is a Julia package for evaluating and ranking systems from repeated stochastic responses on shared tasks. It provides a common tensor-based interface for direct score-based, pairwise, psychometric, voting, graph, and listwise ranking methods. The package supports methodological studies of ranking stability as well as day-to-day leaderboard construction. It makes ranking under repeated stochastic observation easier to analyze across different assumptions and ranking families.
Ranking Reasoning LLMs under Test-Time Scaling
2026: This paper studies how to rank reasoning large language models when evaluation uses multiple stochastic samples per prompt under test-time scaling. It formalizes dense benchmark ranking in this repeated-trial setting and introduces Scorio, a library that implements Bayesian, paired-comparison, psychometric, voting, and spectral ranking methods. Across twenty reasoning models and four Olympiad-style math benchmarks, the study shows that many full-trial rankings closely match a Bayesian gold standard while low-budget methods can be less stable. The results provide practical guidance for reliable model ranking under both high- and low-budget evaluation settings.
QuMod: Parallel Quantum Job Scheduling on Modular QPUs using Circuit Cutting
2026: Presents QuMod, a parallel quantum job scheduling framework for modular QPUs leveraging circuit cutting to improve throughput on heterogeneous quantum hardware.
Quantize What Counts: More for Keys, Less for Values
2026: This work studies asymmetric KV-cache quantization for large language models and shows that key tensors carry more information than value tensors. The analysis motivates allocating more bits and stronger outlier handling to keys than to values, instead of quantizing both sides identically. Experiments show that key-favored bit allocation preserves much more accuracy at the same memory budget. The paper provides both theoretical motivation and practical guidance for more efficient LLM inference.
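As a toy illustration of the bit-width/error trade-off being allocated here, assuming a plain uniform symmetric quantizer and random stand-in tensors (not the paper's scheme or real KV caches):

```python
import numpy as np

def fake_quantize(x, bits):
    """Uniformly quantize to `bits` bits and dequantize, so the
    round-trip error shows how much precision each budget loses."""
    levels = 2 ** (bits - 1) - 1
    scale = np.abs(x).max() / levels
    return np.round(x / scale) * scale

rng = np.random.default_rng(0)
cache = rng.normal(size=(64, 128))  # stand-in for a key or value tensor

# Round-trip MSE at three bit budgets: error shrinks as bits grow
mse = {b: float(np.mean((cache - fake_quantize(cache, b)) ** 2)) for b in (2, 4, 6)}
```

Under a fixed total budget, the paper's finding suggests spending more of this precision on keys than on values rather than splitting it evenly.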
Efficient Transpilation of OpenQASM 3.0 Dynamic Circuits to CUDAQ: Performance and Expressiveness Advantages
2026: Presents an efficient transpilation approach for converting OpenQASM 3.0 dynamic circuits to CUDAQ, demonstrating performance and expressiveness advantages.
Medical Image Spatial Grounding with Semantic Sampling
2026: This work studies spatial grounding for vision-language models in 3D medical imaging, where anatomy, modality, slice direction, and coordinate systems create unique challenges. It introduces MIS-Ground, a benchmark for analyzing failure modes in medical image spatial grounding, and MIS-SemSam, an inference-time semantic sampling method that improves grounding accuracy without retraining. The paper evaluates how visual and textual prompting choices influence grounding performance across clinical imaging settings. It advances reproducible evaluation and practical improvement of medical VLM grounding.
LRD-Net: A Lightweight Real-Centered Detection Network for Cross-Domain Face Forgery Detection
2026: Introduces LRD-Net, a lightweight real-centered detection network that generalizes face forgery and deepfake detection across domains.
Less Prune, MoRE Experts: Recognizing and Restructuring Latent Experts for Model Compression
2026: Proposes recognizing and restructuring latent expert structures within large models for compression, achieving efficiency while preserving accuracy.
K^4-Serve: Robust Streaming Log Anomaly Detection for HPC & AI Infrastructure
2026: K^4-Serve operationalizes the K^4 framework for streaming anomaly detection on production HPC and AI infrastructure logs. It combines Kafka-based ingestion, versioned normalization, sliding-window scoring, retraining, and observability features to support robust real-world deployment. The system achieves stable deployment on real HPC logs with near-perfect event-level detection and only one false alert in the reported study. The work bridges anomaly-detection methodology and production cyberinfrastructure practice.
HugRAG: Hierarchical Causal Knowledge Graph Design for RAG
2026: HugRAG rethinks knowledge organization for graph-based RAG through causal gating across hierarchical modules. It explicitly models causal relationships to suppress spurious correlations while enabling scalable reasoning over large-scale knowledge graphs. Extensive experiments demonstrate that HugRAG consistently outperforms competitive graph-based RAG baselines across multiple datasets and evaluation metrics, establishing a principled foundation for structured, scalable, and causally grounded RAG systems.
Geom@k: Fast to Converge, Slow to Drift
2026: This paper studies evaluation metrics for test-time scaling by separating answer discovery from repeated correctness. It derives Geom@k and the broader GeoSpectrum@K family from a common hypergeometric view of fixed-budget metrics. Across aggregate settings, Geom@2 provides a strong balance of fast convergence and low ranking drift relative to alternative summaries. The work offers a compute-aware perspective on stable evaluation under repeated sampling.
Don't Pass@k: A Bayesian Framework for Large Language Model Evaluation
2026: Pass@k is widely used to report LLM reasoning performance but often yields unstable and misleading rankings, especially when trial counts are limited and compute is constrained. This paper proposes a principled Bayesian evaluation framework that replaces Pass@k with posterior estimates of a model's underlying success probability and credible intervals, using a Dirichlet prior to give closed-form expressions for posterior mean and uncertainty under any weighted rubric. Empirically, on AIME'24/'25, HMMT'25, and BrUMO'25, the Bayesian approach achieves faster convergence and greater rank stability than Pass@k, enabling reliable model comparisons at far smaller sample counts. The framework also naturally extends to graded, rubric-based evaluations, making uncertainty explicit.
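For the binary special case, the closed-form posterior is a Beta distribution (the two-category Dirichlet); a minimal sketch, with the uniform prior as an assumption:

```python
def beta_posterior(successes, trials, alpha=1.0, beta=1.0):
    """Posterior mean and variance of a model's underlying success
    probability under a Beta(alpha, beta) prior (uniform by default)."""
    a = alpha + successes
    b = beta + trials - successes
    mean = a / (a + b)
    var = a * b / ((a + b) ** 2 * (a + b + 1))
    return mean, var

# 7 correct answers out of 10 sampled attempts on a benchmark
mean, var = beta_posterior(7, 10)
```

The posterior variance makes the uncertainty at small trial counts explicit, which is exactly what a raw Pass@k number hides.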
Sweeping Promptable Spoofs under the DirtyRAG
2026: This paper studies security vulnerabilities in retrieval-augmented generation through DirtyRAG, a query-blind benign-passage attack that can be steered by prompting. It shows that promptable spoof passages remain effective against strong defenses and exposes a practical attack surface for real-world RAG systems. The work also introduces RAG-ATag, a benchmark for evaluating RAG security under these attack conditions. It highlights the need for more robust retrieval and generation defenses in deployed LLM systems.
Categorical Evaluation of LLMs under Test-Time Scaling
2026: This work argues that binary pass-based metrics are too coarse for evaluating reasoning models under test-time scaling. It introduces a categorical Bayesian framework that scores rubric-defined outcomes with uncertainty rather than collapsing all outputs into pass-or-fail labels. The study shows that lightweight runtime signals can support accurate categorical evaluation without relying on a judge model and that rubric design can materially change model rankings. The paper extends uncertainty-aware LLM evaluation beyond binary correctness.
Speculative Thinking: Enhancing Small-Model Reasoning with Large Model Guidance at Inference Time
2025: Recent advances in post-training enhance model reasoning but require costly training pipelines and produce inefficient, overly lengthy outputs. This paper introduces Speculative Thinking, a training-free framework enabling large reasoning models to guide smaller ones during inference at the reasoning level—distinct from token-level speculative decoding—by identifying structural cues such as paragraph breaks followed by reflective phrases where small models struggle and delegating those steps to a larger model. The method significantly boosts smaller model reasoning accuracy while shortening output length, offering an efficient inference-time paradigm that preserves the small model's compute efficiency.
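A toy sketch of the delegation trigger; the cue phrases and the plain string matching are illustrative assumptions, not the paper's actual detection rule:

```python
# Illustrative reflective phrases; the paper identifies such cues empirically
REFLECTIVE_CUES = ("Wait,", "Hmm,", "Let me double-check")

def needs_delegation(draft):
    """Heuristic: hand the next reasoning step to the large model when
    a paragraph break is followed by a reflective phrase."""
    for para in draft.split("\n\n")[1:]:
        if para.lstrip().startswith(REFLECTIVE_CUES):
            return True
    return False

small_model_draft = "Compute 12*13.\n\nWait, I should re-derive that product."
```

Because the check runs on already-generated text, the small model keeps doing the bulk of the decoding and the large model is consulted only at these structural cue points.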
Novel Adaptation of Video Segmentation to 3D MRI: Efficient Zero-Shot Knee Segmentation with SAM2
2025: Medical image segmentation methods face the challenge of domain transfer, where performance degrades due to distribution shifts between source and target domains. This paper adapts SAM2, a general-purpose video segmentation model, for zero-shot single-prompt 3D knee MRI segmentation by treating volumetric slices as individual video frames and leveraging SAM2's memory mechanism to generate motion- and spatially-aware predictions across the volume. Experiments on the OAI-ZIB dataset demonstrate a Dice similarity coefficient of 0.9643 on the tibia using only a single prompt and no task-specific training or fine-tuning.
QuFlex: Parallel Quantum Job Scheduling Using Adaptive Circuit-Cutting
2025: Parallel quantum job scheduling across multiple QPUs is critical for maximizing throughput in heterogeneous quantum computing environments. QuFlex introduces an adaptive circuit-cutting approach that dynamically partitions quantum circuits based on available QPU resources, enabling efficient parallel scheduling across heterogeneous quantum hardware. The framework demonstrates improved QPU utilization and reduced job completion times compared to static partitioning approaches.
MQuAKE-Remastered: Multi-Hop Knowledge Editing Can Only Be Advanced with Reliable Evaluations
2025: Multi-hop knowledge editing in LLMs has been evaluated using benchmarks with unreliable protocols that conflate editing success with benchmark artifacts, producing misleading results. This paper presents MQuAKE-Remastered, which corrects systematic flaws in prior multi-hop knowledge editing assessments and demonstrates that reliable evaluation methodology is largely absent—and essential—for advancing the field. Accepted as a Spotlight at ICLR 2025, the work shows that many reported gains in multi-hop editing do not hold under rigorous evaluation, calling for a reset of evaluation standards.
Masked-speech recognition using human and synthetic cloned speech
2025: This study evaluates the intelligibility and human-likeness of AI-generated voice clones compared to human speech. Using transformer-based language models, the research demonstrates that synthetic speech can achieve similar recognition scores and perceptual similarity to original human talkers, even in noisy environments. The findings suggest that voice synthesis and automatic speech recognition (ASR) are promising tools for evaluating speech recognition in both clinical audiology and hearing research.
LoRATK: LoRA Once, Backdoor Everywhere in the Share-and-Play Ecosystem
2025: Fine-tuning LLMs with LoRA has created a convenient share-and-play ecosystem where users download community-shared adapters to enhance base models, but this also introduces a new attack surface for distributing malicious LoRAs. This paper demonstrates that a backdoor LoRA can be trained once and then seamlessly merged in a training-free fashion with multiple task-enhancing LoRAs, retaining both malicious behavior and legitimate downstream capabilities. Such merged LoRAs are particularly dangerous because malicious intent is concealed behind improved downstream performance, creating strong incentive for voluntary adoption, and no safety measures exist to intervene during local deployment.
Longer Context, Deeper Thinking: Uncovering the Role of Long-Context Ability in Reasoning
2025: Recent language models exhibit strong reasoning capabilities, yet the influence of long-context capacity on reasoning remains underexplored. This paper hypothesizes that current reasoning limitations stem partly from insufficient long-context capacity, motivated by observations that higher context window lengths correlate with stronger reasoning performance and that failed reasoning cases resemble failed long-context cases. Controlled experiments comparing architecturally identical models with varying long-context capacities confirm that enhancing long-context ability before supervised fine-tuning leads to improved reasoning, advocating for long-context capacity as a first-class design objective.
Labeling Copilot: A Deep Research Agent for Automated Data Curation in Computer Vision
2025: Curating high-quality, domain-specific datasets is a major bottleneck for deploying robust vision systems. This paper introduces Labeling Copilot, the first data curation deep research agent for computer vision, powered by a large multimodal language model that uses multi-step reasoning to execute specialized tools across three core capabilities: Calibrated Discovery for sourcing in-distribution data from large repositories, Controllable Synthesis for generating rare-scenario data with robust filtering, and Consensus Annotation for producing accurate labels via a novel multi-model consensus mechanism. On the dense COCO dataset, the Consensus Annotation module achieves an annotation mAP of 37.1%, and on Open Images it discovers 903 new bounding box categories.
K4: Online Log Anomaly Detection via Unsupervised Typicality Learning
2025: Existing log anomaly detection methods are often slow, dependent on error-prone parsing, and use unrealistic evaluation protocols. This paper introduces K4 (Knowing the Unknown by Knowing only the Known), a fully unsupervised, parser-independent framework that transforms arbitrary log embeddings into compact four-dimensional descriptors—Precision, Recall, Density, Coverage—using efficient k-nearest neighbor statistics. Under a realistic online chunk-based evaluation protocol, K4 achieves state-of-the-art AUROC of 0.995–0.999 across HDFS, BGL, and Thunderbird datasets, with training under 4 seconds and inference as low as 4 μs.
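A much-simplified kNN typicality statistic in the same spirit (distance to the k-th nearest known-normal embedding); the random embeddings are stand-ins, and the paper's four descriptors are richer than this single number:

```python
import numpy as np

rng = np.random.default_rng(1)
train = rng.normal(size=(200, 4))  # stand-in embeddings of known-normal log lines

def knn_radius(x, data, k=5):
    """Distance to the k-th nearest neighbor in the normal data:
    small for typical points, large for anomalies."""
    d = np.sqrt(((data - x) ** 2).sum(axis=1))
    return float(np.sort(d)[k - 1])

normal_score = knn_radius(train[0], train, k=5)   # self-distance 0 is included
anomaly_score = knn_radius(np.full(4, 8.0), train, k=5)
```

Because the statistic only compares against known-normal data, it needs no anomaly labels and no log parser, matching the fully unsupervised, parser-independent setting above.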
HOPPS: Hardware-Aware Optimal Phase Polynomial Synthesis with Blockwise Optimization for Quantum Circuits
2025: Blocks composed of CNOT and Rz gates are ubiquitous in modern quantum applications such as QAOA ansatzes and quantum adders, but after compilation they often exhibit large CNOT counts or depths that lower fidelity. This paper introduces HOPPS, a SAT-based hardware-aware optimal phase polynomial synthesis algorithm that generates CNOT/Rz blocks with CNOT count or depth optimality under hardware topology constraints. To address scalability for large circuits, an iterative blockwise optimization strategy partitions large circuits into smaller blocks and optimally refines each—achieving CNOT count reductions up to 50% and depth reductions up to 57.1% when used as a peephole optimizer.
Grammars of Formal Uncertainty: When to Trust LLMs in Automated Reasoning Tasks
2025: Large language models show remarkable promise for automated reasoning by generating formal specifications, but a fundamental tension exists between their probabilistic nature and the deterministic guarantees required by formal verification. This paper comprehensively investigates failure modes and uncertainty quantification in LLM-generated formal artifacts, revealing that SMT-based autoformalization has highly domain-specific accuracy impacts ranging from +34.8% on logical tasks to −44.5% on factual ones. A probabilistic context-free grammar (PCFG) framework is introduced to model LLM outputs and yield a refined uncertainty taxonomy, finding that uncertainty signals are task-dependent—for example, grammar entropy for logic achieves AUROC > 0.93.
Efficient Fine-Grained GPU Performance Modeling for Distributed Deep Learning of LLM
2025: Training large language models is one of the most compute-intensive tasks in HPC, and predicting end-to-end training time for multi-billion parameter models across hundreds of GPUs is challenging due to complex interactions between transformer components, parallelism strategies, and multi-tier communication. This paper addresses this by decomposing LLMs into core computational primitives and modeling them with operator-level decomposition, lightweight hardware-aware prediction models for key operations, and an end-to-end prediction system integrating these across complex parallelization strategies. The resulting framework enables accurate distributed LLM training performance prediction without costly full-scale sampling.
Forte: Finding Outliers with Representation Typicality Estimation
2025: Generative models can now produce photorealistic synthetic data virtually indistinguishable from real training data, challenging OOD detectors that rely on generative model likelihoods due to likelihood misestimation and typicality issues. This paper introduces Forte, which hypothesizes that estimating typical sets using self-supervised learners leads to better OOD detection, using representation learning and informative summary statistics based on manifold estimation to address these issues. Forte outperforms other unsupervised approaches and achieves state-of-the-art performance on established challenging benchmarks as well as new synthetic data detection tasks, requiring no class labels.
Flexible Group Count Enables Hassle-Free Structured Pruning
2025: Densely structured pruning methods maintain pruned models in a fully dense format, allowing immediate compression benefits, but existing grouped kernel pruning approaches introduce dynamic operations that add complications or impose limitations such as requiring expensive clustering schemes or custom architecture support. This paper argues that making Conv2d group count flexible under an integral optimization is the best practice for grouped kernel pruning, leveraging its ideal alignment with grouped convolution infrastructure. The resulting one-shot, post-train, data-agnostic method is more performant, adaptive, and user-friendly than its predecessors, requiring little to no hyperparameter tuning or handcrafted criteria.
70% Size, 100% Accuracy: Lossless LLM Compression for Efficient GPU Inference via Dynamic-Length Float
2025: Large-scale AI models have grown rapidly in size, creating significant challenges for deployment on resource-constrained hardware. This paper introduces Dynamic-Length Float (DFloat11), a lossless compression framework that reduces LLM size by 30% while preserving outputs that are bit-for-bit identical to the original model, exploiting the low entropy in BFloat16 weight representations through entropy coding and dynamic-length encodings. A custom GPU kernel enables fast online decompression, and experiments on Llama 3.3, Qwen 3, and Mistral 3 validate 30% size reduction with 2.3–46.2× higher throughput than CPU offloading—notably enabling lossless inference of Llama 3.1 405B on a single 8×80GB GPU node.
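The entropy-coding intuition can be illustrated with a toy skewed symbol stream standing in for BFloat16 exponent fields; the counts below are made up for illustration, not measured from real weights:

```python
import math
from collections import Counter

def shannon_entropy_bits(symbols):
    """Average bits per symbol achievable by an ideal entropy code."""
    counts = Counter(symbols)
    n = len(symbols)
    return -sum(c / n * math.log2(c / n) for c in counts.values())

# Trained weights cluster around a few exponent values, so the
# 8-bit exponent field carries far less than 8 bits of information.
exponents = [126] * 700 + [125] * 200 + [127] * 90 + [120] * 10

bits = shannon_entropy_bits(exponents)
```

When the achievable bits-per-symbol is far below the 8 bits actually stored, replacing fixed-width fields with dynamic-length codes yields lossless compression, which is the mechanism the framework exploits.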
CausalRAG: Integrating Causal Graphs into Retrieval-Augmented Generation
2025: Traditional RAG systems face critical limitations including disrupted contextual integrity from text chunking and over-reliance on semantic similarity for retrieval. This paper proposes CausalRAG, a novel framework that incorporates causal graphs into the retrieval process, constructing and tracing cause-effect relationships to preserve contextual continuity and improve retrieval precision. Evaluated against regular RAG and graph-based RAG approaches across multiple metrics including answer faithfulness and context precision, CausalRAG demonstrates consistent superiority, showing that causal grounding is a promising direction for knowledge-intensive tasks.
100-LongBench: Are de facto Long-Context Benchmarks Literally Evaluating Long-Context Ability?
2025: Existing long-context evaluation benchmarks fail to separate long-context performance from a model's baseline ability, making cross-model comparisons unclear, and are typically constructed with fixed input lengths that limit applicability across models with different context windows. This paper introduces 100-LongBench, a length-controllable long-context benchmark with a novel metric that disentangles baseline knowledge from true long-context capability across multiple task categories. Experiments demonstrate that existing benchmarks significantly conflate baseline model strength with genuine long-context ability, revealing a widespread evaluation gap.
Visual Concept Networks: A Graph-Based Approach to Detecting Anomalous Data in Deep Neural Networks
2024: Deep neural networks struggle with robustness against anomalous and out-of-distribution data, and current OOD benchmarks often oversimplify by focusing on single-object tasks. This paper introduces Visual Concept Networks, a graph-based method that converts images into networks of interconnected human-understandable visual concepts and uses topological features to detect both far-OOD and near-OOD data. Extensive testing on two novel complex real-world tasks with ablation studies using large vocabularies demonstrates the method's effectiveness for detecting anomalous data in DNNs.
Taylor Unswift: Secured Weight Release for Large Language Models via Taylor Expansion
2024: Releasing LLM weights poses a dilemma: open-sourcing compromises ownership while closed APIs raise data privacy concerns. This paper introduces TaylorMLP, which protects LLM ownership by transforming weights into Taylor-series parameters that can be released instead of the original weights, and prevents unauthorized use by inducing low-speed token generation through increasing the number of Taylor-series terms. Empirical experiments across five datasets and three LLM architectures demonstrate that TaylorMLP induces over 4× latency increase while producing tokens precisely matching the original models, effectively defending against weight reconstruction from downstream datasets.
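The accuracy-versus-work knob can be illustrated with a plain Taylor series of exp; this shows only the generic mechanism (more terms, more work, better fidelity), not TaylorMLP's actual parameterization:

```python
import math

def taylor_exp(x, n_terms):
    """Approximate exp(x) with the first n_terms of its Taylor series.
    More terms cost more work per evaluation but improve accuracy,
    mirroring the latency/fidelity trade-off described above."""
    total, term = 0.0, 1.0
    for k in range(n_terms):
        total += term
        term *= x / (k + 1)
    return total

coarse = taylor_exp(1.0, 4)   # 1 + 1 + 1/2 + 1/6
fine = taylor_exp(1.0, 20)    # very close to e
```

Releasing series parameters instead of raw weights means a user who wants exact behavior must evaluate many terms, which is where the deliberate slowdown comes from.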
Phase Identification in Synchrotron X-ray Diffraction Patterns of Ti-6Al-4V Using Computer Vision and Deep Learning
2024This research utilizes convolutional neural networks (CNNs) to automate the phase identification of titanium alloys from synchrotron X-ray diffraction (XRD) patterns. By treating XRD patterns as one-dimensional images, the deep learning model achieves high accuracy in distinguishing between alpha and beta phases, significantly reducing the time required for manual analysis in materials characterization.
QGroup: Parallel Quantum Job Scheduling Using Dynamic Programming
2024: Scheduling quantum circuits across multiple QPUs requires efficient algorithms that minimize idle time while respecting hardware constraints. QGroup uses dynamic programming to optimally group and schedule quantum circuits across multiple QPUs, maximizing throughput and minimizing idle time through principled combinatorial optimization. Evaluated on realistic quantum workloads, QGroup achieves improved scheduling efficiency compared to greedy and heuristic-based baseline approaches.
Privacy-Preserving Collaborative Genomic Research: A Real-Life Deployment and Vision
2024: The genomic domain stands to benefit greatly from advances in AI and data science, but increasing privacy and cybersecurity concerns necessitate robust solutions for sensitive collaborative research. This paper presents a practical deployment of a privacy-preserving framework for genomic research developed in collaboration with Lynx.MD, a secure health data collaboration platform, addressing challenges of enabling joint analysis of genomic data while mitigating data breach risks. The framework demonstrates scalable, privacy-preserving data sharing and analysis that maintains utility while satisfying rigorous security requirements in a real production environment.
KV Cache Compression, But What Must We Give in Return? A Comprehensive Benchmark of Long Context Capable Approaches
2024: Long-context capability is critical for LLMs, but transformer architectures face significant challenges due to growing KV cache size and the complexity of attending to extended inputs. This paper provides a comprehensive taxonomy and benchmark evaluation of 10+ state-of-the-art approaches across seven long-context task categories—including KV cache quantization, token dropping, prompt compression, linear-time sequence models, and hybrid architectures—evaluated in a unified, aligned environment. The work reveals numerous previously unknown phenomena and offers a practical workbench and insights for the future development of long-context-capable LLMs.
Knowledge Graphs Can be Learned with Just Intersection Features
2024: Knowledge graph completion can be framed as link prediction where structural information is key, but quantifying this structural information poses a challenge. This paper demonstrates that the intersection among k-hop neighborhoods of the head, relation, and tail is the critical structural signal for valid triple prediction, and proposes a novel randomized algorithm to efficiently generate these intersection features. A straightforward fully-connected network leveraging these features outperforms established KG embedding models and graph neural network baselines, while also achieving substantial training time efficiency gains.
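A minimal sketch of the structural signal on a toy undirected graph; the BFS helper and the graph itself are illustrative, and the paper's randomized feature generation is more involved than this exact computation:

```python
def k_hop_neighborhood(adj, start, k):
    """Set of nodes reachable from `start` in at most k hops (BFS)."""
    frontier, seen = {start}, {start}
    for _ in range(k):
        frontier = {v for u in frontier for v in adj.get(u, ())} - seen
        seen |= frontier
    return seen

# Tiny toy graph as undirected adjacency lists
adj = {"A": ["B", "C"], "B": ["A", "D"], "C": ["A", "D"],
       "D": ["B", "C", "E"], "E": ["D"]}

head_nb = k_hop_neighborhood(adj, "A", 2)
tail_nb = k_hop_neighborhood(adj, "E", 2)
feature = len(head_nb & tail_nb)  # intersection size as a structural feature
```

A large head/tail neighborhood overlap suggests the two entities sit in a shared local structure, which is the signal a simple fully-connected network can then score.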
Unsupervised Segmentation of Knee Bone Marrow Edema-like Lesions Using Conditional Generative Models
2024: This study proposes a novel unsupervised method for the fully automated segmentation of Bone Marrow Edema-like Lesions (BMEL) in knee MRI. By leveraging conditional diffusion models and anomaly detection, the approach eliminates the need for labor-intensive and bias-prone manual annotations. The research sets new benchmarks for BMEL segmentation performance and provides a more reliable, quantitative tool for early diagnosis and prognosis of knee osteoarthritis.
GNNs Also Deserve Editing, and They Need It More Than Once
2024: Model editing—updating specific factual knowledge—has been extensively studied for LLMs but has received little attention for graph neural networks, which present unique challenges due to their relational structure. This paper extends model editing to GNNs, showing that they require iterative multi-round editing to maintain accuracy after knowledge updates, unlike LLMs where single-pass editing is often sufficient. The work proposes efficient multi-round GNN editing methods and demonstrates that both graph structure and node attributes must be carefully managed across editing rounds to prevent knowledge degradation.
An Automated Approach for Improving the Inference Latency and Energy Efficiency of Pretrained CNNs by Removing Irrelevant Pixels with Focused Convolutions
2024: Computer vision CNNs achieve high accuracy but face ever-increasing energy and computation requirements, and making them more energy-efficient typically requires costly retraining. This paper proposes an automated method to improve the inference latency and energy efficiency of pretrained CNNs without retraining, by inserting a threshold layer that identifies irrelevant image regions and replacing subsequent convolutional layers with focused convolutions that ignore those regions entirely. The approach saves inference latency by up to 25% and energy costs by up to 22% on popular pretrained CNNs including ResNet, VGG, and ConvNeXt, with little to no accuracy loss.
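A toy version of the idea, assuming a simple intensity threshold as the "relevance" layer and a 3x3 mean filter in place of a learned convolution (the actual method operates inside pretrained CNNs, not on raw images like this):

```python
import numpy as np

def focused_mean_filter(img, thresh):
    """Apply a 3x3 mean filter only where a threshold 'layer' marks
    the pixel as relevant; irrelevant positions are skipped entirely."""
    mask = img > thresh
    out = np.zeros_like(img, dtype=float)
    h, w = img.shape
    for i in range(1, h - 1):
        for j in range(1, w - 1):
            if mask[i, j]:  # compute only for relevant pixels
                out[i, j] = img[i - 1:i + 2, j - 1:j + 2].mean()
    return out

img = np.zeros((6, 6))
img[2:4, 2:4] = 9.0  # a small bright "object" on a dark background
out = focused_mean_filter(img, thresh=0.5)
```

The savings come from the skipped positions: on images where most pixels are irrelevant background, most of the convolution work is never done.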
Creating Intelligent Cyberinfrastructure for Democratizing AI
2024: This paper provides an overview of the NSF-funded ICICLE AI Institute, which aims to fundamentally advance 'edge-to-center' AI-as-a-Service. By developing intelligent cyberinfrastructure (CI) that spans the edge-cloud-HPC computing continuum, the project seeks to enable plug-and-play AI that is accessible to a wider population. The work highlights high-impact applications in animal ecology, digital agriculture, and smart foodsheds as primary drivers for democratizing next-generation AI.
Materials Data Science Using CRADLE: A Distributed, Data-Centric Approach
2024: This paper introduces CRADLE, a distributed framework designed to support data-centric AI and materials data science at scale. By integrating heterogeneous data management with elastic scaling, CRADLE addresses the challenges of massive datasets generated by modern experiments and simulations. The study demonstrates the framework's capabilities through five applications, including phase identification in X-ray diffraction and defect segmentation in computed tomography, emphasizing scalable and reproducible scientific insights.
Efficient Circuit Wire Cutting Based on Commuting Groups
2024: Current quantum devices face challenges with large circuits due to increasing error rates as circuit size and qubit count grow. Inspired by ancilla-assisted quantum process tomography and MUBs-based grouping for simultaneous measurement, this paper proposes a new circuit wire cutting approach that uses ancillary qubits to transform quantum input initializations into quantum output measurements, allowing multiple measurements to be grouped and executed simultaneously. The technique significantly reduces subcircuit execution overhead and classical reconstruction complexity compared to standard wire cutting.
Winner-Take-All Column Row Sampling for Memory Efficient Adaptation of Language Model
2023: Fine-tuning large pre-trained language models has become increasingly difficult due to extensive memory usage, with the primary bottleneck being the storage of activation feature maps needed for gradient computation. This paper proposes WTA-CRS (Winner-Take-All Column-Row Sampling), a new family of unbiased estimators for matrix products with reduced variance that only requires storing sub-sampled activations for gradient calculation, applied during the backward pass to maintain unbiased gradient estimation. Applied to LLM fine-tuning, WTA-CRS significantly reduces activation memory requirements while maintaining training convergence, enabling adaptation of large models on hardware that would otherwise lack sufficient memory.
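Plain column-row (CR) sampling, the estimator family that WTA-CRS refines, can be sketched as follows; the winner-take-all variance reduction itself is not reproduced here:

```python
import numpy as np

def cr_sample_product(A, B, idx, probs):
    """Unbiased estimate of A @ B from sampled column-row pairs:
    sum over samples of outer(A[:, i], B[i, :]) / (k * p_i)."""
    k = len(idx)
    est = np.zeros((A.shape[0], B.shape[1]))
    for i in idx:
        est += np.outer(A[:, i], B[i, :]) / (k * probs[i])
    return est

rng = np.random.default_rng(2)
A = rng.normal(size=(4, 6))
B = rng.normal(size=(6, 3))

# Sampling every index exactly once under uniform probabilities
# recovers A @ B exactly, confirming the estimator's normalization.
uniform = np.full(6, 1 / 6)
exact = cr_sample_product(A, B, list(range(6)), uniform)
```

Storing only the sampled columns of the activation matrix is what cuts the memory bill, since the full activations never need to be kept for the backward pass.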
Accelerating VQE Algorithms via Parameters and Measurement Reuse
2023 · Variational Quantum Eigensolver algorithms require many quantum circuit executions to converge, creating significant overhead on current quantum hardware. This paper accelerates VQE by reusing parameters and measurement results across iterations, reducing the number of quantum circuit executions required for convergence without sacrificing solution quality. The approach is validated on standard molecular simulation benchmarks, demonstrating meaningful reduction in quantum resource requirements.
Online Detection of Golden Circuit Cutting Points
2023 · Quantum circuit cutting enables large circuits to run on small quantum devices, but reconstructing measurement statistics requires computational resources that grow exponentially with the number of cuts. This paper introduces the concept of a golden cutting point: a circuit structure that induces negligible basis components during reconstruction, allowing those downstream computations to be avoided entirely. A hypothesis-testing scheme is proposed for online detection of golden cutting points, with robustness guarantees against low-probability test failures; on Qiskit's Aer simulator, identifying and skipping the resulting obsolete measurements reduces reconstruction wall time.
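The online-detection idea can be sketched as a one-sided test: after observing a candidate basis component for some number of shots, declare the cut "golden" only if an upper confidence bound on the component's true probability falls below a negligibility threshold. The rule below (a Hoeffding-style bound, with hypothetical parameter names `eps` and `alpha`) illustrates the flavor of such a test, not the paper's exact scheme.

```python
import math

def is_golden(k, n, eps=0.01, alpha=0.05):
    """Declare a basis component negligible ('golden') if its observed
    frequency k/n plus a one-sided Hoeffding confidence margin stays
    below the threshold eps, at confidence level 1 - alpha."""
    ucb = k / n + math.sqrt(math.log(1 / alpha) / (2 * n))
    return ucb <= eps
```

With enough shots and zero observations the bound tightens below `eps` and the downstream reconstruction terms for that component can be skipped; a frequently observed component is never declared golden.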
One Less Reason for Filter Pruning: Gaining Free Adversarial Robustness with Structured Grouped Kernel Pruning
2023 · Structured pruning has traditionally been viewed as trading accuracy for efficiency, often assumed to come at the expense of adversarial robustness. This paper reveals that structured grouped kernel pruning inherently confers adversarial robustness as a byproduct, without any adversarial training, showing that pruning and robustness are not competing objectives but complementary ones. By demonstrating one less reason to avoid filter pruning, the work shows practitioners can gain free adversarial robustness simply by adopting structured grouped kernel pruning as their compression strategy.
Accelerating Time to Science using CRADLE: A Framework for Materials Data Science
2023 · Accelerating materials data science requires scalable frameworks that can manage heterogeneous data and computation across distributed systems. This paper presents CRADLE, a distributed data-centric framework for materials data science workflows that integrates data management, computation, and analysis pipelines to significantly reduce time-to-science. Presented at the 30th IEEE HiPC, CRADLE demonstrates substantial throughput improvements and workflow simplification for materials characterization and discovery tasks in HPC environments.
Quantum Noise in the Flow of Time: A Temporal Study of the Noise in Quantum Computers
2022 · Quantum noise in quantum computers is not static but evolves over time, yet most error characterization treats noise as temporally fixed. This paper conducts a temporal study of noise characteristics in quantum computers, revealing how quantum noise patterns change over time and analyzing the implications for circuit fidelity and error mitigation strategies. The findings provide insights for developing more effective time-aware calibration and error mitigation approaches for near-term quantum hardware.
Pinpointing the System Reliability Degradation in NISQ Machines
2022 · Noise in quantum hardware causes significant reliability degradation in NISQ machines, but the systematic patterns of this degradation are not well understood. This paper investigates the sources and temporal patterns of reliability degradation in NISQ machines, identifying when and where noise causes significant performance drops in quantum circuits. The analysis provides guidance for developing error mitigation strategies targeted at the most impactful reliability degradation patterns in near-term quantum hardware.
MARS: Malleable Actor-Critic Reinforcement Learning Scheduler
2022 · Scheduling jobs in HPC environments requires handling dynamic, time-varying workloads that challenge static scheduling policies. MARS (Malleable Actor-Critic Reinforcement Learning Scheduler) uses malleable actor-critic RL to adaptively schedule computing jobs, dynamically resizing allocations in response to changing workloads to optimize throughput and resource utilization. Evaluated against standard scheduling baselines, MARS demonstrates consistent improvements in job completion time and cluster utilization across varied workload scenarios.
Irrelevant Pixels are Everywhere: Find and Exclude Them for More Efficient Computer Vision
2022 · CNNs are compute-intensive because they indiscriminately compute features on all pixels of an input image, yet many pixels are irrelevant to the vision task at hand. This paper demonstrates through analysis of three popular computer vision datasets that approximately 48% of pixels are irrelevant, and proposes the focused convolution, a drop-in CNN replacement that operates only on relevant pixels identified by an area of interest mask. On an embedded device, the approach achieves no loss in accuracy while reducing inference latency, energy consumption, and multiply-add count by approximately 45%.
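The core idea of a mask-guided convolution can be shown in a few lines: compute outputs only at positions the area-of-interest mask marks as relevant, skipping the rest. This is a naive single-channel NumPy sketch of the concept (function name and layout are illustrative), not the paper's optimized implementation.

```python
import numpy as np

def focused_conv2d(img, kernel, mask):
    """Valid-mode 2D convolution computed only where mask is True.
    Masked-out output positions are left at zero and cost no
    multiply-adds, mirroring the 'focused convolution' idea."""
    kh, kw = kernel.shape
    h, w = img.shape
    out = np.zeros((h - kh + 1, w - kw + 1))
    # Only iterate over output positions inside the area of interest
    ys, xs = np.nonzero(mask[: h - kh + 1, : w - kw + 1])
    for y, x in zip(ys, xs):
        out[y, x] = np.sum(img[y : y + kh, x : x + kw] * kernel)
    return out
```

If roughly half the mask is False, roughly half the multiply-adds disappear, which is the source of the reported latency and energy savings.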
A Unified Framework to Assess Market Implications of Institutional Investments
2022 · Understanding the market implications of large institutional investment decisions requires modeling complex interactions between institutional behavior and market dynamics. This paper presents a unified framework using machine learning and statistical analysis to assess how large-scale institutional investment decisions affect market prices, volatility, and liquidity. The approach integrates multiple data sources to provide a comprehensive assessment of market implications across different investment types and market conditions.
Practical Implications of Dequantization on Machine Learning Algorithms
2022 · Quantum computing algorithms offer theoretical speedups for certain machine learning tasks, but dequantization results show that classical algorithms can sometimes achieve comparable performance. This paper examines the practical implications of dequantization on machine learning algorithms, providing a systematic analysis of when quantum approaches offer genuine advantages versus when classical alternatives are sufficient. The work offers guidance for practitioners on determining which ML tasks are promising candidates for quantum speedup versus those where dequantization renders quantum approaches redundant.
System-Auditing, Data Analysis and Characteristics of Cyber Attacks for Big Data System
2022 · Big data distributed computing systems such as Apache Hadoop must process massive amounts of data to support business and research applications, so ensuring their cyber security is critical. To better defend against advanced cyber attacks that threaten even well-protected enterprises, system-auditing techniques have been adopted for monitoring system activities and assisting attack investigation. This demo presents a system that collects system-auditing logs from a big data system and analyzes them to understand how auditing can more effectively assist attack investigation on big systems. A companion demo application detects unexpected file deletions and presents the root causes of each deletion.
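Root-cause analysis over audit logs boils down to walking causal links (process spawned process, process touched file) backwards from the suspicious event. The toy records and field names below are invented for illustration; real audit frameworks such as Linux auditd emit far richer events, but the backward traversal looks the same in spirit.

```python
# Hypothetical audit records: each execve/unlink event with its process ids
events = [
    {"pid": 101, "syscall": "execve", "target": "/usr/bin/cleanup.sh", "ppid": 1},
    {"pid": 102, "syscall": "execve", "target": "/bin/rm", "ppid": 101},
    {"pid": 102, "syscall": "unlink", "target": "/data/hdfs/block_0042", "ppid": 101},
]

def deletion_root_cause(events, path):
    """Walk parent-process links backwards from an unlink event,
    returning the causal chain of targets that led to the deletion."""
    unlink = next(e for e in events
                  if e["syscall"] == "unlink" and e["target"] == path)
    chain, pid = [unlink], unlink["pid"]
    while True:
        parent = next((e for e in events
                       if e["syscall"] == "execve" and e["pid"] == pid), None)
        if parent is None:
            break
        chain.append(parent)
        pid = parent["ppid"]
    return [e["target"] for e in reversed(chain)]
```

Here the chain surfaces that a cleanup script spawned `rm`, which deleted the data block, exactly the kind of provenance an investigator needs.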
Approximate Quantum Circuit Reconstruction
2022 · Current and imminent quantum hardware lacks reliability due to noise and limited qubit counts, and quantum circuit cutting, which divides large circuits into smaller subcircuits, faces exponential classical post-processing overhead. This paper introduces approximate circuit reconstruction using a sampling-based method (MCMC) to probabilistically select high-probability bit strings during reconstruction, avoiding excessive calculations for the full probability distribution. Results show that this sampling-based post-processing holds great potential for fast and reliable circuit reconstruction in the NISQ era and beyond.
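The sampling idea can be illustrated with a Metropolis chain over bit strings: propose single-bit flips and accept them by the ratio of unnormalized probabilities, so the chain spends its time on high-probability outcomes without ever enumerating all 2^n strings. This is a generic MCMC sketch of that principle (function and parameter names are illustrative), not the paper's reconstruction pipeline.

```python
import random

def mcmc_bitstrings(unnorm_p, n_bits, steps=5000, seed=0):
    """Metropolis sampler over n-bit strings with single-bit-flip
    proposals, drawing from an unnormalized distribution unnorm_p."""
    rng = random.Random(seed)
    state = [0] * n_bits
    p_cur = unnorm_p(state)
    samples = []
    for _ in range(steps):
        cand = state.copy()
        cand[rng.randrange(n_bits)] ^= 1  # flip one bit
        p_new = unnorm_p(cand)
        # Accept with probability min(1, p_new / p_cur)
        if p_cur == 0 or rng.random() < min(1.0, p_new / p_cur):
            state, p_cur = cand, p_new
        samples.append(tuple(state))
    return samples
```

Against a distribution sharply peaked on one outcome, the chain quickly finds and stays near that bit string, which is what makes sampling-based reconstruction cheap relative to full enumeration.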
TQEA: Temporal Quantum Error Analysis
2021 · Quantum errors in NISQ hardware vary temporally, but most error analysis tools treat noise as time-invariant. TQEA (Temporal Quantum Error Analysis) characterizes how quantum errors evolve over time by systematically measuring and modeling the temporal dynamics of noise in quantum computers. The framework provides insights for improving error mitigation strategies that account for drift and time-varying noise characteristics, supporting progress toward more reliable quantum computing.
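A minimal way to see why time-aware analysis matters: track a rolling baseline of measured error rates and flag calibration cycles that deviate sharply from the recent window. The z-score rule below is a toy drift detector under assumed parameters (`window`, `z_thresh`), not TQEA's actual methodology.

```python
import statistics

def detect_drift(error_rates, window=5, z_thresh=3.0):
    """Flag each measurement whose error rate deviates from the mean of
    the trailing window by more than z_thresh standard deviations."""
    flags = []
    for t in range(window, len(error_rates)):
        hist = error_rates[t - window : t]
        mu, sd = statistics.mean(hist), statistics.pstdev(hist)
        flags.append(sd > 0 and abs(error_rates[t] - mu) > z_thresh * sd)
    return flags
```

A mitigation pipeline could use such flags to trigger recalibration instead of relying on a stale, time-invariant noise model.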
A Predictive Analytics Framework for Multi-Horizon Financial Crises Forecasting using Macro-Economic Data
2021 · Predictive analytics framework for multi-horizon financial crisis forecasting using macroeconomic data and ML to provide early warning signals.
With or Without Total Knee Arthroplasty? A Deep Learning-powered Strategy to Detect TKA in Plain Radiographs
2020 · Deep learning approach for automatically detecting TKA implants in plain radiographs, enabling efficient large-scale retrospective analysis of orthopedic registries.
Towards Performant Workflows, Monitoring and Measuring
2020 · Scientific HPC workflows require robust monitoring and measurement infrastructure to understand performance characteristics and enable optimization. This paper presents approaches for building performant scientific workflows with integrated monitoring and measurement, enabling better characterization and optimization of HPC workflow performance across distributed computing environments. The work provides practical methodologies for workflow developers to identify bottlenecks and improve end-to-end throughput.
A Predictive Analytics Framework for Insider Trading Events
2020 · Detecting and forecasting insider trading events using traditional methods is limited by their reliance on predefined rules and inability to capture subtle market signals. This paper presents a predictive analytics framework for insider trading events using machine learning applied to financial transaction data and market signals, demonstrating the ability to identify patterns predictive of insider trading. The approach provides early warning capabilities that can complement traditional regulatory surveillance methods.
Research Staff
PhD Students
Alan Luo
PhD Student
Andrew Yu
PhD Student
Biyao Zhang
PhD Student
Chaoda Song
PhD Student
Cheng Guo
PhD Student
Debargha Ganguly
PhD Student
Jierui Peng
PhD Student
Nahal Shahani
PhD Student
Nengbo Wang
PhD Student
Shouren Wang
PhD Student
Srihari Sankar
PhD Student
Thomas Zhang
PhD Student
Vikash Singh
PhD Student
Vinooth Kulkarni
PhD Student
Wang (Van) Yang
PhD Student
Xinpeng Li
PhD Student
Yanyan Zhang
PhD Student
Yuting Shao
PhD Student
Zahra Rahmani
PhD Student