Education
- Ph.D. in Computer Science, Case Western Reserve University, Cleveland, OH, USA (May 2024 – Present)
- M.S. in Computer Science, Case Western Reserve University, Cleveland, OH, USA (Aug 2023 – May 2024)
- B.Tech. in Civil Engineering, Minor in Computer Science, Indian Institute of Technology Mandi, Himachal Pradesh, India (Aug 2019 – May 2023)
Trust the Typical
2026: Current approaches to LLM safety rely on a brittle pattern of identifying and blocking known threats via guardrails. This paper introduces Trust The Typical (T3), a framework that reframes safety as an out-of-distribution detection problem: it learns the distribution of acceptable prompts in a semantic space and flags significant deviations as potential threats. Unlike prior methods, T3 requires no training on harmful examples yet achieves state-of-the-art performance across 18 benchmarks spanning toxicity, jailbreaking, multilingual harms, and over-refusal, reducing false positive rates by up to 40× relative to specialized safety models. A single model trained on safe English text transfers effectively to over 14 languages without retraining.
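The out-of-distribution formulation can be illustrated with a generic typicality score: embed known-safe prompts, score a new prompt by its mean distance to its k nearest safe neighbors, and flag scores above a threshold calibrated on held-out safe data. This is a minimal sketch under assumed details (Euclidean distance, mean-of-k scoring, 99th-percentile threshold), not the paper's exact method:

```python
import numpy as np

def knn_typicality_scores(reference, queries, k=5):
    """Score each query by its mean distance to the k nearest reference
    embeddings; higher means less typical (more likely out-of-distribution)."""
    d = np.linalg.norm(queries[:, None, :] - reference[None, :, :], axis=-1)
    return np.sort(d, axis=1)[:, :k].mean(axis=1)

rng = np.random.default_rng(0)
safe = rng.normal(0.0, 1.0, size=(500, 16))   # stand-in embeddings of safe prompts
reference, calib = safe[:400], safe[400:]     # reference set and calibration split

# Calibrate the flagging threshold on held-out safe prompts only:
# no harmful examples are needed anywhere in the pipeline.
threshold = np.quantile(knn_typicality_scores(reference, calib), 0.99)

scores_in = knn_typicality_scores(reference, rng.normal(0.0, 1.0, size=(20, 16)))
scores_out = knn_typicality_scores(reference, rng.normal(6.0, 1.0, size=(20, 16)))
```

On this synthetic data, the shifted "atypical" queries score far above the calibrated threshold while typical queries mostly stay below it, mirroring the flag-the-deviation idea without ever modeling harm directly.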
K4: Online Log Anomaly Detection via Unsupervised Typicality Learning
2025: Existing log anomaly detection methods are often slow, dependent on error-prone parsing, and evaluated under unrealistic protocols. This paper introduces K4 (Knowing the Unknown by Knowing only the Known), a fully unsupervised, parser-independent framework that transforms arbitrary log embeddings into compact four-dimensional descriptors (Precision, Recall, Density, Coverage) using efficient k-nearest-neighbor statistics. Under a realistic online chunk-based evaluation protocol, K4 achieves state-of-the-art AUROC of 0.995–0.999 across the HDFS, BGL, and Thunderbird datasets, with training in under 4 seconds and inference as fast as 4 μs.
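Two of the four descriptors, density and coverage, can be sketched with standard kNN-ball statistics: a chunk of log embeddings is "typical" if its points fall inside many kNN balls of the normal reference set, and if it touches most of that reference set. Everything below (distance metric, k, the synthetic data) is an illustrative assumption, not taken from the paper:

```python
import numpy as np

def knn_radii(X, k=3):
    """Distance from each point in X to its k-th nearest neighbor within X."""
    d = np.linalg.norm(X[:, None, :] - X[None, :, :], axis=-1)
    return np.sort(d, axis=1)[:, k]  # column 0 is the self-distance (0)

def density(X, Y, k=3):
    """Per-query count of reference kNN balls it falls into, normalized by k."""
    r = knn_radii(X, k)
    d = np.linalg.norm(Y[:, None, :] - X[None, :, :], axis=-1)  # |Y| x |X|
    return (d <= r[None, :]).sum(axis=1) / k

def coverage(X, Y, k=3):
    """Fraction of reference points whose kNN ball contains at least one query."""
    r = knn_radii(X, k)
    d = np.linalg.norm(Y[:, None, :] - X[None, :, :], axis=-1)
    return (d <= r[None, :]).any(axis=0).mean()

rng = np.random.default_rng(1)
logs = rng.normal(size=(200, 8))                    # embeddings of "normal" logs
normal_chunk = rng.normal(size=(40, 8))             # online chunk, same distribution
anomalous_chunk = rng.normal(5.0, 1.0, size=(40, 8))  # shifted (anomalous) chunk

dens_normal, dens_anom = density(logs, normal_chunk), density(logs, anomalous_chunk)
cov_normal, cov_anom = coverage(logs, normal_chunk), coverage(logs, anomalous_chunk)
```

Normal chunks land inside the reference kNN balls (high density and coverage), while the shifted chunk misses them almost entirely, which is the kind of signal a typicality-based detector can threshold without any labeled anomalies.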
Grammars of Formal Uncertainty: When to Trust LLMs in Automated Reasoning Tasks
2025: Large language models show remarkable promise for automated reasoning by generating formal specifications, but a fundamental tension exists between their probabilistic nature and the deterministic guarantees required by formal verification. This paper comprehensively investigates failure modes and uncertainty quantification in LLM-generated formal artifacts, revealing that SMT-based autoformalization has highly domain-specific accuracy impacts, ranging from +34.8% on logical tasks to −44.5% on factual ones. A probabilistic context-free grammar (PCFG) framework is introduced to model LLM outputs and yield a refined uncertainty taxonomy, finding that uncertainty signals are task-dependent (for example, grammar entropy for logic achieves AUROC > 0.93).
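Grammar entropy as an uncertainty signal can be sketched as the Shannon entropy of the production-rule distribution a nonterminal exhibits across repeated samples: if the model emits the same formal fragment every time, entropy is zero; if it scatters across alternatives, entropy is high. The example productions and the per-nonterminal framing here are hypothetical, not the paper's exact estimator:

```python
import math
from collections import Counter

def rule_entropy(productions):
    """Shannon entropy (bits) of the empirical production-rule distribution
    for one nonterminal, estimated from rules observed across samples."""
    counts = Counter(productions)
    total = sum(counts.values())
    return -sum((c / total) * math.log2(c / total) for c in counts.values())

# Hypothetical productions for one SMT assertion slot, observed across
# 8 sampled formalizations of the same natural-language statement.
consistent = ["(> x 0)"] * 8                                   # always the same rule
scattered = ["(> x 0)", "(>= x 0)", "(> x 1)", "(not (< x 0))"] * 2

low_h = rule_entropy(consistent)   # no uncertainty: 0.0 bits
high_h = rule_entropy(scattered)   # uniform over 4 rules: 2.0 bits
```

A verifier-facing pipeline could then trust low-entropy formalizations and route high-entropy ones to human review, which is the when-to-trust question the paper studies.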
Efficient Fine-Grained GPU Performance Modeling for Distributed Deep Learning of LLM
2025: Training large language models is one of the most compute-intensive tasks in HPC, and predicting end-to-end training time for multi-billion-parameter models across hundreds of GPUs is challenging due to complex interactions between transformer components, parallelism strategies, and multi-tier communication. This paper decomposes LLMs into core computational primitives and models them with operator-level decomposition, lightweight hardware-aware prediction models for key operations, and an end-to-end prediction system that integrates these across complex parallelization strategies. The resulting framework enables accurate distributed LLM training performance prediction without costly full-scale sampling.
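The operator-level idea can be sketched by composing per-primitive cost estimates: break one transformer layer into a few matmul-dominated operators, predict each with a simple hardware-aware model, and sum. The roofline-style cost model, the hardware peaks, and the efficiency factor below are all made-up assumptions for illustration, not the paper's fitted models:

```python
# Assumed hardware characteristics (illustrative, roughly A100-class fp16).
PEAK_FLOPS = 312e12   # peak compute throughput, FLOP/s
PEAK_BW = 1.6e12      # peak memory bandwidth, bytes/s

def op_time(flops, bytes_moved, efficiency=0.6):
    """Roofline-style estimate: an operator is bound by either compute
    (derated by an assumed efficiency) or memory traffic."""
    return max(flops / (PEAK_FLOPS * efficiency), bytes_moved / PEAK_BW)

def layer_forward_time(batch, seq, hidden, dtype_bytes=2):
    """Sum predicted times of the core primitives in one transformer layer."""
    tokens = batch * seq
    ops = {
        # (FLOPs, weight bytes moved) per primitive
        "qkv_proj": (2 * tokens * hidden * 3 * hidden, 3 * hidden * hidden * dtype_bytes),
        "attention": (4 * batch * seq * seq * hidden, 2 * batch * seq * seq * dtype_bytes),
        "out_proj": (2 * tokens * hidden * hidden, hidden * hidden * dtype_bytes),
        "mlp":      (2 * tokens * hidden * 8 * hidden, 8 * hidden * hidden * dtype_bytes),
    }
    return sum(op_time(f, b) for f, b in ops.values())

t = layer_forward_time(batch=8, seq=2048, hidden=4096)  # seconds, one layer forward
```

A full end-to-end predictor would extend this by scaling across layers, adding backward-pass and communication terms per parallelism strategy, and fitting the per-operator models to measured microbenchmarks instead of analytic peaks.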