VoskanyanMichael_MCS_2026

Title VoskanyanMichael_MCS_2026
Alternative Title Vulnerability-Preserving Program Reduction for Common Vulnerabilities and Exposured using LLM-Based Reduction
Creator Voskanyan, Michael
Contributors Christi, Arpit (advisor); Al-Gahmi, Abdulmalek (advisor); Valle, Hugo (advisor)
Collection Name Master of Computer Science
Abstract Program debugging is challenging, time-consuming, and tedious because developers often spend substantial effort simplifying and isolating the failure-inducing portion of a program. To support this process, many program and test reduction techniques have been proposed to automatically minimize failure-inducing inputs while preserving the observed fault. In the security setting, however, reduction must preserve vulnerability-relevant behavior rather than merely produce a smaller artifact. Recent work has explored the use of large language models (LLMs) to assist program reduction, but such techniques have not been widely evaluated; for vulnerability-preserving reduction. This thesis evaluates LVulnReducer (LVR), an HDD-first, LLM-assisted reduction pipeline for vulnerability-inducing programs, on 25 Common Vulnerabilities and Exposures (CVE) artifacts drawn from the San2Patch; benchmark. Reduction quality is evaluated relative to C-Reduce using Clang AST-based statement comparison, while reduction effectiveness is measured using source lines removed and percentage reduction. The results show that LVulnReducer substantially reduces vulnerable programs while preserving benchmark-specific exploit evidence. On average, LVR removes 3,321.32 source lines per artifact, corresponding to an average reduction of 94.62%. In comparison to the C-Reduce reference reductions, LVR achieves an average precision of 99.66% and an average recall of 97.16%, indicating that its reductions remain structurally close; to those produced by C-Reduce. Although C-Reduce still produces smaller final artifacts overall, the results show that LLM-assisted reduction can meaningfully improve on lightweight hierarchical mechanical reduction and approximate the behavior of a mature reduction baseline in the vulnerability-preserving setting.
Subject Computer science; Large language models; Computer programming-Reduction; Natural language processing (Computer science)
Digital Publisher Digitized by Special Collections & University Archives, Stewart Library, Weber State University.
Date 2026-04
Medium theses
Type Text
Access Extent 37 page pdf
Conversion Specifications Adobe Acrobat
Language eng
Rights The author has granted Weber State University Archives a limited, non-exclusive, royalty-free license to reproduce his or her thesis, in whole or in part, in electronic or paper form and to make it available to the general public at no charge. The author retains all other rights. For further information: IN COPYRIGHT - EDUCATIONAL USE PERMITTED
Source University Archives Electronic Records: Master of Computer Science. Stewart Library, Weber State University
OCR Text Show
Format application/pdf
ARK ark:/87278/s6qz825m
Setname wsu_smt
ID 166263
Reference URL https://digital.weber.edu/ark:/87278/s6qz825m