Conference Talks
ACM Student Reasearch Competition: Undergraduate Poster
Statistical Prediction of Lossy Compression Ratios for 3D Scientific Data
November 2022  SlidesPoster
Abstract: In the fields of science and engineering, lossy compression plays a growing role in running scientific simulations, as output data is on the scale of terabytes. Using error bounded lossy compression reduces the amount of storage for each simulation; however, there is no known bound for the upper limit of lossy compressibility. Data correlation structures, compressors and error bounds are factors allowing larger compression ratios and improved quality metrics. This provides one direction towards quantifying lossy compressibility. Our previous work explored 2D statistical methods to characterize the data correlation structures and their relationships, through functional models, to compression ratios and quality metrics for 2D scientific data. In this poster, we explore the expansion of our statistical methods to 3D scientific data. The method was comparable to 2D. Our work is the next step towards evaluating the theoretical limits of lossy compressibility used to predict compression performance and optimally adapt compressors.
7th International Workshop on Data Analysis and Reduction for Big Scientific Data
Exploring Lossy Compressibility through Statistical Correlations of Scientific Datasets
November 2021  SlidesPaper
Abstract: Lossy compression plays a growing role in scientific simulations where the cost of storing their output data can span terabytes. Using error bounded lossy compression reduces the amount of storage for each simulation; however, there is no known bound for the upper limit on lossy compressibility. Correlation structures in the data, choice of compressor and error bound are factors allowing larger compression ratios and improved quality metrics. Analyzing these three factors provides one direction towards quantifying lossy compressibility. As a first step, we explore statistical methods to characterize the correlation structures present in the data and their relationships, through functional regression models, to compression ratios. We observed a relationship between compression ratios and several statistics summarizing the correlation structure of the data, which is a first step towards evaluating the theoretical limits of lossy compressibility used to eventually predict compression performance and adapt compressors to correlation structures present in the data.

Invited Talks
Constellation Workshop
Processing in Memory Execution Targets for Higher Level Languages
July 2023  Slides
Abstract: The Von-Neumann architecture has been in common-place for the last few decades. Much of the execution time on these systems is spent on memory transactions. This bottleneck can be lessened by utilizing processors in memory (PIM). Utilizing a data-centric computing model, we are expected to have speedups and reduction in energy consumption for workloads that can utilize fine grain threading. This work explores decision-support query (e.g., TPC-H) implementation on UPMEM. Future work explores targeting UPMEM using the Village toolchain.
Clemson CECAS Invited Talks
What's the deal with this 'Grad School' thing? (A guide to navigating options for graduate school as an undergrad)
December 2022  Slides
Abstract: Are you interested in graduate school as an option after graduation? Would you like to learn more about the application process, choosing your advisor, and what graduate school could mean for you? Attending graduate school in the field of Computer Science or ECE can help you stand out from others in the job market, specialize for a particular sub-field, or conduct research at the bleeding edge of your area-of-interest! If you would like to learn more and hear about other’s experiences in grad school, please attend this talk and Q&A session.
Summer Argonne Students Symposium (SASSy)
Statistical Prediction of Lossy Compression Ratios for 3D Scientific Data
July 2022  Slides