Conference Talks
ACM Student Reasearch Competition: Undergraduate Poster
Statistical Prediction of Lossy Compression Ratios for 3D Scientific Data
Abstract: In the fields of science and engineering, lossy compression plays a growing role in running scientific simulations, as output data is on the scale of terabytes. Using error bounded lossy compression reduces the amount of storage for each simulation; however, there is no known bound for the upper limit of lossy compressibility. Data correlation structures, compressors and error bounds are factors allowing larger compression ratios and improved quality metrics. This provides one direction towards quantifying lossy compressibility. Our previous work explored 2D statistical methods to characterize the data correlation structures and their relationships, through functional models, to compression ratios and quality metrics for 2D scientific data. In this poster, we explore the expansion of our statistical methods to 3D scientific data. The method was comparable to 2D. Our work is the next step towards evaluating the theoretical limits of lossy compressibility used to predict compression performance and optimally adapt compressors.
7th International Workshop on Data Analysis and Reduction for Big Scientific Data
Exploring Lossy Compressibility through Statistical Correlations of Scientific Datasets
Abstract: Lossy compression plays a growing role in scientific simulations where the cost of storing their output data can span terabytes. Using error bounded lossy compression reduces the amount of storage for each simulation; however, there is no known bound for the upper limit on lossy compressibility. Correlation structures in the data, choice of compressor and error bound are factors allowing larger compression ratios and improved quality metrics. Analyzing these three factors provides one direction towards quantifying lossy compressibility. As a first step, we explore statistical methods to characterize the correlation structures present in the data and their relationships, through functional regression models, to compression ratios. We observed a relationship between compression ratios and several statistics summarizing the correlation structure of the data, which is a first step towards evaluating the theoretical limits of lossy compressibility used to eventually predict compression performance and adapt compressors to correlation structures present in the data.
Invited Talks
Constellation Workshop
Processing in Memory Execution Targets for Higher Level Languages
July 2023 Slides
Abstract:
The Von-Neumann architecture has been in common-place for the last few decades.
Much of the execution time on these systems is spent on memory transactions. This
bottleneck can be lessened by utilizing processors in memory (PIM). Utilizing a data-centric
computing model, we are expected to have speedups and reduction in energy consumption
for workloads that can utilize fine grain threading. This work explores decision-support
query (e.g., TPC-H) implementation on UPMEM. Future work explores targeting UPMEM
using the Village toolchain.
Clemson CECAS Invited Talks
What's the deal with this 'Grad School' thing? (A guide to navigating options for graduate school as an undergrad)
December 2022 Slides
Abstract:
Are you interested in graduate school as an option after graduation? Would you
like to learn more about the application process, choosing your advisor, and
what graduate school could mean for you? Attending graduate school in the
field of Computer Science or ECE can help you stand out from others in the job
market, specialize for a particular sub-field, or conduct research at the bleeding
edge of your area-of-interest! If you would like to learn more and hear about
other’s experiences in grad school, please attend this talk and Q&A session.
Summer Argonne Students Symposium (SASSy)
Statistical Prediction of Lossy Compression Ratios for 3D Scientific Data
July 2022 Slides