Home
News
Research
Publications
Teaching
Contact
CV
Light
Dark
Automatic
papers
Opportunistic Query Execution on SmartNICs for Analyzing In-Transit Data
High-performance computing (HPC) systems researchers have proposed using current, programmable network interface cards (or SmartNICs) …
Jianshen Liu
,
Carlos Maltzahn
,
Craig Ulmer
Cite
Project
Project
Project
Skyhook: Towards an Arrow-Native Storage System
With the ever-increasing dataset sizes, several file formats such as Parquet, ORC, and Avro have been developed to store data …
Jayjeet Chakraborty
,
Ivo Jimenez
,
Sebastiaan Alvarez Rodriguez
,
Alexandru Uta
,
Jeff LeFevre
,
Carlos Maltzahn
PDF
Cite
Project
Project
Project
Project
Zero-Cost, Arrow-Enabled Data Interface for Apache Spark
Distributed data processing ecosystems are widespread and their components are highly specialized, such that efficient interoperability …
Sebastiaan Alvarez Rodriguez
,
Jayjeet Chakraborty
,
Aaron Chu
,
Ivo Jimenez
,
Jeff LeFevre
,
Carlos Maltzahn
,
Alexandru Uta
PDF
Cite
Zero-Cost, Arrow-Enabled Data Interface for Apache Spark
Distributed data processing ecosystems are widespread and their components are highly specialized, such that efficient interoperability …
Sebastiaan Alvarez Rodriguez
,
Jayjeet Chakraborty
,
Aaron Chu
,
Ivo Jimenez
,
Jeff LeFevre
,
Carlos Maltzahn
,
Alexandru Uta
PDF
Cite
Project
Project
Performance Characteristics of the BlueField-2 SmartNIC
High-performance computing (HPC) researchers have long envisioned scenarios where application workflows could be improved through the …
Jianshen Liu
,
Carlos Maltzahn
,
Craig Ulmer
,
Matthew Leon Curry
PDF
Cite
Project
Project
Project
Project
Towards an Arrow-native Storage System
With the ever-increasing dataset sizes, several file formats like Parquet, ORC, and Avro have been developed to store data efficiently …
Jayjeet Chakraborty
,
Ivo Jimenez
,
Sebastiaan Alvarez Rodriguez
,
Alexandru Uta
,
Jeff LeFevre
,
Carlos Maltzahn
PDF
Cite
Project
Project
Project
Project
Enabling seamless execution of computational and data science workflows on HPC and cloud with the Popper container-native automation engine
The problem of reproducibility and replication in scientific research is quite prevalent to date. Researchers working in fields of …
Jayjeet Chakraborty
,
Carlos Maltzahn
,
Ivo Jimenez
PDF
Cite
Project
Mapping Scientific Datasets to Programmable Storage
Access libraries such as ROOT and HDF5 allow users to interact with datasets using high level abstractions, like coordinate systems and …
Aaron Chu
,
Jeff LeFevre
,
Carlos Maltzahn
,
Aldrin Montana
,
Peter Alvaro
,
Dana Robinson
,
Quincey Koziol
PDF
Cite
Project
Project
Project
Project
The CROSS Incubator: A Case Study for funding and training RSEs
The incubator and research projects sponsored by the Center for Research in Open Source Software (CROSS, cross.ucsc.edu) at UC Santa …
Stephanie Lieggi
,
Ivo Jimenez
,
Jeff LeFevre
,
Carlos Maltzahn
PDF
Cite
Project
Slides
Scale-out Edge Storage Systems with Embedded Storage Nodes to Get Better Availability and Cost-Efficiency At the Same Time
In the resource-rich environment of data centers most failures can quickly failover to redundant resources. In contrast, failure in …
Jianshen Liu
,
Matthew Leon Curry
,
Carlos Maltzahn
,
Philip Kufeldt
PDF
Cite
Project
Slides
»
Cite
×