Jeff LeFevre
Adjunct Professor, Computer Science and Engineering
Jack Baskin School of Engineering
University of California, Santa Cruz
Office: Engineering 2, room 541A

My research interests are in cloud databases, database physical design, and storage systems. I currently lead the Skyhook Data Management project as part of the Center for Research on Open Source Software at UC Santa Cruz. SkyhookDM takes a 'programmable storage' approach that extends opens source software Apache Arrow and Ceph distributed object storage toward in-storage data processing and management through Ceph's built-in extensions framework ('cls'). Our extensions embed Arrow libraries within Ceph objects, enabling data processing functions as well as physical design manipulations of local object data such as data layouts or indexing. I also collaborate with the Systems Research Lab on the larger programmable storage effort. Through CROSS and CERN-HSF organizations, the Skyhook project has been a participant in Google Summer of Code 2019-2021.

I received my PhD in June 2014 from UC Santa Cruz Database group and subsequently joined Hewlett Packard Big Data R&D (Vertica database) where I worked on integrating Vertica with external analtyics engines such as Distributed-R and Apache Spark. At UC Santa Cruz my PhD advisor was Neoklis Polyzotis and my PhD thesis title is "Physical design tuning methods for emerging system architectures". My thesis (abstract) introduces new physical design methods for databases in the cloud. Specifically I address RDBMS, Hadoop, and hybrid 'multistore' (combined RDBMS + Hadoop co-processing) system architectures.

Previously, I received my MS from the University of California, San Diego in the Systems and Networking Group. My MS advisor was Walt Burkhard and my MS thesis title is "Improving disk array performance and reliability", which introduces a data layout and scheduling policy for RAID arrays. I received a BS in Computer Science & Engineering from the University of South Florida, where I did research on unique encodings for DNA languages. During graduate school I spent several summers at NEC Labs working on CloudDB in the Data Management group, at Google in the Platforms Storage group, and at Teradata in the Virtual Storage Architecture group.

Current CV.

Google Scholar profile

Courses Taught

Other Teaching

Professional Service