Jeff LeFevre, PhD
Adjunct Professor
Department of Computer Science and Engineering
University of California, Santa Cruz
Office: Engineering 2, room 541A

My research interests are in cloud databases, database physical design, and storage systems. I lead the Skyhook Data Management project as part of the Center for Research on Open Source Software at UC Santa Cruz, where I was awarded a CROSS Incubator Fellowship. SkyhookDM takes a 'programmable storage' approach that extends the open source software Apache Arrow and Ceph distributed object storage toward in-storage data processing and management through Ceph's built-in extensions framework ('cls'). Our extensions embed Arrow libraries within Ceph objects, enabling data processing functions as well as physical design manipulations of local object data, such as data layouts or indexing.

I am pleased to announce that Skyhook was merged into Apache Arrow mainline in October 2021! Please see our announcements page for the latest news.

Through the CROSS and CERN/HSF organizations, the Skyhook project has earned several spots in Google Summer of Code (GSoC) for which I was a mentor: 2019, 2020, 2021.

Through the Institute for Research and Innovation in Software for High Energy Physics (IRIS-HEP), Skyhook has also been awarded three fellowships for which I was a mentor: 2020, 2021, 2021.

I received my PhD in 2014 from the UC Santa Cruz Database group. My PhD advisor was Neoklis Polyzotis. I subsequently joined the HP Vertica database R&D group at Hewlett Packard in Palo Alto, where I worked on integrating Vertica with external analytics engines such as Distributed-R and Apache Spark. During graduate school I spent several summers at

Selected Publications
Google Scholar profile

Courses Taught

Other Teaching

Professional Service, Mentoring

Professional Service, Peer Review

Patents Granted