Resilient distributed datasets: A fault-tolerant abstraction for in-memory cluster computing M Zaharia, M Chowdhury, T Das, A Dave, J Ma, M McCauley, MJ Franklin, ... Proceedings of the 9th USENIX conference on Networked Systems Design and …, 2012 | 6295 | 2012 |
GraphX: Graph Processing in a Distributed Dataflow Framework JE Gonzalez, RS Xin, A Dave, D Crankshaw, MJ Franklin, I Stoica 11th USENIX Symposium on Operating Systems Design and Implementation (OSDI …, 2014 | 3302* | 2014 |
Apache Spark: a unified engine for big data processing M Zaharia, RS Xin, P Wendell, T Das, M Armbrust, A Dave, X Meng, ... Communications of the ACM 59 (11), 56-65, 2016 | 2992 | 2016 |
Opaque: An oblivious and encrypted distributed analytics platform W Zheng, A Dave, JG Beekman, RA Popa, JE Gonzalez, I Stoica 14th USENIX Symposium on Networked Systems Design and Implementation (NSDI …, 2017 | 450 | 2017 |
Fast and Interactive Analytics over Hadoop Data with Spark M Zaharia, M Chowdhury, T Das, A Dave, J Ma, M McCauley, MJ Franklin, ... USENIX ;login;, August 2012, 2012 | 273 | 2012 |
GraphFrames: an integrated API for mixing graph and relational queries A Dave, A Jindal, LE Li, R Xin, J Gonzalez, M Zaharia Proceedings of the fourth international workshop on graph data management …, 2016 | 121 | 2016 |
G-OLA: Generalized on-line aggregation for interactive analysis on big data K Zeng, S Agarwal, A Dave, M Armbrust, I Stoica Proceedings of the 2015 ACM SIGMOD International Conference on Management of …, 2015 | 96 | 2015 |
Photon: A fast query engine for lakehouse systems A Behm, S Palkar, U Agarwal, T Armstrong, D Cashman, A Dave, ... Proceedings of the 2022 International Conference on Management of Data, 2326 …, 2022 | 49 | 2022 |
Oblivious coopetitive analytics using hardware enclaves A Dave, C Leung, RA Popa, JE Gonzalez, I Stoica Proceedings of the Fifteenth European Conference on Computer Systems, 1-17, 2020 | 40 | 2020 |
Arthur: Rich Post-Facto Debugging for Production Analytics Applications A Dave, M Zaharia, S Shenker, I Stoica | 26 | 2013 |
CloudClustering: Toward an iterative data processing pattern on the cloud A Dave, W Lu, J Jackson, R Barga 2011 IEEE International Symposium on Parallel and Distributed Processing …, 2011 | 24 | 2011 |
Resilient distributed datasets: a fault-tolerant abstraction for in-memory cluster computing MZM Chowdhury, T Das, A Dave, MJ Franklin, J Ma, M McCauley, ... NSDI'12 Proceedings of the 9th USENIX conference on Networked Systems Design …, 2012 | 11 | 2012 |
Low-latency Spark queries on updatable data A Uta, B Ghit, A Dave, P Boncz Proceedings of the 2019 International Conference on Management of Data, 2009 …, 2019 | 7 | 2019 |
IndexedRDD A Dave | 3 | 2014 |
In-Memory Indexed Caching for Distributed Data Processing A Uta, B Ghit, A Dave, J Rellermeyer, P Boncz 2022 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2022 | 1 | 2022 |
Persistent adaptive radix trees: Efficient fine-grained updates to immutable data A Dave, JE Gonzalez, MJ Franklin, I Stoica | 1 | |
Hash based rollup with passthrough A Behm, A Dave US Patent 11,675,767, 2023 | | 2023 |
LIFO based spilling for grouping aggregation A Behm, A Dave, R Deng, S Palkar US Patent 11,481,398, 2022 | | 2022 |
Secure, Expressive, and Debuggable Large-Scale Analytics A Dave UC Berkeley, 2020 | | 2020 |
VU Research Portal A Uta, D Duplyakin, C Abad, N Herbst, A Iosup | | 2019 |