WAVE: A Decentralized Authorization Framework with Transitive Delegation

Most deployed authorization systems rely on a central trusted service whose compromise can lead to the breach of millions of user accounts and permissions. We present WAVE, an authorization framework offering

Conference/Journal: 28th USENIX Security Symposium (Security 2019)
Publication Date: 08/14/2019
Author(s): Michael Andersen, Sam Kumar, Moustafa AbdelBaky, Gabe Fierro, Jack Kolb, Hyung-Sin Kim, David Culler, Raluca Ada Popa

Thread/OpenThread: A Compromise in Low-Power Wireless Multihop Network Architecture for the Internet of Things

Extending an Internet subnet by connecting resource-constrained nodes (e.g., embedded sensors and actuators) over multiple wireless hops is necessary to support the future Internet of Things (IoT). RPL, the IPv6 ro

Conference/Journal: IEEE Communications Magazine (Future Internet: Architectures and Protocols)
Publication Date: 07/18/2019
Author(s): Hyung-Sin Kim, Sam Kumar, David Culler

ALICE: Autonomous Link-based Cell Scheduling for TSCH

Although low-power lossy network (LLN), at its early stage, commonly used asynchronous link layer protocols for simple operation on resource-constrained nodes, development of embedded hardware and time synchronizat

Conference/Journal: IPSN 2019: The 18th ACM/IEEE Conference on Information Processing in Sensor Networks
Publication Date: 04/01/2019
Author(s): Seohyang Kim, Hyung-Sin Kim, Chongkwon Kim

Scaling Video Analytics to Large Camera Deployments

We discuss the potential of spatio-temporal correlations — content correlations between geographically proximate cameras in wide-area enterprise camera deployments — to improve cost efficiency and

Conference/Journal: ACM Workshop on Mobile Computing Systems and Applications (HotMobile '19)
Publication Date: 02/27/2019
Author(s): Samvit Jain, Ganesh Ananthanarayanan, Junchen Jiang, Yuanchao Shu, Joseph Gonzalez

Confluo: Distributed Monitoring and Diagnosis Stack for High-speed Networks

Confluo is an end-host stack that can be integrated with existing network management tools to enable monitoring and diagnosis of network-wide events using telemetry data distributed across end-hosts, even for

Conference/Journal: USENIX Symposium on Networked Systems Design and Implementation (NSDI'19)
Publication Date: 02/26/2019
Author(s): Anurag Khandelwal, Rachit Agarwal, Ion Stoica

FPGA Accelerated INDEL Realignment in the Cloud

The amount of data being generated in genomics is predicted to be between 2 and 40 exabytes per year for the next decade, making genomic analysis the new frontier and the new challenge for precision medic

Conference/Journal: IEEE International Symposium on High-Performance Computer Architecture (HPCA) 2019
Publication Date: 02/19/2019
Author(s): Lisa Wu, David Bruns-Smith, Frank Nothaft, Qijing Huang, Sagar Karandikar, Johnny Le, Andrew Lin, Howard Mao, Brendan Sweeney, Krste Asanovic, David Patterson, Anthony Joseph

Serverless Computing: One Step Forward, Two Steps Back

Serverless computing offers the potential to program the cloud in an autoscaling, pay-as-you-go manner. In this paper we address critical gaps in first-generation serverless computing, which place its

Conference/Journal: CIDR '19
Publication Date: 01/15/2019
Author(s): Joe Hellerstein, Jose M. Faleiro, Joseph Gonzalez, Johann Schleier-Smith, Vikram Sreekanti, Alexey Tumanov, Chenggang Wu

System Architecture Directions for Post-SoC/32-bit Networked Sensors

The emergence of low-power 32-bit Systems-on-Chip (SoCs), which integrate a 32-bit MCU, radio, and flash, presents an opportunity to re-examine design points and trade-offs at all levels of the system architecture

Conference/Journal: The 16th ACM Conference on Embedded Networked Sensor Systems (SenSys ’18)
Publication Date: 11/04/2018
Author(s): Hyung-Sin Kim, Michael Andersen, Kaifei Chen, Sam Kumar, William J. Zhao, Kevin Ma, David Culler

Benchmarks for reinforcement learning in mixed-autonomy traffic

We release new benchmarks in the use of deep reinforcement learning (RL) to create controllers for mixed-autonomy traffic, where connected and autonomous vehicles (CAVs) interact with human drivers and infrastruct

Conference/Journal: 2nd Conference on Robot Learning
Publication Date: 10/31/2018
Author(s): Eugene Vinitsky, Aboudy Kriedieh, Luc Le Flem, Nishant Kheterpal, Kathy Jang, Cathy Wu, Richard Liaw, Eric Liang, Alexandre Bayen

Mitigating the Latency-Accuracy Trade-off in Mobile Data Analytics Systems

An increasing amount of mobile analytics is performed on data that is procured in a real-time fashion to make real-time decisions. Such tasks include simple reporting on streams to sophisticated model building. How

Conference/Journal: 24th Annual International Conference on Mobile Computing and Networking (MobiCom)
Publication Date: 10/29/2018
Author(s): Anand Padmanabha Iyer, Li Erran Li, Mosharaf Chowdhury, Ion Stoica

Ray: A Distributed Framework for Emerging AI Applications

The next generation of AI applications will continuously interact with the environment and learn from these interactions. These applications impose new and demanding systems requirements, both in terms of performan

Conference/Journal: USENIX Symposium on Operating Systems Design and Implementation (OSDI)
Publication Date: 10/10/2018
Author(s): Philipp Moritz, Robert Nishihara, Stephanie Wang, Alexey Tumanov, Richard Liaw, Eric Liang, Melih Elibol, Zongheng Yang, William Paul, Michael Jordan, Ion Stoica

ASAP: Fast, Approximate Graph Pattern Mining at Scale

While there has been a tremendous interest in processing data that has an underlying graph structure, existing distributed graph processing systems take several minutes or even hours to mine simple patterns on grap

Conference/Journal: USENIX Symposium on Operating Systems Design and Implementation (OSDI)
Publication Date: 10/10/2018
Author(s): Anand Padmanabha Iyer, Zaoxing Liu, Xin Jin, Shivaram Venkataraman, Vladimir Braverman, Ion Stoica

FairFuzz: A Targeted Mutation Strategy for Increasing Greybox Fuzz Testing Coverage

In recent years, fuzz testing has proven itself to be one of the most effective techniques for finding correctness bugs and security vulnerabilities in practice. One particular fuzz testing tool, American Fuzzy Lop

Conference/Journal: 33rd ACM/IEEE International Conference on Automated Software Engineering
Publication Date: 09/03/2018
Author(s): Caroline Lemieux, Koushik Sen

e-mission: An Open-Source, Smartphone Platform for Collecting Human Travel Data

GPS-equipped smartphones provide new methods to collect data about travel behavior, including travel survey apps that incorporate automated location sensing. Previous approaches to this have involved proprietary or

Conference/Journal: Transportation Research Record: Journal of the Transportation Research Board
Publication Date: 08/19/2018
Author(s): K. Shankari, Mohamed Amine Bouzaghrane, Samuel M. Maurer, Paul Waddell, David Culler, Randy Katz

IDK Cascades: Fast Deep Learning by Learning not to Overthink

We introduce the "I Don't Know" (IDK) prediction cascades framework, a general framework to systematically compose a set of pre-trained models to accelerate inference without a loss in prediction accuracy.

Conference/Journal: Conference on Uncertainty in Artificial Intelligence (UAI) 2018
Publication Date: 07/18/2018
Author(s): Xin Wang, Yujia Luo, Dan Crankshaw, Alexey Tumanov, Fisher Yu, Joseph Gonzalez

Tributary: spot-dancing for elastic services with latency SLOs

The Tributary elastic control system embraces the uncertain nature of transient cloud resources, such as AWS spot instances, to manage elastic services with latency SLOs more robustly and more cost-effectively. Suc

Conference/Journal: 2018 USENIX Annual Technical Conference
Publication Date: 07/11/2018
Author(s): Aaron Harlap, Andrew Chung, Alexey Tumanov, Gregory R. Ganger, Phillip B. Gibbons

Monarch: Gaining Command on Geo-Distributed Graph Analytics

A number of existing and emerging application scenarios generate graph-structured data in a geo-distributed fashion. Although there is a lot of interest in distributed graph processing systems, none of them support

Conference/Journal: 10th USENIX Workshop on Hot Topics in Cloud Computing (HotCloud '18)
Publication Date: 07/09/2018
Author(s): Anand Padmanabha Iyer, Aurojit Panda, Mosharaf Chowdhury, Aditya Akella, Scott Shenker, Ion Stoica

Towards Fast and Scalable Graph Pattern Mining

While there has been a tremendous interest in processing graph-structured data, existing distributed graph processing systems take several minutes or even hours to mine simple patterns on graphs. In this paper, we

Conference/Journal: 10th USENIX Workshop on Hot Topics in Cloud Computing (HotCloud '18)
Publication Date: 07/09/2018
Author(s): Anand Padmanabha Iyer, Zaoxing Liu, Xin Jin, Shivaram Venkataraman, Vladimir Braverman, Ion Stoica

Shift: A Zero FLOP, Zero Parameter Alternative to Spatial Convolutions

Neural networks rely on convolutions to aggregate spatial information. However, spatial convolutions are expensive in terms of model size and computation, both of which grow quadratically with respect to kernel siz

Conference/Journal: Conference on Computer Vision and Pattern Recognition (CVPR)
Publication Date: 06/18/2018
Author(s): Bichen Wu, Alvin Wan, Xiangyu Yue, Peter Jin, Sicheng Zhao, Noah Golmant, Amir Gholaminejad, Joseph Gonzalez, Kurt Keutzer

Bridging the GAP: Towards Approximate Graph Analytics

While there has been a tremendous interest in processing data that has an underlying graph structure, existing distributed graph processing systems take several minutes or even hours to execute popular graph algori

Conference/Journal: SIGMOD Graph Data-management Experiences & Systems (GRADES)
Publication Date: 06/10/2018
Author(s): Anand Padmanabha Iyer, Aurojit Panda, Shivaram Venkataraman, Mosharaf Chowdhury, Aditya Akella, Scott Shenker, Ion Stoica

FireSim: FPGA-Accelerated Cycle-Exact Scale-Out System Simulation in the Public Cloud

We present FireSim, an open-source simulation platform that enables cycle-exact microarchitectural simulation of large scale-out clusters by combining FPGA-accelerated simulation of silicon-proven RTL designs

Conference/Journal: 2018 ACM/IEEE 45th Annual International Symposium on Computer Architecture (ISCA 2018)
Publication Date: 06/04/2018
Author(s): Sagar Karandikar, Howard Mao, Donggyu Kim, David Biancolin, Alon Amid, Dayeol Lee, Nathan Pemberton, Emmanuel Amaro, Colin Schmidt, Aditya Chopra, Qijing Huang, Kyle Kovacs, Borivoje Nikolic, Randy Katz, Jonathan Bachrach, Krste Asanovic

A Hardware Accelerator for Tracing Garbage Collection

A large number of workloads are written in garbage-collected languages. These applications spend up to 10-35% of their CPU cycles on GC, and these numbers increase further for pause-free concurrent collectors. As t

Conference/Journal: 45th International Symposium on Computer Architecture (ISCA'18)
Publication Date: 06/04/2018
Author(s): Martin Maas, Krste Asanovic, John Kubiatowicz

Parametrized Hierarchical Procedures for Neural Programming

Parametrized Hierarchical Procedures (PHP) represent a program as a hierarchy of procedures that call each other, each implemented by a neural network. We develop an algorithm for training PHPs from a set of

Conference/Journal: International Conference on Learning Representations (ICLR) 2018
Publication Date: 04/30/2018
Author(s): Roy Fox, Eui Chul (Richard) Shin, Sanjay Krishnan, Kenneth Goldberg, Dawn Song, Ion Stoica

3Sigma: distribution-based cluster scheduling for runtime uncertainty

The 3Sigma cluster scheduling system uses job runtime histories in a new way. Knowing how long each job will execute enables a scheduler to more effectively pack jobs with diverse time concerns (e.g., deadline vs.

Conference/Journal: ACM European Conference on Computer Systems (EuroSys'2018)
Publication Date: 04/23/2018
Author(s): Jun Woo Park, Alexey Tumanov, Angela Jiang, Michael A. Kozuch, Gregory R. Ganger

SafeBricks: Shielding Network Functions in the Cloud

With the advent of network function virtualization (NFV), outsourcing network processing to the cloud is growing in popularity amongst enterprises and organizations. Such outsourcing, however, poses a threat to the

Conference/Journal: 15th USENIX Symposium on Networked Systems Design and Implementation (NSDI) 2018
Publication Date: 04/09/2018
Author(s): Rishabh Poddar, Chang Lan, Raluca Ada Popa, Sylvia Ratnasamy

NetChain: Scale-Free Sub-RTT Coordination

Coordination services are a fundamental building block of modern cloud systems, providing critical functionalities like configuration management and distributed locking. The major challenge is to achieve low latenc

Conference/Journal: 15th USENIX Symposium on Networked Systems Design and Implementation (NSDI '18)
Publication Date: 04/09/2018
Author(s): Xin Jin, Ion Stoica, Xiaozhou Li, Haoyu Zhang, Nate Foster, Jeongkeun Lee, Robert Soulé, Changhoon Kim

RLlib: Abstractions for Distributed Reinforcement Learning

Reinforcement learning (RL) algorithms involve the deep nesting of highly irregular computation patterns, each of which typically exhibits opportunities for distributed computation. We argue for distributing

Conference/Journal: International Conference on Machine Learning (ICML 2018)
Publication Date: 12/07/2017
Author(s): Eric Liang, Richard Liaw, Robert Nishihara, Philipp Moritz, Roy Fox, Kenneth Goldberg, Joseph Gonzalez, Michael Jordan, Ion Stoica

SnapLink: Fast and Accurate Vision-Based Appliance Control in Large Commercial Buildings

As the number and heterogeneity of appliances in smart buildings increases, identifying and controlling them becomes challenging. Existing methods face various challenges when deployed in large commercial buildings

Conference/Journal: Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies, Vol. 1, No. 4
Publication Date: 12/01/2017
Author(s): Kaifei Chen, Jonathan Fürst, Jack Kolb, Hyung-Sin Kim, Xin Jin, David Culler, Randy Katz

Design and Analysis of a Query Processor for Brick

Brick is a recently proposed metadata schema and ontology for describing building components and the relationships between them. It represents buildings as directed labeled graphs using the RDF data model. Using th

Conference/Journal: 4th ACM International Conference on Systems for Energy-Efficient Built Environments (BuildSys), 2017
Publication Date: 11/09/2017
Author(s): Gabe Fierro

Drizzle: Fast and Adaptable Stream Processing at Scale

Large scale streaming systems aim to provide high throughput and low latency. They are often used to run mission-critical applications, and must be available 24×7. Thus such systems need to adapt to failures and in

Conference/Journal: Symposium on Operating Systems Principles (SOSP) 2017
Publication Date: 10/28/2017
Author(s): Shivaram Venkataraman, Aurojit Panda, Kay Ousterhout, Michael Armbrust, Ali Ghodsi, Mike Franklin, Benjamin Recht, Ion Stoica

Automating Diagnosis of Cellular Radio Access Network Problems

In an increasingly mobile connected world, our user experience of mobile applications more and more depends on the performance of cellular radio access networks (RANs). To achieve high quality of experience for the

Conference/Journal: 23rd ACM Annual International Conference on Mobile Computing and Networking (MobiCom 2017)
Publication Date: 10/16/2017
Author(s): Anand Padmanabha Iyer, Li Erran Li, Ion Stoica

A Berkeley View of Systems Challenges for AI

In this paper, we propose several open research directions in systems, architectures, and security that can address these challenges and help unlock AI’s potential to improve lives and society.

Conference/Journal: UC Berkeley EECS Technical Report
Publication Date: 10/16/2017
Author(s): Ion Stoica, Dawn Song, Raluca Ada Popa, Michael Mahoney, Randy Katz, Anthony Joseph, Michael Jordan, Joe Hellerstein, et al.

Full-System Simulation of Java Workloads with RISC-V and the Jikes Research Virtual Machine

Managed languages such as Java, JavaScript or Python account for a large portion of workloads, both in cloud data centers and on mobile devices. It is therefore unsurprising that there is an interest in hardware-so

Conference/Journal: 1st Workshop on Computer Architecture Research with RISC-V (CARRV '17), Boston, MA, October 2017
Publication Date: 10/14/2017
Author(s): Martin Maas, Krste Asanovic, John Kubiatowicz

Selecting the Best VM across Multiple Public Clouds: A Data-Driven Performance Modeling Approach

Users of cloud services are presented with a bewildering choice of VM types and the choice of VM can have significant implications on performance and cost. In this paper we address the fundamental problem of accura

Conference/Journal: ACM Symposium on Cloud Computing 2017 (SoCC '17)
Publication Date: 09/25/2017
Author(s): Neeraja Yadwadkar, Bharath Hariharan, Joseph Gonzalez, Burton Smith, Randy Katz

Challenging the IPv6 Routing Protocol for Low-Power and Lossy Networks (RPL): A Survey

RPL is the IPv6 routing protocol for low-power and lossy networks, standardized by IETF in 2012 as RFC6550. Specifically, RPL is designed to be a simple and inter-operable networking protocol for resource-constrain

Conference/Journal: IEEE Communications Surveys & Tutorials (Volume: 19, Issue: 4, Fourthquarter 2017)
Publication Date: 09/13/2017
Author(s): Hyung-Sin Kim, Jeonggil Ko, David Culler, Jeongyeup Paek

Learning certifiably optimal rule lists for categorical data

As machine learning continues to gain prominence in socially-important decision-making, the interpretability of predictive models remains a crucial problem. Our goal is to build models that are highly predictive, t

Conference/Journal: 23rd SIGKDD Conference on Knowledge Discovery and Data Mining (KDD '17)
Publication Date: 08/15/2017
Author(s): Elaine Angelino, Nicholas Larus-Stone, Daniel Alabi, Margo Seltzer, Cynthia Rudin

Sketched Ridge Regression: Optimization Perspective, Statistical Perspective, and Model Averaging

We address the statistical and optimization impacts of using classical sketch versus Hessian sketch to solve approximately the Matrix Ridge Regression (MRR) problem. Prior research has considered the effects of cla

Conference/Journal: International Conference on Machine Learning
Publication Date: 08/07/2017
Author(s): Shusen Wang, Alex Gittens, Michael Mahoney

Capacity Releasing Diffusion for Speed and Locality

Diffusions and related random walk procedures are of central importance in many areas of machine learning, data analysis, and applied mathematics. Because they spread mass agnostically at each step in an iterative

Conference/Journal: Proceedings of the 34th International Conference on Machine Learning (ICML '17)
Publication Date: 07/28/2017
Author(s): Di Wang, Kimon Fountoulakis, Monika Henzinger, Michael W. Mahoney, Satish Rao

Real-Time Machine Learning: The Missing Pieces

Machine learning applications are increasingly deployed not only to serve predictions using static models, but also as tightly-integrated components of feedback loops involving dynamic, real-time decision

Conference/Journal: HotOS 2017
Publication Date: 05/10/2017
Author(s): Robert Nishihara, Philipp Moritz, Stephanie Wang, Alexey Tumanov, William Paul, Johann Schleier-Smith, Richard Liaw, Mehrdad Niknami, Michael Jordan, Ion Stoica

Opaque: An Oblivious and Encrypted Distributed Analytics Platform

As enterprises move to cloud-based analytics, the risk of cloud security breaches poses a serious threat. Encrypting data at rest and in transit is a major first step. However, data must still be decrypted in memor

Conference/Journal: NSDI 2017 (USENIX Symposium on Networked Systems Design and Implementation)
Publication Date: 03/27/2017
Author(s): Wenting Zheng, Raluca Ada Popa, Ion Stoica, Joseph Gonzalez, Ankur Dave, Jethro Beekman

Grail Quest: A New Proposal for Hardware-assisted Garbage Collection

Many big data systems are written in garbage-collected languages and GC has a substantial impact on throughput, responsiveness and predictability of these systems. However, despite decades of research, there is stil

Conference/Journal: Sixth Workshop on Architectures and Systems for Big Data (ASBD 2016)
Publication Date: 03/19/2017
Author(s): Martin Maas, Krste Asanovic, John Kubiatowicz

EC-Cache: Load-balanced, Low-latency Cluster Caching with Online Erasure Coding

Data-intensive clusters and object stores are increasingly relying on in-memory object caching to meet the I/O performance demands. These systems routinely face the challenges of popularity skew, background

Conference/Journal: USENIX OSDI
Publication Date: 11/03/2016
Author(s): Mosharaf Chowdhury, Jack Kosaian, Ion Stoica, Kannan Ramchandran

ReStream: Accelerating Backtesting and Stream Replay with Serial-Equivalent Parallel Processing

Real-time predictive applications can demand continuous and agile development, with new models constantly being trained, tested, and then deployed. Training and testing are done by replaying stored event logs,

Conference/Journal: SoCC '16 - Proceedings of the Seventh ACM Symposium on Cloud Computing
Publication Date: 10/06/2016
Author(s): Johann Schleier-Smith, Erik T. Krogen, Joe Hellerstein

Sub-microsecond Adaptive Voltage Scaling in a 28nm FD-SOI Processor

This work presents a RISC-V system-on-chip (SoC) with integrated voltage regulation and power management implemented in 28nm FD-SOI. A fully integrated switched-capacitor DC-DC converter, coupled with an

Conference/Journal: ESSCIRC - ESSDERC 2016
Publication Date: 09/15/2016
Author(s): Krste Asanovic, Benjamin Keller, Martin Cochet, Brian Zimmer, Jaehwa Kwak, Alberto Puggelli, Steven Bailey, Borivoje Nikolic, Palmer Dabbelt, et al.

The Renewed Case for the Reduced Instruction Set Computer: Avoiding ISA Bloat with Macro-Op Fusion for RISC-V

This report makes the case that a well-designed Reduced Instruction Set Computer (RISC) can match, and even exceed, the performance and code density of existing commercial Complex Instruction Set Computers (CISC) w

Conference/Journal: Technical Report
Publication Date: 07/15/2016
Author(s): Krste Asanovic, Christopher Celio, Daniel Dabbelt, David A. Patterson

Time-evolving Graph Processing at Scale

Time-evolving graph-structured big data arises naturally in many application domains such as social networks and communication networks. However, existing graph processing systems lack support for efficient computa

Conference/Journal: Graph Data-management Experiences & Systems (GRADES)
Publication Date: 06/24/2016
Author(s): Anand Padmanabha Iyer, Li Erran Li, Tathagata Das, Ion Stoica

Strober: Fast and Accurate Sample-Based Energy Simulation for Arbitrary RTL

This paper presents a sample-based energy simulation methodology that enables fast and accurate estimations of performance and average power for arbitrary RTL designs. Our approach uses an FPGA to simultaneously si

Conference/Journal: ISCA 2016
Publication Date: 06/15/2016
Author(s): Krste Asanovic, Donggyu Kim, Adam Izraelevitz, Christopher Celio, Hokeum Kim, Brian Zimmer

Multi-Task Learning for Straggler Avoiding Predictive Job Scheduling

Parallel processing frameworks (Dean and Ghemawat, 2004) accelerate jobs by breaking them into tasks that execute in parallel. However, slow running or straggler tasks can run up to 8 times slower than the median t

Conference/Journal: Journal of Machine Learning Research
Publication Date: 06/01/2016
Author(s): Neeraja Yadwadkar, Bharath Hariharan, Joseph Gonzalez, Randy Katz

Optimizing the diamond lane: A more tractable carpool problem and algorithms

Carpooling has been long deemed a promising approach to better utilizing existing transportation infrastructure. However, there are several reasons carpooling is still not the preferred mode of commute in the Unite

Conference/Journal: 19th IEEE Intelligent Transportation Systems Conference
Publication Date:
Author(s): K. Shankari, Ece Kamar, Randy Katz, Christos Papadimitriou, Eric Horvitz, Alexandre Bayen

The emission mobilityscope: personalized data collection for agile urban planning

We have built an initial version of an open source mobilityscope or transportation meter that can track end to end travel patterns using smartphone sensors and user input. The travel patterns can be used to

Conference/Journal: California Higher Education Sustainability Conference (CHESC), Santa Barbara CA, June 2016
Publication Date:
Author(s): K. Shankari, Cathy Wu

Morpheus: Towards Automated SLOs for Enterprise Clusters

Modern resource management frameworks for large-scale analytics leave unresolved the problematic tension between high cluster utilization and job’s performance predictability—respectively coveted by operators an

Conference/Journal: Proceedings of the 12th USENIX Symposium on Operating Systems Design and Implementation (OSDI'16)
Publication Date:
Author(s): C. Curino, I. Menache, S. Narayanamurthy, Alexey Tumanov, J. Yaniv, R. Mavlyutov, I. Goiri, S. Krishnan, J. Kulkarni, S. Rao

Proteus: agile ML elasticity through tiered reliability in dynamic resource markets

Many shared computing clusters allow users to utilize excess idle resources at lower cost or priority, with the proviso that some or all may be taken away at any time. But, exploiting such dynamic resource availabi

Conference/Journal: ACM European Conference on Computer Systems (EuroSys'2017)
Publication Date:
Author(s): Aaron Harlap, Alexey Tumanov, Andrew Chung, Gregory R. Ganger, Phil Gibbons

Decoding from Pooled data: Phase Transitions of Message Passing

We consider the problem of decoding a discrete signal of categorical variables from the observation of several histograms of pooled subsets of it. We present an Approximate Message Passing (AMP) algorithm for recov

Conference/Journal: short version submitted to International Symposium on Information Theory (ISIT), long version to be submitted to IEEE Transactions on Information Theory (IEEEIT)
Publication Date:
Author(s): Aaditya Ramdas, Ahmed El Alaoui, Michael Jordan, Florent Krzakala, Lenka Zdeborova

Random Projection Design for Scalable Implicit Smoothing of Randomly Observed Stochastic Processes

Standard methods for multi-variate time series analysis are hampered by sampling at random timestamps, long range dependencies, and the scale of the data. In this paper we present a novel estimator for

Conference/Journal: AISTATS 2017
Publication Date:
Author(s): Joseph Gonzalez, Evan Sparks, Alexandre M. Bayen

MiniCrypt: Reconciling Encryption and Compression for Big Data Stores

More and more applications and web services generate larger and larger amounts of confidential data, such as user and financial data. On one hand, these systems must use encryption to ensure confidentiality, while

Conference/Journal: EuroSys 2017
Publication Date:
Author(s): Wenting Zheng, Raluca Ada Popa, Ion Stoica, Rachit Agarwal, Frank Li