ScratchPad
- Jointly Optimizing Preprocessing and Inference for DNN-based Visual Analytics
- Strata: A Cross Media File System
- Occupy the Cloud: Distributed Computing for the 99%
- Slicer: Auto-Sharding for Datacenter Applications
- Andromeda: Performance, Isolation, and Velocity at Scale in Cloud Network Virtualization
- Evolve or Die: High-Availability Design Principles Drawn from Google's Network Infrastructure
- Benchmarking TPU, GPU, and CPU Platforms for Deep Learning
- RDMA over Commodity Ethernet at Scale
- Jupiter Rising A Decade of Clos Topologies and Centralized Control in Google's Datacenter Network
- Project PBerry: FPGA Acceleration for Remote Memory
- Azure Accelerated Networking: SmartNICs in the Public Cloud
- In-Datacenter Performance Analysis of a Tensor Processing Unit
- ASIC Clouds: Specializing the Datacenter
- Happiness index: Right-sizing the cloud’s tenant-provider interface
- Nines are Not Enough: Meaningful Metrics for Clouds
- Attack of the Killer Microseconds
- A Case for NOW (Networks of Workstations)
- Fractional GPUs: Software-based Compute and Memory Bandwidth Reservation for GPUs
- Unifying Data, Model and Hybrid Parallelism in Deep Learning via Tensor Tiling
- Beyond Data and Model Parallelism for Deep Neural Networks
- PipeDream: Generalized Pipeline Parallelism for DNN Training
- SplitFS: Reducing Software Overhead in File Systems for Persistent Memory
- PRETZEL: Opening the Black Box of Machine Learning Prediction Serving Systems
- ALEX: An Updatable Adaptive Learned Index
- Deep Learning Inference Service at Microsoft
© 2020 ScratchPad ― Powered by Jekyll and Textlog theme