Publications
Copyright notice for published papers: This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author’s copyright.
Copyright notice for presentations and other work: All of my presentations, talks, and images on this page are available with Creative Commons Attribution ShareAlike 4.0 license.
1. | Ahmad, Sohaib; Guan, Hui; Friedman, Brian D.; Williams, Thomas; Sitaraman, Ramesh K.; Woo, Thomas Proteus: A High-Throughput Inference-Serving System with Accuracy Scaling Conference In 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 1 (ASPLOS ’24), 2024. @conference{AhmadGSWSW23, |
2. | Chen, Jiayi; Sharma, Nihal; Khan, Tarannum; Liu, Shu; Chang, Brian; Akella, Aditya; Shakkottai, Sanjay; Sitaraman, Ramesh K. Darwin: Flexible Learning-based CDN Caching Conference ACM SIGCOMM Conference, 2023. @conference{ChenSKLCASS2023, |
3. | Maji, Diptyaroop; Pfaff, Ben; R, Vipin P; Sreenivasan, Rajagopal; Firoiu, Victor; Iyer, Sreeram; Josephson, Colleen; Pan, Zhelong; Sitaraman, Ramesh K. Bringing Carbon Awareness to Multi-cloud Application Delivery Conference In 2nd Workshop on Sustainable Computer Systems (HotCarbon ’23), 2023. @conference{MajiPVSFIJPS23, |
4. | Maghakian, Jessica; Lee, Russell; Hajiesmaili, Mohammad; Li, Jian; Sitaraman, Ramesh; Liu, Zhenhua Applied Online Algorithms with Heterogeneous Predictors Conference Proceedings of the 40th International Conference on Machine Learning (ICML), 2023. @conference{MaghakianLHLSL23, |
5. | Maji, Diptyaroop; Shenoy, Prashant; Sitaraman, Ramesh K. Multi-day Forecasting of Electric Grid Carbon Intensity using Machine Learning Journal Article In: ACM SIGENERGY Energy Informatics Review, vol. 3 , iss. 2, 2023. @article{MajiSS23, |
6. | Kumar, Dhruv; Ahmad, Sohaib; Chandra, Abhishek; Sitaraman, Ramesh K. AggFirstJoin: Optimizing Geo-Distributed Joins using Aggregation-Based Transformations Conference IEEE/ACM 23rd International Symposium on Cluster, Cloud and Internet Computing (CCGrid), 2023. @conference{KumarACS23, |
7. | Anirudh Sabnis and, Tareq Si Salem; Neglia, Giovanni; Garetto, Michele; Leonardi, Emilio; Sitaraman, Ramesh K. GRADES: Gradient Descent for Similarity Caching Journal Article In: IEEE/ACM TRANSACTIONS ON NETWORKING, vol. 31, no. 1, 2023. @article{SabnisSNGLS23, |
8. | Vats, Shivi; Park, Jounsup; Nahrstedt, Klara; Zink, Michael; Sitaraman, Ramesh; Hellwagner, Hermann Semantic-Aware View Prediction for 360-Degree Videos at the 5G Edge Conference 2022. @conference{VatsPNZSH22, |
9. | Maji, Diptyaroop; Shenoy, Prashant; Sitaraman, Ramesh K CarbonCast: Multi-Day Forecasting of Grid Carbon Intensity Conference ACM International Conference on Systems for Energy-Efficient Buildings, Cities, and Transportation (BuildSys '22), 2022. @conference{MajiSS22, |
10. | Sabnis, Anirudh; Sitaraman, Ramesh K. JEDI: Model-driven trace generation for cache simulations Proceedings Article In: 22nd ACM Internet Measurement Conference (IMC '22), 2022. @inproceedings{SabnisS22, A major obstacle for caching research is the increasing diffi- culty of obtaining original traces from production caching systems. Original traces are voluminous and also may contain private and proprietary information, and hence not generally made available to the public. The lack of original traces hampers our ability to evaluate new cache designs and provides the rationale for JEDI, our new synthetic trace generation tool. JEDI generates a synthetic trace that is “similar” to the original trace collected from a production cache, in particular, the two traces have similar object-level properties and produce similar hit rates in a cache simulation. JEDI uses a novel traffic model called Popularity-Size Footprint Descriptor (pFD) that concisely captures key properties of the original trace and uses the pFD to generate the synthetic trace. We show that the synthetic traces produced by JEDI can be used to accurately simulate a wide range of cache admission and eviction algorithms and the hit rates obtained from these simulations correspond closely to those obtained from simulations that use the original traces. JEDI will be provided to the public as open-source, along with a library of pFD’s computed from traffic classes hosted on Akamai’s production CDN. This will allow researchers to produce realistic synthetic traces for their own caching research. |
11. | Kumar, Dhruv; Wolfrath, Joel; Chandra, Abhishek; Sitaraman, Ramesh K. Towards WAN-Aware Join Sampling over Geo-Distributed Data Proceedings Article In: In 5th ACM International Workshop on Edge Systems, Analytics and Networking (EdgeSys’22), 2022. @inproceedings{KumarWCS21, Large scale data analytics over geographically distributed data sources is challenging primarily due to the constrained and hetero- geneous resource availability such as the wide area network (WAN) bandwidth. In this work, we look at the problem of generating random samples over joins for geo-distributed data sources. Joins are one of the most fundamental yet expensive operations in data analytics. To reduce the cost of computing joins, existing techniques have looked at efficiently generating a random sample over the join result for centralized environments, where all the data is available in one location. These techniques fail to address the unique chal- lenges posed by geo-distributed environments. To address these challenges, we propose a sampling technique which aims to reduce the WAN traffic and latency, thereby reducing the overall latency for generating samples over joins for geo-distributed data sources. We implement our geo-distributed sampling technique on top of Apache Spark and compare it with existing state-of-the-art sam- pling techniques to identify scenarios where the proposed approach gives significant benefits. Based on this exploration, we provide a detailed outline of additional factors which should be consid- ered when designing a WAN-aware join sampling technique for geo-distributed environments. |
12. | Yang, Juncheng; Sabnis, Anirudh; Berger, Daniel S.; Rashmi, K. V.; Sitaraman, Ramesh K. C2DN: How to Harness Erasure Codes at the Edge for Efficient Content Delivery Proceedings Article In: 19th USENIX Symposium on Networked Systems Design and Implementation (NSDI 22), pp. 1159–1177, USENIX Association, Renton, WA, 2022, ISBN: 978-1-939133-27-4. @inproceedings{YangSBRS22, Content Delivery Networks (CDNs) deliver much of the world’s web and video content to users from thousands of clusters deployed at the “edges” of the Internet. Maintain- ing consistent performance in this large distributed system is challenging. Through analysis of month-long logs from over 2000 clusters of a large CDN, we study the patterns of server unavailability. For a CDN with no redundancy, each server unavailability causes a sudden loss in performance as the objects previously cached on that server are not accessible, which leads to a miss ratio spike. The state-of-the-art miti- gation technique used by large CDNs is to replicate objects across multiple servers within a cluster. We find that although replication reduces miss ratio spikes, spikes remain a perfor- mance challenge. We present C2DN, the first CDN design that achieves a lower miss ratio, higher availability, higher resource efficiency, and close-to-perfect write load balancing. The core of our design is to introduce erasure coding into the CDN architecture and use the parity chunks to re-balance the write load across servers. We implement C2DN on top of open-source production software and demonstrate that com- pared to replication-based CDNs, C2DN obtains 11% lower byte miss ratio, eliminates unavailability-induced miss ratio spikes, and reduces write load imbalance by 99%. |
13. | Yang, Lin; Zeynali, Ali; Hajiesmaili, Mohammad H.; Sitaraman, Ramesh K.; Towsley, Don Competitive Algorithms for Online Multidimensional Knapsack Problems Journal Article In: Proceedings of the ACM on the Measurement Analysis of Computing Systems (POMACS), 2021. @article{YangZHST21, |
14. | Kumar, Dhruv; Ahmad, Sohaib; Chandra, Abhishek; Sitaraman, Ramesh K. AggNet: Cost-Aware Aggregation Networks for Geo-distributed Streaming Analytics Conference ACM/IEEE Symposium on Edge Computing (SEC '21), 2021. @conference{KumarACS21, |
15. | Sabnis, Anirudh; Sitaraman, Ramesh TRAGEN: A Synthetic Trace Generator for Realistic Cache Simulations Conference ACM Internet Measurement Conference (IMC), 2021. @conference{SabnisSitaraman2021, |
16. | Wang, Lingdong; Hajiesmaili, Mohammad; Sitaraman, Ramesh K FOCAS: Practical Video Super Resolution using Foveated Rendering Conference ACM International Conference on Multimedia (MM ’21), 2021. @conference{WangHS21, Super-resolution (SR) is a well-studied technique for reconstructing high-resolu- tion (HR) images from low-resolution (LR) ones. SR holds great promise for video streaming since an LR video segment can be transmitted from the video server to the client that then reconstructs the HR version using SR, resulting in a significant reduction in network bandwidth. However, SR is seldom used in practice for real-time video streaming, because the computational overhead of frame reconstruction results in large latency and low frame rate. To reduce the computational overhead and make SR practi- cal, we propose a deep-learning-based SR method called Foveated Cascaded Video Super Resolution (FOCAS). FOCAS relies on the fact that human eyes only have high acuity in a tiny central foveal region of the retina. FOCAS uses more neural network blocks in the foveal region to provide higher video quality, while using fewer blocks in the periphery as lower quality is sufficient. To optimize the computational resources and reduce reconstruction latency, FOCAS formulates and solves a convex optimization problem to decide the number of neural network blocks to use in each region of the frame. Using extensive experiments, we show that FOCAS reduces the latency by 50% − 70% while maintaining comparable visual quality as traditional (non-foveated) SR. Further, FOCAS provides a 12 − 16× reduction in the client-to-server network bandwidth in comparison with sending the full HR video segments. |
17. | Zhang, Xiao; Sen, Tanmoy; Zhang, Zheyuan; April, Tim; Chandrasekaran, Balakrishnan; Choffnes, David; Maggs, Bruce M.; Shen, Haiying; Sitaraman, Ramesh K.; Yang, Xiaowei AnyOpt: Predicting and Optimizing IP Anycast Performance Proceedings Article In: Proceedings of the ACM SIGCOMM Conference, 2021. @inproceedings{Zhangetal2021, |
18. | Lee, Russell; Maghakian, Jessica; Hajiesmaili, Mohammad; Li, Jian; Sitaraman, Ramesh; Liu, Zhenhua Online Peak-Aware Energy Scheduling with Untrusted Advice Conference ACM International Conference on Future Energy Systems (ACM e-Energy), 2021. @conference{LeeMHLSL21, |
19. | Sabnis, Anirudh; Salem, Tareq Si; Neglia, Giovanni; Garetto, Michele; Leonardi, Emilio; Sitaraman, Ramesh K GRADES: Gradient Descent for Similarity Caching Proceedings Article In: IEEE Conference on Computer Communications (INFOCOM), 2021. @inproceedings{sabnis2021grades, |
20. | Park, Jounsup; Wu, Mingyuan; Lee, Eric; Chen, Bo; Nahrstedt, Klara; Zink, Michael; Sitaraman, Ramesh SEAWARE: Semantic Aware View Prediction System for 360-degree Video Streaming Conference IEEE International Symposium on Multimedia (ISM), 2020. @conference{ParkWLCNZS2020, Future view prediction for a 360-degree video streaming system is important to save the network bandwidth and improve the Quality of Experience (QoE). Historical view data of a single viewer and multiple viewers have been used for future view prediction. Video semantic information is also useful to predict the viewer’s future behavior. However, extracting video semantic information requires powerful computing hardware and large memory space to perform deep learning-based video analysis. It is not a desirable condition for most of client devices, such as small mobile devices or Head Mounted Display (HMD). Therefore, we develop an approach where video semantic analysis is executed on the media server, and the analysis results are shared with clients via the Semantic Flow Descriptor (SFD) and View-Object State Machine (VOSM). SFD and VOSM become new descriptive additions of the Media Presentation Description (MPD) and Spatial Relation Description (SRD) to support 360- degree video streaming via the DASH framework. Using the semantic-based approach, we design the Semantic-Aware View Prediction System (SEAWARE) to improve the overall view prediction performance. The evaluation results of 360-degree videos and real HMD view traces show that the SEAWARE system improves the view prediction performance and streams high-quality video with limited network bandwidth. |