NDM 2015‎ > ‎Workshop Program‎ > ‎

Accepted Papers

ACM proceedings: http://dl.acm.org/citation.cfm?id=2832099
Papers are available through the ACM Digital Library and IEEE Xplore free of charge during and immediately after SC'15

paper 2: 
Managing Scientific Data with Named Data Networking (NDN)
[slides]

Chengyu Fan, Susmit Shannigrahi, Steve Dibendetto, Catherine Olschanowsky, Christos Papadopoulos, Harvey Newman, Edmund Yeh, Jean-Roch Vlimant, Azher Amin, Dorian Kcira, Iosif Legrand, Ramiro Voicu, David Randall, Kelley Wittmeyer, Mark Branson and Don Dazlich
(Colorado State University, Northeastern University, Caltech, CERN)

Abstract: Many scientific domains, such as climate science and High Energy Physics (HEP), have data management requirements that are not well supported by the IP network architecture. Named Data Networking (NDN) is a new network architecture whose service model is better aligned with the needs of data-oriented applications. NDN provides features such as best-location retrieval, caching, load sharing, and trans- parent failover that would otherwise be painstakingly (re-)implemented by each application using point-to-point semantics in an IP network. We present the first scientific data management application designed and implemented on top of NDN. We use this application to manage climate and HEP data over a dedicated, high-performance, testbed. Our application has two main components: a UI for dataset discovery queries and a federation of synchronized name catalogs. We show how NDN primitives can be used to implement common data management operations such as publishing, search, efficient retrieval, and publication access control.


paper 3: A Multi-domain SDN for Dynamic Layer-2 Path Service 
[slides]

Scott Tepsuporn, Fatemah Alali, Malathi Veeraraghavan, Xiang Ji, Brian Cashman, A. J. Ragusa, Luke Fowler, Chin Guok, Tom Lehman and Xi Yang  
(University of Virginia, Indiana University, ESnet, University of Maryland)

Abstract: This paper describes our experience in deploying a multi- domain Software-Defined Network (SDN) that supports dynamic Layer-2 (L2) path service, and offers insights gained from this experience. SDN controllers, capable of handling requests for advance-reservation and provisioning of rate- guaranteed L2 paths, were deployed in each domain. The experience demonstrated that this architecture can support global-scale multi-domain dynamic L2 path service. How- ever, to reach this scale, better tools are required for diagnostics of end-to-end L2 connectivity, and better error- reporting functionality is needed from the SDN controllers. As a use case for rate-guaranteed L2 path service, we experimented with high-speed large dataset transfers. We found that a combination of Circuit TCP (CTCP), in which the sending rate is held fixed, and a token bucket filter based rate shaper at the sending host, is best to achieve almost 0-loss, high-throughput transfers across L2 paths. Detailed studies were conducted to understand the impact of the rate-shaper and CTCP parameters to find the best settings.



paper 7: Approximate Causal Consistency for Partially Replicated Geo-Replicated Cloud Storage [slides]
Ajay Kshemkalyani and Tayuan Hsu   
(University of Illinois at Chicago)

Abstract: In geo-replicated systems and the cloud, data replication provides fault tolerance and low latency. Causal consistency in such systems is an interesting consistency model. Most existing works assume the data is fully replicated because this greatly simplifies the design of the algorithms to implement causal consistency. Recently, we proposed causal consistency under partial replication because it reduces the number of messages used under a wide range of workloads. One drawback of partial replication is that its meta-data tends to be relatively large when the message size is small. In this paper, we propose approximate causal consistency whereby we can reduce the meta-data at the cost of some violations of causal consistency. The amount of violations can be made arbitrarily small by controlling a tunable parameter, that we call credits.


paper 9: Design and Implementation of Control Sequence Generator for SDN-enhanced MPI
[slides]
Baatarsuren Munkhdorj, Keichi Takahashi, Dashdavaa Dashdavaa, Yasuhiro Watashiba, Yoshiyuki Kido, Susumu Date and Shinji Shimojo   
(Osaka University, Japan)

Abstract: MPI (Message Passing Interface) offers a suite of APIs for inter-process communication among parallel processes. We have approached to the acceleration of MPI collective com- munication such as MPI_Bcast and MPI_Allreduce, taking advantage of network programmability brought by Software Defined Networking (SDN). The basic idea is to allow a SDN controller to dynamically control the packet flows generated by MPI collective communication based on the communication pattern and the underlying network conditions. Al- though our research have succeeded to accelerate an MPI collective communication in terms of execution time, the switching of network control functionality for MPI collective communication along MPI program execution have not been considered yet. This paper presents a mechanism that provides the control sequence for SDN controller to control packet flows based on the communication plan for the entire MPI application. The control sequence encloses a chronologically ordered list of the MPI collectives operated in the MPI application and the process-related information of each in the list. To verify if the SDN-enhanced MPI collectives can be used in combination with the proposed mechanism, the envisioned environment was prototyped. As a result, SDN-enhanced MPI collectives were able to be used in combination.


   
paper 10:
Hysteresis-based Optimization of Data Transfer Throughput 

[slides]
 
Md S Q Zulkar Nine, Kemal Guner and Tevfik Kosar   
(University at Buffalo, SUNY)

Abstract: The achievable throughput for a data transfer can be deter- mined by a variety of factors such as network bandwidth, round trip time, background traffic, dataset size, and end- system configuration. For the best-effort optimization of the transfer throughput, three application-layer transfer parameters – pipelining, parallelism and concurrency – have been actively used in the literature. However, it is highly challenging to identify the best combination of these parameter settings for a specific data transfer request. In this paper, we analyze historical data consisting of 70 Million file transfers; apply data mining techniques to extract the hidden relations among the parameters and the optimal through- put; and propose a novel approach based on hysteresis to predict the optimal parameter settings.