the power of remote GPU virtualization

Similar documents
Bringing hardware affinity information into MPI communication strategies

Installation Guide. Unisphere Central. Installation. Release number REV 07. October, 2015

Multi Nodal Regional ATFM/CDM Concept and Operational Trials Colombo 7 May 2014

# 1 in ease-of-use. Guest Service Interconnectivity. Made by hoteliers, for hoteliers.

Distributed Object Storage System Ceph in Practice

RSA SecurID Ready Implementation Guide

Enter here your Presentation Title 1

DATA MANAGEMENT & CONNECTED SOLUTIONS

ATPCO. Intended positioning on the market

Scalable Runtime Support for Data-Intensive Applications on the Single-Chip Cloud Computer

FOR SMALL AND MEDIUM SIZED AIRPORTS Velocity FIDS

Model-based development of self-organized earthquake early warning systems

Washington Dulles International Airport (IAD) Aircraft Noise Contour Map Update

Discrete-Event Simulation of Air Traffic Flow

Remote towers at your service. First validation of remote tower for multiple airports (PJ05)

ICAO Cyber Summit & Exhibition CLOSING REMARKS. 10 April 2017

Gogo Connected Aircraft Services

TABLE OF CONTENTS. Coral Springs Charter High School and Middle School Job No Page 2

Embedded Platforms Conference Program

Towards a 100% renewable energy future: WÄRTSILÄ Energy Solutions

Menlo Park Fire District Training Division. Unmanned Aerial System Pilot

PRAJWAL KHADGI Department of Industrial and Systems Engineering Northern Illinois University DeKalb, Illinois, USA

Implementing an Air Taxi System

ICAO Initiatives on Global Tracking

Four Ways to Enjoy Your Marriott Vacation Club Timeshare Interest

Mathematical modeling in the airline industry: optimizing aircraft assignment for on-demand air transport

FUTURE PASSENGER PROCESSING. ACRP New Concepts for Airport Terminal Landside Facilities

A Statistical Method for Eliminating False Counts Due to Debris, Using Automated Visual Inspection for Probe Marks

JUNEAU RUNWAY INCURSION MITIGATION (RIM) PROGRAM. April 10 th 2017

AIRPORT ADVISORY BOARD COMMUNICATION

A New Way to Work in the ERCOT Market

TWELFTH AIR NAVIGATION CONFERENCE

EUROPEAN FORUM ON CLEAN ENERGY FOR ISLANDS

Constrained Long-Range Plan for the National Capital Region.

A Network Model to Simulate Airport Surface Operations

Hiway Gateway Specification and Technical Data

Digital transformation of aviation Marouan CHIDA

WELCOME TO THE AGE OF THE CONNECTED AIRCRAFT

Atennea Air. The most comprehensive ERP software for operating & financial management of your airline

Todsanai Chumwatana, and Ichayaporn Chuaychoo Rangsit University, Thailand, {todsanai.c;

Granting in a multi-lane environment

- HPEC 2010 Workshop -

Evaluation of Quality of Service in airport Terminals

Documentation Issues and Initiatives

9/16/ CHG 213 VOLUME 3 GENERAL TECHNICAL ADMINISTRATION CHAPTER 61 AIRCRAFT NETWORK SECURITY PROGRAM

REG ASP MIL APO USE INT IND NM

IATA Paperless Operations; Update

P.O. Box 1749 Halifax, Nova Scotia B3J 3A5 Canada Item No. 2 Halifax Regional Council November 13, 2018

SMILEGOV Islands fostering multi-level governance for sustainable energy planning

PSS E 34.0 Release Webinar 23 April 2015

Ground Handling Operations Manual

Amadeus Altéa Airport Link

PSS MVS 7.15 announcement

M. Morant, M.J. VIÑALS, C. Ferris y M.E.Casar Universitat Politècnica de València (Spain) Proyecto financiado por:

David Controle, Analytics Accelerator Airbus. Why Invest in AI and Deep Learning NVIDIA GTC

Traffic Flow Management

world class Infrastructure meets state of the art technology?

FAA GBAS Program Update January 29, 2010

RED ATLAS PRODUCT BROCHURE. From Nevalee Business Solutions

Future airport concept

SLIDING WINDOW & DOOR LOCK

Mobile Work Caravans Product Guide

American Institute of Aeronautics and Astronautics

THE MIDCAS PROJECT. Johan Pellebergs Saab Aerosystems. Keywords: UAS, Sense & Avoid, Standardization, Non-segregated Airspace

The passenger in focus of multimodal airport management

SPADE-2 - Supporting Platform for Airport Decision-making and Efficiency Analysis Phase 2

Setup and Configure the Siteminder Policy Store with Dxmanager

ATC Simulators. The manufacturer of

Chapter 1 EXECUTIVE SUMMARY

PSS VM 7.15 announcement

BUILDING A REGIONAL INFRASTRUCTURE FOR THE FUTURE OF INTERNET IN LATIN AMERICA

etrust SiteMinder Agent r5.5 for BEA WebLogic 9.0 etrust SiteMinder Agent for BEA WebLogic Guide

Evaluation of Alternative Aircraft Types Dr. Peter Belobaba

Runway Length Analysis Prescott Municipal Airport

Flight Crew Operating Manual STANDARD OPERATING PROCEDURES

Wärtsilä Italia S.p.A. Confindustria Venezia Giulia - Trieste, 28 th February 2018

New Distribution Capability (NDC) Enhancing Business and Leisure Travel

Issues and Achievements of Computer Science Students by Historical Data Analyses - Are We Ready for Education Big Data?

Technical Standard Order

UAS Training Solutions

Venue Hire Churchill House

Airport Master Plan. Brookings Regional Airport. Runway Runway 17-35

C O N V E N E S P R E S E N T S C O O R D I N A T E S

ARINC Project Initiation/Modification (APIM)

U.S. DEPARTMENT OF TRANSPORTATION FEDERAL AVIATION ADMINISTRATION. National Policy

Logic Control Summer Semester Assignment: Modeling and Logic Controller Design 1

Agenda: SASP SAC Meeting 3

ANA Domestic Flight Online Business Travel Arrangement System. October 2017 All Nippon Airways Co., Ltd.

AIRPORT MASTER PLAN UPDATE

ACRP Problem Statement No Recommended Allocation: -- ACRP Airport Baggage Handling Opportunity

We transform travel companies into travel retailers

Dark Ride Application IO Control

ARIS/CI check-in counter allocator

6 Haeryip Sihombing Universiti Teknikal Malaysia Melaka (UTeM)

TABLE OF CONTENTS 2019 PILOT TRAINING CATALOG 2

Permeable RECREATIONAL TRAILS

Introduction. You Fly & Rely On Us. Rest Is For Us To Handle.

Aeronautics & Air Transport in FP7. DG RTD-H.3 - Aeronautics Brussels, January 2007

ARINC Project Initiation/Modification (APIM)

Sustaining quality of services through service reliability and availability

Transcription:

Creating new horizons with rcuda: the power of remote virtualization Federico Silla Universitat Politècnica de València Spain

s are great! HPC Advisory Council Perth Conference 2017 2/55

Basics of computing (single node) Basic behavior of CUDA HPC Advisory Council Perth Conference 2017 3/55

Basics of computing (single node) s can only be used within the node they are attached to Basic behavior of CUDA HPC Advisory Council Perth Conference 2017 4/55

Using s across the cluster A -enabled cluster is a set of independent self-contained nodes that share nothing among them: MPI is required for aggregating resources within the cluster (included s) node 1 node 2 node 3 node n Interconnection HPC Advisory Council Perth Conference 2017 5/55

s are great! How can we make an even better usage of s? HPC Advisory Council Perth Conference 2017 6/55

s are great! How can we make an even better usage of s? Which characteristics do we miss from s? HPC Advisory Council Perth Conference 2017 7/55

Characteristics missing in s How can we make an even better usage of s? Which characteristics do we miss from s? 1. Many s in a single box 2. Easily sharing a given (or s) HPC Advisory Council Perth Conference 2017 8/55

Characteristics missing in s 1. Why many s in a single box Traditionally, in order to use many s, applications had to use MPI: s can only be used within the node they are attached to Nothing is directly shared among nodes (MPI required for aggregating computing resources within the cluster) node 1 node 2 node 3 node n A non-mpi application running in this node can only use the s in this node Interconnection HPC Advisory Council Perth Conference 2017 9/55

Characteristics missing in s 1. Many s in a single box HPC Advisory Council Perth Conference 2017 10/55

Characteristics missing in s 1. Many s in a single box The amount of s is limited by the physical space inside the node HPC Advisory Council Perth Conference 2017 11/55

Characteristics missing in s 1. Having many s in a single box MonteCarlo multi- program running in 10 NVIDIA Tesla K40 s HPC Advisory Council Perth Conference 2017 12/55

Characteristics missing in s 1. Having many s in a single box 64 s!! HPC Advisory Council Perth Conference 2017 13/55

Characteristics missing in s 1. Many s in a single box How many s are many s in a single box? HPC Advisory Council Perth Conference 2017 14/55

Characteristics missing in s 1. Many s in a single box How many s are many s in a single box? As many s as they can be installed in the cluster node 1 node 2 node 3 node n Interconnection HPC Advisory Council Perth Conference 2017 15/55

Characteristics missing in s How can we make an even better usage of s? Which characteristics do we miss from s? 1. Many s in a single box 2. Easily sharing a given (or s) HPC Advisory Council Perth Conference 2017 16/55

Characteristics missing in s 2. Easily sharing a given Why should we be interested on sharing s among applications? HPC Advisory Council Perth Conference 2017 17/55

usage of -Blast assigned but not used assigned but not used NVIDIA Tesla K20 HPC Advisory Council Perth Conference 2017 18/55

usage of CUDA-MEME utilization is far away from maximum NVIDIA Tesla K20 HPC Advisory Council Perth Conference 2017 19/55

usage of LAMMPS assigned but not used NVIDIA Tesla K20 HPC Advisory Council Perth Conference 2017 20/55

Sharing a among jobs: -Blast One instance required about 51 seconds Two concurrent instances of -Blast HPC Advisory Council Perth Conference 2017 21/55

Sharing a among jobs: -Blast First instance Two concurrent instances of -Blast HPC Advisory Council Perth Conference 2017 22/55

Sharing a among jobs: -Blast Second instance First instance Two concurrent instances of -Blast HPC Advisory Council Perth Conference 2017 23/55

Sharing a among applications K20 (5GB memory) LAMMPS: 876 MB mcuda-meme: 151 MB BarraCUDA: 3319 MB MUMmer: 2104 MB -LIBSVM: 145 MB HPC Advisory Council Perth Conference 2017 24/55

Sharing a among applications K20 (5GB memory) LAMMPS: 876 MB mcuda-meme: 151 MB BarraCUDA: 3319 MB MUMmer: 2104 MB -LIBSVM: 145 MB The main concern for sharing a is the memory limitation HPC Advisory Council Perth Conference 2017 25/55

Characteristics missing in s How can we make an even better usage of s? Which characteristics do we miss from s? 1. Many s in a single box 2. Easily sharing a given (or s) HPC Advisory Council Perth Conference 2017 26/55

Easily sharing a among VMs A is assigned to a VM by using PCI passthrough Assignment is done exclusively to a single virtual machine. Concurrent usage of the is not possible HPC Advisory Council Perth Conference 2017 27/55

Easily sharing a among VMs High performance network available Low performance network available HPC Advisory Council Perth Conference 2017 28/55

Characteristics missing in s Which characteristics do we miss from s? 1. Many s in a single box 2. Easily sharing a given (or s) HPC Advisory Council Perth Conference 2017 29/55

Characteristics missing in s Which characteristics do we miss from s? 1. Many s in a single box 2. Easily sharing a given (or s) The remote virtualization technique can efficiently address these concerns HPC Advisory Council Perth Conference 2017 30/55

Characteristics missing in s node 1 node 2 node 3 node n Interconnection The remote virtualization technique can efficiently address these concerns HPC Advisory Council Perth Conference 2017 31/55

Characteristics missing in s Interconnection The remote virtualization technique can efficiently address these concerns HPC Advisory Council Perth Conference 2017 32/55

Characteristics missing in s Interconnection The remote virtualization technique can efficiently address these concerns HPC Advisory Council Perth Conference 2017 33/55

Characteristics missing in s Interconnection The remote virtualization technique can efficiently address these concerns HPC Advisory Council Perth Conference 2017 34/55

Remote virtualization What is remote virtualization? HPC Advisory Council Perth Conference 2017 35/55

Basics of computing Basic behavior of CUDA HPC Advisory Council Perth Conference 2017 36/55

Basics of computing HPC Advisory Council Perth Conference 2017 37/55

Remote virtualization No HPC Advisory Council Perth Conference 2017 38/55

rcuda remote CUDA A software technology that enables a more flexible use of s in computing facilities No rcuda is a development by Technical University of Valencia HPC Advisory Council Perth Conference 2017 39/55

Basics of rcuda rcuda is a development by Universitat Politècnica de València, Spain HPC Advisory Council Perth Conference 2017 40/55

Basics of rcuda rcuda is a development by Universitat Politècnica de València, Spain HPC Advisory Council Perth Conference 2017 41/55

Remote virtualization envision Remote virtualization allows a new vision of a deployment, moving from the usual cluster configuration: node 1 node 2 node 3 node n Physical configuration Interconnection to the following one: node 1 Logical connections node 2 node 3 node n Logical configuration Interconnection HPC Advisory Council Perth Conference 2017 42/55

Performance of rcuda Guy Kawasaki, marketing specialist and Silicon Valley venture capitalist Ideas Are Easy, Implementation Is Hard HPC Advisory Council Perth Conference 2017 43/55

Performance of rcuda to Higher is better to HPC Advisory Council Perth Conference 2017 44/55

Performance of rcuda to Higher is better to HPC Advisory Council Perth Conference 2017 45/55

Performance of rcuda to Higher is better to HPC Advisory Council Perth Conference 2017 46/55

Performance of rcuda CUDA rcuda rcuda scenario 1 rcuda scenario 2 HPC Advisory Council Perth Conference 2017 47/55

Performance of rcuda CUDA rcuda rcuda scenario 1 Ideas Are Easy, Implementation Is Hard rcuda scenario 2 Guy Kawasaki, marketing specialist and Silicon Valley venture capitalist Higher is better HPC Advisory Council Perth Conference 2017 48/55

Performance of applications using rcuda K40 s and EDR InfiniBand Lower is better MonteCarlo multi- program running in 10 NVIDIA Tesla K40 s HPC Advisory Council Perth Conference 2017 49/55

Performance of applications using rcuda 64 s!! HPC Advisory Council Perth Conference 2017 50/55

Performance of applications using rcuda K20 and FDR InfiniBand K40 and EDR InfiniBand Lower is better HPC Advisory Council Perth Conference 2017 51/55

Performance of applications using rcuda EDR InfiniBand and P100 Lower is better BarraCUDA CUDA-MEME Lower is better HPC Advisory Council Perth Conference 2017 52/55

Get a free copy of rcuda at http://www.rcuda.net More than 850 requests world wide @rcuda_ rcuda is a development by Technical University of Valencia HPC Advisory Council Perth Conference 2017 53/55

Tony Díaz Pablo Higueras Javier Prades Carlos Reaño Jaime Sierra Federico Silla rcuda is a development by Technical University of Valencia HPC Advisory Council Perth Conference 2017 54/55

Thanks! Questions? rcuda is a development by Technical University of Valencia HPC Advisory Council Perth Conference 2017 55/55