Disrupt The Datacenter With Orchestration – The Next Platform
Posted: April 2, 2020 at 7:49 am
Since 1965, the computer industry has relied on Moores Law to accelerate innovation, pushing more transistors into integrated circuits to improve computation performance. Making transistors smaller helped lift all boats for the entire industry and enable new applications. At some point, we will reach a physical limit that is, a limit stemming from physics itself. Even with this setback, improvements kept on pace thanks to increased parallelism of computation and consolidation of specialized functions into single chip packages, such as systems on chip).
In recent years, we are nearing another peak. This article proposes to improve computation performance not only by building better hardware, but by changing how we use existing hardware. More specifically, the focusing on how we use existing processor types. I call this approach Compute Orchestration: automatic optimization of machine code to best use the modern datacenter hardware (again, with special emphasis on different processor types).
So what is compute orchestration? It is the embracing of hardware diversity to support software.
There are many types of processors: Microprocessors in small devices, general purpose CPUs in computers and servers, GPUs for graphics and compute, and programmable hardware like FPGAs. In recent years, specialized processors like TPUs and neuromorphic processors for machine learning are rapidly entering the datacenter.
There is potential in this variety: Instead of statically utilizing each processor for pre-defined functions, we can use existing processors as a swarm, each processor working on the most suitable workloads. Doing that, we can potentially deliver more computation bandwidth with less power, lower latency and lower total cost of ownership).
Non-standard utilization of existing processors is already happening: GPUs, for example, were already adapted from processors dedicated to graphics into a core enterprise component. Today, GPUs are used for machine learning and cryptocurrency mining, for example.
I call the technology to utilize the processors as a swarm Compute Orchestration. Its tenets can be described in four simple bullets:
Compute orchestration is, in short, automatic adaptation of binary code and automatic allocation to the most suitable processor types available. I split the evolution of compute orchestration into four generations:
Compute Orchestration Gen 1: Static Allocation To Specialized Co-Processors
This type of compute orchestration is everywhere. Most devices today include co-processors to offload some specialized work from the CPU. Usually, the toolchain or runtime environment takes care of assigning workloads to the co-processor. This is seamless to the developer, but also limited in functionality.
Best known example is the use of cryptographic co-processors for relevant functions. Being liberal in our definitions of co-processor, Memory Management Units (MMUs) to manage virtual memory address translation can also be considered an example.
Compute Orchestration Gen 2: Static Allocation, Heterogeneous Hardware
This is where we are at now. In the second generation, the software relies on libraries, dedicated run time environments and VMs to best use the available hardware. Lets call the collection of components that help better use the hardware frameworks. Current frameworks implement specific code to better use specific processors. Most prevalent are frameworks that know how to utilize GPUs in the cloud. Usually, better allocation to bare metal hosts remains the responsibility of the developer. For example, the developer/DevOps engineer needs to make sure a machine with GPU is available for the relevant microservice. This phenomenon is what brought me to think of Compute Orchestration in the first place, as it proves there is more slack in our current hardware.
Common frameworks like OpenCL allow programming compute kernels to run on different processors. TensorFlow allows assigning nodes in a computation graph to different processors (devices).
This better use of hardware by using existing frameworks is great. However, I believe there is a bigger edge. Existing frameworks still require effort from the developer to be optimal they rely on the developer. Also, no legacy code from 2016 (for example) is ever going to utilize a modern datacenter GPU cluster. My view is that by developing automated and dynamic frameworks, that adapt to the hardware and workload, we can achieve another leap.
Compute Orchestration Gen 3: Dynamic Allocation To Heterogeneous Hardware
Computation can take an example from the storage industry: Products for better utilization and reliability of storage hardware have innovated for years. Storage startups develop abstraction layers and special filesystems that improve efficiency and reliability of existing storage hardware. Computation, on the other hand, remains a stupid allocation of hardware resources. Smart allocation of computation workloads to hardware could result in better performance and efficiency for big data centers (for example hyperscalers like cloud providers). The infrastructure for such allocation is here, with current data center designs pushing to more resource disaggregation, introduction of diverse accelerators, and increased work on automatic acceleration (for example: Workload-aware Automatic Parallelization for Multi-GPU DNN Training).
For high level resource management, we already have automatic allocation. For example, project Mesos (paper) focusing on fine-grained resource sharing, Slurm for cluster management, and several extensions using Kubernetes operators.
To further advance from here would require two steps: automatic mapping of available processors (which we call the compute environment) and workload adaptation. Imagine a situation where the developer doesnt have to optimize her code to the hardware. Rather, the runtime environment identifies the available processing hardware and automatically optimizes the code. Cloud environments are heterogeneous and changing, and the code should change accordingly (in fact its not the code, but the execution model in the run time environment of the machine code).
Compute Orchestration Gen 4: Automatic Allocation To Dynamic Hardware
A thought, even a possibility, can shatter and transform us. Friedrich Wilhelm Nietzsche
The quote above is to say that there we are far from practical implementation of the concept described here (as far as I know). We can, however, imagine a technology that dynamically re-designs a data center to serve needs of running applications. This change in the way whole data centers meet computation needs as already started. FGPAs are used more often and appear in new places (FPGAs in hosts, FPGA machines in AWS, SmartNICs), providing the framework for constant reconfiguration of hardware.
To illustrate the idea, I will use an example: Microsoft initiated project Catapult, augmenting CPUs with an interconnected and configurable compute layer composed of programmable silicon. The timeline in the projects website is fascinating. The project started off in 2010, aiming to improve search queries by using FPGAs. Quickly, it proposed the use of FPGAs as bumps in the wire, adding computation in new areas of the data path. Project Catapult also designed an architecture for using FPGAs as a distributed resource pool serving all the data center. Then, the project spun off Project BrainWave, utilizing FPGAs for accelerating AI/ML workloads.
This was just an example of innovation in how we compute. Quick online search will bring up several academic works on the topic. All we need to reach the 4th generation is some idea synthesis, combining a few concepts together:
Low effort HDL generation (for example Merlin compiler, BORPH)
In essence, what I am proposing is to optimize computation by adding an abstraction layer that:
Automatic allocation on agile hardware is the recipe for best utilizing existing resources: faster, greener, cheaper.
The trends and ideas mentioned in this article can lead to many places. It is very likely, that we are already working with existing hardware in the optimal way. It is my belief that we are in the midst of the improvement curve. In recent years, we had increased innovation in basic hardware building blocks, new processors for example, but we still have room to improve in overall allocation and utilization. The more we deploy new processors in the field, the more slack we have in our hardware stack. New concepts, like edge computing and resource disaggregation, bring new opportunities for optimizing legacy code by smarter execution. To achieve that, legacy code cant be expected to be refactored. Developers and DevOps engineers cant be expected to optimize for the cloud configuration. We just need to execute code in a smarter way and that is the essence of compute orchestration.
The conceptual framework described in this article should be further explored. We first need to find the killer app (what type of software we optimize to which type of hardware). From there, we can generalize. I was recently asked in a round table what is the next generation of computation? Quantum computing? Tensor Processor Units? I responded that all of the above, but what we really need is better usage of the existing generation.
Guy Harpak is the head of technology at Mercedes-Benz Research & Devcelopment in its Tel Aviv, Israel facility. Please feel free to contact him on any thoughts on the topics above at harpakguy@gmail.com. Harpak notes that this contributed article reflects his personal opinion and is in no way related to people or companies that he works with or for.
Related Reading: If you find this article interesting, I would recommend researching the following topics:
Some interesting articles on similar topics:
Return Of The Runtimes: Rethinking The Language Runtime System For The Cloud 3.0 Era
The Deep Learning Revolution And Its Implications For Computer Architecture And Chip Design (by Jeffrey Dean from Google Research)
Beyond SmartNICs: Towards A Fully Programmable Cloud
Hyperscale Cloud: Reimagining Datacenters From Hardware To Applications
Read more from the original source:
Disrupt The Datacenter With Orchestration - The Next Platform
- Why Move Fast and Break Things Doesn't Work Anymore - Harvard Business Review [Last Updated On: December 11th, 2019] [Originally Added On: December 11th, 2019]
- Security leaders fear that quantum computing developments will outpace security technologies - Continuity Central [Last Updated On: December 11th, 2019] [Originally Added On: December 11th, 2019]
- Inside the weird, wild, and wondrous world of quantum video games - Digital Trends [Last Updated On: December 11th, 2019] [Originally Added On: December 11th, 2019]
- This Week in Tech: What on Earth Is a Quantum Computer? - The New York Times [Last Updated On: December 11th, 2019] [Originally Added On: December 11th, 2019]
- Charles Hoskinson Predicts Economic Collapse, Rise of Quantum Computing, Space Travel and Cryptocurrency in the 2020s - The Daily Hodl [Last Updated On: January 12th, 2020] [Originally Added On: January 12th, 2020]
- Jeffrey Epstein scandal: MIT professor put on leave, he 'failed to inform' college that sex offender made donations - CNBC [Last Updated On: January 12th, 2020] [Originally Added On: January 12th, 2020]
- Were approaching the limits of computer power we need new programmers now - The Guardian [Last Updated On: January 12th, 2020] [Originally Added On: January 12th, 2020]
- Is Quantum Technology The Future Of The World? - The Coin Republic [Last Updated On: January 12th, 2020] [Originally Added On: January 12th, 2020]
- Google and IBM square off in Schrodingers catfight over quantum supremacy - The Register [Last Updated On: January 12th, 2020] [Originally Added On: January 12th, 2020]
- CES 2020: IBM and Daimler teaming up for a quantum leap in battery tech - CNET [Last Updated On: January 12th, 2020] [Originally Added On: January 12th, 2020]
- 5G, AI and Quantum Computing: Who Knows Where It Will All Lead? - Planet Vending [Last Updated On: January 27th, 2020] [Originally Added On: January 27th, 2020]
- University of Sheffield launches Quantum centre to develop the technologies of tomorrow - Quantaneo, the Quantum Computing Source [Last Updated On: January 27th, 2020] [Originally Added On: January 27th, 2020]
- Quantum networking projected to be $5.5 billion market in 2025 - TechRepublic [Last Updated On: January 27th, 2020] [Originally Added On: January 27th, 2020]
- Delta Partners with IBM to Explore Quantum Computing - Database Trends and Applications [Last Updated On: January 27th, 2020] [Originally Added On: January 27th, 2020]
- The End Of The Digital Revolution Is Coming: Here's What's Next - Innovation Excellence [Last Updated On: January 27th, 2020] [Originally Added On: January 27th, 2020]
- What Is Quantum Computing, And How Can It Unlock Value For Businesses? - Computer Business Review [Last Updated On: January 27th, 2020] [Originally Added On: January 27th, 2020]
- TensorFlow gets its quantum of solace, lid lifted on 'all-seeing crime-detecting' AI upstart, and more - The Register [Last Updated On: March 17th, 2020] [Originally Added On: March 17th, 2020]
- Career navigation Be at the core or be at the edge - The Financial Express BD [Last Updated On: March 19th, 2020] [Originally Added On: March 19th, 2020]
- Work from home: Improve your security with MFA - We Live Security [Last Updated On: March 19th, 2020] [Originally Added On: March 19th, 2020]
- Quantum Computing for Everyone - The Startup - Medium [Last Updated On: March 19th, 2020] [Originally Added On: March 19th, 2020]
- Quantum computing is right around the corner, but cooling is a problem. What are the options? - Diginomica [Last Updated On: March 19th, 2020] [Originally Added On: March 19th, 2020]
- Quantum Computing: Will It Actually Produce Jobs? - Dice Insights [Last Updated On: March 19th, 2020] [Originally Added On: March 19th, 2020]
- Q-CTRL to Host Live Demos of 'Quantum Control' Tools - Quantaneo, the Quantum Computing Source [Last Updated On: April 2nd, 2020] [Originally Added On: April 2nd, 2020]
- We're Getting Closer to the Quantum Internet, But What Is It? - HowStuffWorks [Last Updated On: April 2nd, 2020] [Originally Added On: April 2nd, 2020]
- D-Wave makes its quantum computers free to anyone working on the coronavirus crisis - VentureBeat [Last Updated On: April 2nd, 2020] [Originally Added On: April 2nd, 2020]
- Making Sense of the Science and Philosophy of Devs - The Ringer [Last Updated On: April 16th, 2020] [Originally Added On: April 16th, 2020]
- Alex Garland on 'Devs,' free will and quantum computing - Engadget [Last Updated On: April 16th, 2020] [Originally Added On: April 16th, 2020]
- COVID-19: Quantum computing could someday find cures for coronaviruses and other diseases - TechRepublic [Last Updated On: April 16th, 2020] [Originally Added On: April 16th, 2020]
- Calling On AI And Quantum Computing To Fight The Coronavirus - Forbes [Last Updated On: April 16th, 2020] [Originally Added On: April 16th, 2020]
- Quantum computer chips demonstrated at the highest temperatures ever - New Scientist News [Last Updated On: April 16th, 2020] [Originally Added On: April 16th, 2020]
- New Princeton study takes superconductivity to the edge - Princeton University [Last Updated On: May 2nd, 2020] [Originally Added On: May 2nd, 2020]
- Devs: Here's the real science behind the quantum computing TV show - New Scientist News [Last Updated On: May 2nd, 2020] [Originally Added On: May 2nd, 2020]
- Online course trains students in the bizarre world of quantum computing - Livescience.com [Last Updated On: May 2nd, 2020] [Originally Added On: May 2nd, 2020]
- Between God and Science in the Surreal Silicon Valley of Devs - The Nation [Last Updated On: May 12th, 2020] [Originally Added On: May 12th, 2020]
- Kerry Emanuel, David Sabatini, and Peter Shor receive BBVA Frontiers of Knowledge awards - MIT News [Last Updated On: May 12th, 2020] [Originally Added On: May 12th, 2020]
- Recent Research Answers the Future of Quantum Machine Learning on COVID-19 - Analytics Insight [Last Updated On: May 12th, 2020] [Originally Added On: May 12th, 2020]
- David Graves to Head New Research at PPPL for Plasma Applications in Industry and Quantum Information Science - HPCwire [Last Updated On: May 12th, 2020] [Originally Added On: May 12th, 2020]
- IonQ CEO Peter Chapman on how quantum computing will change the future of AI - VentureBeat [Last Updated On: May 12th, 2020] [Originally Added On: May 12th, 2020]
- VTT to acquire Finland's first quantum computer seeking to bolster Finland's and Europe's competitiveness - Quantaneo, the Quantum Computing Source [Last Updated On: May 12th, 2020] [Originally Added On: May 12th, 2020]
- Light, fantastic: the path ahead for faster, smaller computer processors - News - The University of Sydney [Last Updated On: May 17th, 2020] [Originally Added On: May 17th, 2020]
- Registration Open for Inaugural IEEE International Conference on Quantum Computing and Engineering - HPCwire [Last Updated On: May 17th, 2020] [Originally Added On: May 17th, 2020]
- Video: The Future of Quantum Computing with IBM - insideHPC [Last Updated On: May 17th, 2020] [Originally Added On: May 17th, 2020]
- Quantum computing analytics: Put this on your IT roadmap - TechRepublic [Last Updated On: May 17th, 2020] [Originally Added On: May 17th, 2020]
- Quantum computing will (eventually) help us discover vaccines in days - VentureBeat [Last Updated On: May 17th, 2020] [Originally Added On: May 17th, 2020]
- IBM Z mainframes revived by Red Hat, AI and security - TechTarget [Last Updated On: August 14th, 2020] [Originally Added On: August 14th, 2020]
- Toshiba Exits PC Business 35 Years of IBM Compatible PCs - Electropages [Last Updated On: August 14th, 2020] [Originally Added On: August 14th, 2020]
- 6 new degrees approved, including graduate degrees in biostatistics and quantum information science: News at IU - IU Newsroom [Last Updated On: August 14th, 2020] [Originally Added On: August 14th, 2020]
- The race to building a fully functional quantum stack - TechCrunch [Last Updated On: August 14th, 2020] [Originally Added On: August 14th, 2020]
- IEEE International Conference on Quantum Computing and Engineering (QCE20) Transitions to All-Virtual Event - PRNewswire [Last Updated On: August 14th, 2020] [Originally Added On: August 14th, 2020]
- Major quantum computational breakthrough is shaking up physics and maths - The Conversation UK [Last Updated On: August 14th, 2020] [Originally Added On: August 14th, 2020]
- Quantum mechanics is immune to the butterfly effect - The Economist [Last Updated On: August 14th, 2020] [Originally Added On: August 14th, 2020]
- Quantum Computing for the Next Generation of Computer Scientists and Researchers - Campus Technology [Last Updated On: August 14th, 2020] [Originally Added On: August 14th, 2020]
- Honeywell Wants To Show What Quantum Computing Can Do For The World - Forbes [Last Updated On: August 14th, 2020] [Originally Added On: August 14th, 2020]
- I confess, I'm scared of the next generation of supercomputers - TechRadar [Last Updated On: September 1st, 2020] [Originally Added On: September 1st, 2020]
- This Equation Calculates The Chances We Live In A Computer Simulation - Discover Magazine [Last Updated On: September 1st, 2020] [Originally Added On: September 1st, 2020]
- Q-NEXT collaboration awarded National Quantum Initiative funding - University of Wisconsin-Madison [Last Updated On: September 1st, 2020] [Originally Added On: September 1st, 2020]
- Quantum Cryptography Market Research Analysis Including Growth Factors, Types And Application By Regions From 2024 - Kentucky Journal 24 [Last Updated On: September 1st, 2020] [Originally Added On: September 1st, 2020]
- Researchers Found Another Impediment for Quantum Computers to Overcome - Dual Dove [Last Updated On: September 1st, 2020] [Originally Added On: September 1st, 2020]
- The future of artificial intelligence and quantum computing - Military & Aerospace Electronics [Last Updated On: September 1st, 2020] [Originally Added On: September 1st, 2020]
- Fermilab to lead $115 million National Quantum Information Science Research Center to build revolutionary quantum computer with Rigetti Computing,... [Last Updated On: September 1st, 2020] [Originally Added On: September 1st, 2020]
- Could Quantum Computing Progress Be Halted by Background Radiation? - Singularity Hub [Last Updated On: September 1st, 2020] [Originally Added On: September 1st, 2020]
- IBM plans to build a 1121 qubit system. What does this technology mean? - The Hindu [Last Updated On: September 26th, 2020] [Originally Added On: September 26th, 2020]
- Inaugural OSA Quantum 2.0 Conference Featured Talks on Emerging Technologies - Novus Light Technologies Today [Last Updated On: September 26th, 2020] [Originally Added On: September 26th, 2020]
- IBM, Alphabet and well-funded startups in the race for quantum supremacy - IT Brief Australia [Last Updated On: September 26th, 2020] [Originally Added On: September 26th, 2020]
- IBM Partners With HBCUs to Diversify Quantum Computing Workforce - Diverse: Issues in Higher Education [Last Updated On: September 26th, 2020] [Originally Added On: September 26th, 2020]
- Baidu offers quantum computing from the cloud - VentureBeat [Last Updated On: September 26th, 2020] [Originally Added On: September 26th, 2020]
- oneAPI Academic Center of Excellence Established at the Heidelberg University Computing Center (URZ) - HPCwire [Last Updated On: October 3rd, 2020] [Originally Added On: October 3rd, 2020]
- Berkeley Lab Technologies Honored With 7 R&D 100 Awards - Lawrence Berkeley National Laboratory [Last Updated On: October 3rd, 2020] [Originally Added On: October 3rd, 2020]
- Global QC Market Projected to Grow to More Than $800 million by 2024 - HPCwire [Last Updated On: October 3rd, 2020] [Originally Added On: October 3rd, 2020]
- Schrdingers Web offers a sneak peek at the quantum internet - Science News [Last Updated On: October 3rd, 2020] [Originally Added On: October 3rd, 2020]
- ESAs -Week: Digital Twin Earth, Quantum Computing and AI Take Center Stage - SciTechDaily [Last Updated On: October 3rd, 2020] [Originally Added On: October 3rd, 2020]
- A new claimant for "most powerful quantum computer" - Axios [Last Updated On: October 3rd, 2020] [Originally Added On: October 3rd, 2020]
- SC20 Invited Speakers Tackle Challenges for the Earth, Its Inhabitants, and Our Security Using 'More Than HPC' - HPCwire [Last Updated On: October 8th, 2020] [Originally Added On: October 8th, 2020]
- Google's Billion Dollar News, Commercial Quantum Computers And More In This Week's Top News - Analytics India Magazine [Last Updated On: October 8th, 2020] [Originally Added On: October 8th, 2020]
- Canadian quantum computing firms partner to spread the technology - IT World Canada [Last Updated On: October 8th, 2020] [Originally Added On: October 8th, 2020]
- Quantum computing: Photon startup lights up the future of computers and cryptography - ZDNet [Last Updated On: October 8th, 2020] [Originally Added On: October 8th, 2020]
- Race for quantum supremacy gathers momentum with several companies joining bandwagon, says GlobalData - Quantaneo, the Quantum Computing Source [Last Updated On: October 11th, 2020] [Originally Added On: October 11th, 2020]
- 4 Reasons Why Now Is the Best Time to Start With Quantum Computing - Medium [Last Updated On: October 11th, 2020] [Originally Added On: October 11th, 2020]
- What is Quantum Computing, and How does it Help Us? - Analytics Insight [Last Updated On: October 11th, 2020] [Originally Added On: October 11th, 2020]
- 01 Communique to Present at the Benzinga Global Small Cap Conference on December 8 - IT News Online [Last Updated On: November 30th, 2020] [Originally Added On: November 30th, 2020]