Tag Archives: Radeon Instinct

AMD Datacenter Leadership In 2020 & Beyond!

AMD Datacenter Leadership In 2020 & Beyond!

AMD Senior VP and General Manager Forrest Norrod just shared AMD’s datacenter leadership with EPYC and Radeon Instinct, and AMD’s datacenter roadmap beyond 2020!

 

Forrest Norrod : Senior VP + GM, AMD Datacenter + Embedded Solutions Business Group

Forrest Norrod is senior vice president and general manager of the Datacenter and Embedded Solutions Business Group at AMD.

He is responsible for managing all aspects of strategy, business management, engineering and sales for AMD datacenter and embedded products.

Norrod has more than 25 years of technology industry experience across a number of engineering and business management roles at both the chip and system level.

 

AMD Datacenter Leadership In 2020 & Beyond!

During AMD Financial Analyst Day 2020, Forrest Norrod shared AMD’s datacenter leadership with EPYC and Radeon Instinct, and AMD’s datancenter roadmap in this presentation.

Here are the key points from Forrest Norrod’s presentation :

  • AMD won the contract to power the recently announced El Capitan supercomputer at Lawrence Livermore National Laboratory with EPYC processors and Radeon Instinct GPUs.
  • Expected to come online in 2023, El Capitan is expected to deliver more than 2 exaFLOPs of double-precision performance, making it more powerful than today’s 200 fastest supercomputers combined.

  • AMD is continuing to gain traction with its 2nd Generation AMD EPYC processors in enterprise, cloud and HPC markets based on delivering performance leadership and TCO advantages across the most important enterprise and cloud workloads.
  • AMD EPYC is enabling Nokia to double the performance of their 5G Cloud Packet Core.
  • In 2020 AMD expects more than 150 AMD EPYC processor-powered cloud instances and 140 server platforms to be available.

  • AMD is introducing new technologies including AMD CDNA architecture, 3rd Generation Infinity Architecture and the ROCm 4.0 software platform, all of which will support the AMD-powered Frontier and El Capitan supercomputers.
  • AMD plans to ship the 3rd Gen AMD EPYC “Milan” processor in Late 2020, and it will provide 100% coverage of enterprise requirements – whether it’s for the cloud, HPC or enterprise IT.
  • Milan will remain on the 7 nm process, but the next-generation Genda core (Zen 4) will use the 5 nm process technology.

  • The AMD CDNA architecture will allow for better scalability, with accelerators fully interconnected with 2nd Gen Infinity Architecture.
  • But the next-generation AMD CDNA 2 architecture will allow for Unified Data, with CPU + GPU coherency with 3rd Gen Infinity Architecture – allowing for easier programming and improved performance.

 

Recommended Reading

Go Back To > Computer Hardware | Business | Home

Support Tech ARP!

If you like our work, you can help support our work by visiting our sponsors, participating in the Tech ARP Forums, or even donating to our fund. Any help you can render is greatly appreciated!


El Capitan Supercomputer : AMD Selected As Node Supplier!

It’s official – AMD has been selected as the node supplier for the El Capitan supercomputer, which is projected to be the world’s most powerful supercomputer when it is fully deployed!

 

El Capitan Supercomputer : A Quick Primer!

El Capitan is a supercomputer funded by the Advanced Simulation and Computing (ASC) program at the National Nuclear Security Administration (NNSA) from the Department of Energy.

When it is fully deployed in 2023, it will perform complex and increasingly predictive modelling and simulation for the NNSA’s Life Extension Programs (LEPs), which addresses nuclear weapon raging and emergent threat issues.

This will allow the United States to keep its nuclear stockpile safe, secure and reliable, in the absence of underground nuclear testing.

“This unprecedented computing capability, powered by advanced CPU and GPU technology from AMD, will sustain America’s position on the global stage in high-performance computing and provide an observable example of the commitment of the country to maintaining an unparalleled nuclear deterrent,” said LLNL Director Bill Goldstein.

“Today’s news provides a prime example of how government and industry can work together for the benefit of the entire nation.”

Besides supporting the nuclear stockpile, El Capitan will perform secondary US national security missions, including nuclear nonproliferation and counterterrorism.

NNSA laboratories – Lawrence Livermore, Los Alamos and Sandia national laboratories – are building machine learning and AI into computational techniques and analysis that will benefit NNSA’s primary missions and unclassified projects such as climate modelling and cancer research for DOE.

To that end, it will use a combination of CPUs and GPUs to exceed 2 exaFLOPS in performance – that’s two quintillion floating point operations per second. That will make it the world’s most powerful supercomputer!

 

El Capitan Supercomputer : AMD Selected As Node Supplier!

El Capitan will be powered by the next-generation AMD EPYC processors, codenamed Genoa and featuring the upcoming AMD Zen 4 processor cores, as well as the next-generation AMD Radeon Instinct GPUs based on a new compute-optimised architecture.

The nodes will run on the AMD Radeon Open Compute (ROCm) heterogenous computing platform, with most of their floating point computing power delivered by the Radeon Instinct GPUs.

Not only will the El Capitan nodes offer significantly greater per-node performance than any current system, they will also offer dramatically better energy efficiency.

El Capitan will also integrated advanced features that have not yet been widely deployed, including :

  • HPE Cray Slingshot interconnect network, which will enable large calculations across many nodes
  • new HPE optics technologies to deliver higher data transmission rates with better power efficiency and reliability
  • new Cray Shasta software platform, with a new container-based architecture

“El Capitan will drive unprecedented advancements in HPC and AI, powered by the next-generation AMD EPYC CPUs and Radeon Instinct GPUs,” said Forrest Norrod, senior vice president and general manager, Datacenter and Embedded Systems Group, AMD.

“Building on our strong foundation in high-performance computing and adding transformative coherency capabilities, AMD is enabling the NNSA Tri-Lab community — LLNL, Los Alamos and Sandia national laboratories — to achieve their mission-critical objectives and contribute new AI advancements to the industry.”

“We are extremely proud to continue our exascale work with HPE and NNSA and look forward to the delivery of the most powerful supercomputer in the world, expected in early 2023.”

 

Recommended Reading

Go Back To > Computer Hardware | Home

Support Tech ARP!

If you like our work, you can help support our work by visiting our sponsors, participating in the Tech ARP Forums, or even donating to our fund. Any help you can render is greatly appreciated!


The 32-Core AMD Naples CPU Tech Report

Just 3 months ago, we revealed to you the first AMD Naples + Radeon Instinct reference design. Yet, details were scarce. Now that AMD has finally launched their Ryzen 7 desktop processors, they are finally ready to reveal details of the AMD Naples CPU.

 

 

AMD Naples Revealed!

Like Ryzen, the AMD Naples CPU is based on their brand-new Zen microarchitecture. Designed for servers, the Naples CPU offers up to 32 processor cores, an integrated high-speed DDR4 memory controller and a new high-speed coherent interconnect.

These four slides summarises the Naples CPU’s key features :

  • 32 AMD Zen cores., each capable of handling 2 simultaneous threads.
  • Integrated DDR4 memory controller with 8 memory channels, each supporting two DIMMs. That’s a total of 16 DIMMs per processor socket, allowing for up to 2 TB of memory. A 2-socket server will support up to 32 DIMMs, with a total memory capacity of 4 TB.
  • The Naples CPU is fully integrated with high-speed I/O including 128 PCIe 3.0 lanes.
  • Because the Naples is a complete SoC, there is no need for a separate chipset, reducing cost and power consumption.

In a 2-socket system, two Naples CPUs will communicate with each other via the AMD Infinity Fabric coherent interconnect. Even though each Naples CPU technically supports 128 PCIe 3.0 lanes, it is limited to 64 PCIe 3.0 lanes in a 2-socket design.

A 2-socket Naples server would offer 64 processor cores that can handle 128 simultaneous cores, and up to 4 TB of DDR4 memory (32 x 128 GB DIMMs). Just how fast is such a server? Let’s find out…

Next Page > The AMD Naples CPU Performance, AMD Naples + Radeon Instinct

[adrotate banner=”5″]

 

Support Tech ARP!

If you like our work, you can help support our work by visiting our sponsors, participating in the Tech ARP Forums, or even donating to our fund. Any help you can render is greatly appreciated!

The AMD Naples CPU Performance

AMD benchmarked the performance of a 2-socket Naples server against a 2-socket Intel Xeon E5-2699A v4 server, with the following specifications. The workload was Seismic Analysis, involving iterations of 3D wave equations.

When they were compared with similar number of cores (44), the Naples CPU was almost twice as fast as the Intel Xeon E5-2699A v4 CPU.

But when they unleashed all 64-cores in that 2-socket Naples server, it is now 2.5X faster than the Intel Xeon E5-2699A v4.

In this final test, they tested both servers with a 4x larger dataset – 4 billion samples, instead of just 1 billion samples. Due to its limited memory channels, the Intel Xeon E5-2699A v4 is unable to support enough memory to load the dataset.

[adrotate banner=”5″]

 

AMD Naples + Radeon Instinct

Naples is more than just a server CPU to AMD. Its integrated support for 128 PCIe 3.0 lanes allows a single Naples CPU to support up to 4 AMD Radeon Instinct accelerators.

Be sure to check out their first AMD Naples + Radeon Instinct reference design.

Next Page > The AMD Naples Pricing & Availability, The Presentation Slides

 

Support Tech ARP!

If you like our work, you can help support our work by visiting our sponsors, participating in the Tech ARP Forums, or even donating to our fund. Any help you can render is greatly appreciated!

The AMD Naples Pricing & Availability

You may wonder why AMD compared the Naples CPU against the 22-core Intel Xeon E5-2699A v4, instead of a more powerful processor like the Intel Xeon E7-8893 v4. We suspect it is because they are going to price the Naples CPU against the Xeon E5-2699A v4, which costs US$ 4,938.

Imagine how well it will sell if it’s priced at the same or slightly lower price point than the Xeon E5-2699A v4, while offering 45% more cores, 60% more I/O capacity and 122% more memory bandwidth. If we have to guess, we would think that AMD will finally price the Naples CPU at around $4,000 – a 20% discount on the Xeon E5-2699A v4.

More importantly, AMD appears to be on track for a Q2 2017 launch for the Naples CPU. We will keep you updated!

[adrotate group=”1″]

 

The AMD Naples Presentation Slides

Here is the complete set of slides used by Forrest Norrod, SVP and GM of the AMD Enterprise, Embedded and Semi-Custom Business Group.

Go Back To > First PageArticles | Home

 

Support Tech ARP!

If you like our work, you can help support our work by visiting our sponsors, participating in the Tech ARP Forums, or even donating to our fund. Any help you can render is greatly appreciated!

The Complete AMD Radeon Instinct Tech Briefing Rev. 3.0

The AMD Tech Summit held in Sonoma, California from December 7-9, 2016 was not only very exclusive, it was highly secretive. The first major announcement we have been allowed to reveal is the new AMD Radeon Instinct heterogenous computing platform.

In this article, you will hear from AMD what the Radeon Instinct platform is all about. As usual, we have a ton of videos from the event, so it will be as if you were there with us. Enjoy! 🙂

Originally published @ 2016-12-12

Updated @ 2017-01-11 : Two of the videos were edited to comply with the NDA. Now that the NDA on AMD Vega has been lifted, we replaced the two videos with their full, unedited versions. We also made other changes, including adding links to the other AMD Tech Summit articles.

Updated @ 2017-01-20 : Replaced an incorrect slide, and a video featuring that slide. Made other small updates to the article.

 

The AMD Radeon Instinct Platform Summarised

For those who want the quick low-down on AMD Radeon Instinct, here are the key takeaway points :

  • The AMD Radeon Instinct platform is made up of two components – hardware and software.
  • The hardware components are the AMD Radeon Instinct accelerators built around the current Polaris and the upcoming Vega GPUs.
  • The software component is the AMD Radeon Open Compute (ROCm) platform, which includes the new MIOpen open-source deep learning library.
  • The first three Radeon Instinct accelerator cards are the MI6, MI8 and MI25 Vega with NCU.
  • The AMD Radeon Instinct MI6 is a passively-cooled inference accelerator with 5.7 TFLOPS of FP16 processing power, 224 GB/s of memory bandwidth, and a TDP of <150 W. It will come with 16 GB of GDDR5 memory.
  • The AMD Radeon Instinct MI8 is a small form-factor (SFF) accelerator with 8.2 TFLOPS of processing power, 512 GB/s of memory bandwidth, and a TDP of <175 W. It will come with 4 GB of HBM memory.
  • The AMD Radeon Instinct MI25 Vega with NCU is a passively-cooled training accelerator with 25 TFLOPS of processing power, support for 2X packed math, a High Bandwidth Cache and Controller, and a TDP of <300 W.
  • The Radeon Instinct accelerators will all be built exclusively by AMD.
  • The Radeon Instinct accelerators will all support MxGPU SRIOV hardware virtualisation.
  • The Radeon Instinct accelerators are all passively cooled.
  • The Radeon Instinct accelerators will all have large BAR (Base Address Register) support for multiple GPUs.
  • The upcoming AMD Zen “Naples” server platform is designed to supported multiple Radeon Instinct accelerators through a high-speed network fabric.
  • The ROCm platform is not only open source, it will support a multitude of standards in addition to MIOpen.
  • The MIOpen deep learning library is open source, and will be available in Q1 2017.
  • The MIOpen deep learning library is optimised for Radeon Instinct, allowing for 3X better performance in machine learning.
  • AMD Radeon Instinct accelerators will be significantly faster than NVIDIA Titan X GPUs based on the Maxwell and Pascal architectures.

In the subsequent pages, we will give you the full low-down on the Radeon Instinct platform, with the following presentations by AMD :

[adrotate banner=”4″]

We also prepared the complete video and slides of the Radeon Instinct tech briefing for your perusal :

Next Page > Heterogenous Computing, The Radeon Instinct Accelerators, MIOpen, Performance

 

Support Tech ARP!

If you like our work, you can help support our work by visiting our sponsors, participating in the Tech ARP Forums, or even donating to our fund. Any help you can render is greatly appreciated!

Why Is Heterogenous Computing Important?

Dr. Lisa Su, kicked things off with an inside look at her two-year long journey as AMD President and CEO. Then she revealed why Heterogenous Computing is an important part of AMD’s future going forward. She also mentioned the success of the recently-released Radeon Software Crimson ReLive Edition.

 

Here Are The New AMD Radeon Instinct Accelerators!

Next, Raja Koduri, Senior Vice President and Chief Architect of the Radeon Technologies Group, officially revealed the new AMD Radeon Instinct accelerators.

 

The MIOpen Deep Learning Library For Radeon Instinct

MIOpen is a new deep learning library optimised for Radeon Instinct. It is open source and will become part of the Radeon Open Compute (ROCm) platform. It will be available in Q1 2017.

[adrotate banner=”5″]

 

The Performance Advantage Of Radeon Instinct & MIOpen

MIOpen is optimised for Radeon Instinct, offering 3X better performance in machine learning. It allows the Radeon Instinct accelerators to be significantly faster than NVIDIA Titan X GPUs based on the Maxwell and Pascal architectures.

Next Page > Radeon Instinct MI25 & MI8 Demos, Zen “Naples” Platform, The First Servers, ROCm Discussion

 

Support Tech ARP!

If you like our work, you can help support our work by visiting our sponsors, participating in the Tech ARP Forums, or even donating to our fund. Any help you can render is greatly appreciated!

The Radeon Instinct MI25 Training Demonstration

Raja Koduri roped in Ben Sander, Senior Fellow at AMD, to show off the Radeon Instinct MI25 running a training demo.

 

The Radeon Instinct MI8 Visual Inference Demonstration

The visual inference demo is probably much easier to grasp, as it is visual in nature. AMD used the Radeon Instinct MI8 in this example.

 

The Radeon Instinct On The Zen “Naples” Platform

The upcoming AMD Zen “Naples” server platform is designed to supported multiple AMD Radeon Instinct accelerators through a high-speed network fabric.

[adrotate banner=”5″]

 

The First Radeon Instinct Servers

This is not a vapourware launch. Raja Koduri revealed the first slew of Radeon Instinct servers that will hit the market in H1 2017.

 

The Radeon Open Compute (ROCm) Platform Discussion

To illustrate the importance of heterogenous computing on Radeon Instinct, Greg Stoner (ROCm Senior Director at AMD), hosted a panel of AMD partners and early adopters in using the Radeon Open Compute (ROCm) platform.

Next Page > Closing Remarks On Radeon Instinct, The Complete Radeon Instinct Tech Briefing Video & Slides

 

Support Tech ARP!

If you like our work, you can help support our work by visiting our sponsors, participating in the Tech ARP Forums, or even donating to our fund. Any help you can render is greatly appreciated!

Closing Remarks On Radeon Instinct

Finally, Raja Koduri concluded the launch of the Radeon Instinct Initiative with some closing remarks on the recent Radeon Software Crimson ReLive Edition.

 

The Complete AMD Radeon Instinct Tech Briefing

This is the complete AMD Radeon Instinct tech briefing. Our earlier video was edited to comply with the AMD Vega NDA (which has now expired).

[adrotate banner=”5″]

 

The Complete AMD Radeon Instinct Tech Briefing Slides

Here are the Radeon Instinct presentation slides for your perusal.

 

Support Tech ARP!

If you like our work, you can help support our work by visiting our sponsors, participating in the Tech ARP Forums, or even donating to our fund. Any help you can render is greatly appreciated!

The First AMD Naples + Radeon Instinct Reference Design Revealed!

We had earlier revealed the upcoming AMD Ryzen desktop processor, and the AMD Radeon Instinct compute accelerators. But that was not all they revealed at the AMD Tech Summit held in Sonoma last month. AMD also showcased the world’s first AMD Naples and Radeon Instinct reference design!

This is a compact 2U (19″) server with two AMD Naples processors, and two Radeon Instinct accelerators. Check it out!

 

What Is AMD Naples?

Formerly known as Summit Ridge, the AMD Ryzen is an octa-core desktop processor that promises to match, if not beat, the Intel Core i7-6900K in both performance and power consumption. Like Ryzen, the AMD Naples processor is based on the AMD Zen microarchitecture. But instead of having just 8 cores, AMD Naples is a server-class processor that boasts 32 cores!

Like Intel Xeon processors, the AMD Naples processor can process two threads per core. So the 32-core AMD Naples processor can handle up to 64 threads simultaneously. It will also reportedly feature a massive 512 MB L3 cache.

Compared that to the top-of-the-line Intel Xeon E7-8890 v4 processor that only has 24 cores and handles up to 48 threads simultaneously, and only has a 60 MB L3 cache.

If the AMD Naples processor delivers the same performance and power consumption we saw with AMD Ryzen, then it should deliver at least 33% better compute performance than the Intel Xeon E7-8890 v4 processor, and blow it out of the water in memory performance. Now that will be a killer server CPU.

[adrotate group=”1″]

 

Compute Boost With Radeon Instinct

If the potentially superior AMD Naples compute performance does not impress you, the reference design also showcased the new AMD Radeon Instinct accelerators. Their 2U server reference design featured what appears to be two AMD Radeon Instinct MI8 SFF accelerators.

The Radeon Instinct MI8 is a passively-cooled accelerator that requires only 175 W of power. Each MI8 accelerator delivers 8.2 TFLOPS of FP16 compute performance with 512 GB/s of memory bandwidth. So two of them will offer over 16 TFLOPs of FP16 compute performance in this 2U server.

Go Back To > Enterprise | Home

 

Support Tech ARP!

If you like our work, you can help support our work by visiting our sponsors, participating in the Tech ARP Forums, or even donating to our fund. Any help you can render is greatly appreciated!

The First AMD Radeon Instinct Servers Revealed!

When AMD launched Radeon Instinct at the 2016 AMD Tech Summit in Sonoma earlier this month, they showed off several servers that will be powered by the new Radeon Instinct accelerators. These Radeon Instinct servers can now deliver up to 3 petaflops (3,000 TFLOPS) of FP16 compute performance using those Radeon Instinct accelerators.

Most of the performance boost comes from the combination of the new Vega GPU architecture, which allows for 2X packed FP16 math ops; and the new AMD MIOpen deep learning library.

After the launch event, we were given the opportunity to look inside two of these servers – the Supermicro 1028GQ-TRT and the Invented K888 G3. Both of these servers will ship with multiple Radeon Instinct MI25 Vega with NCU accelerators, allowing them to deliver up to 100 TFLOPS of FP16 compute performance.

We also had a look at the Falconwitch PS1816 server which can host a whopping 16 Radeon Instinct MI25 Vega with NCU accelerators to deliver 300 teraflops of FP16 compute performance!

 

The Supermicro 1028GQ-TRT

This is the server Ben Sander used to demonstrate the training capability of the Radeon Instinct MI25 accelerator in the 2016 AMD Tech Summit.

The Supermicro 1028GQ-TRT is a 1U server that fits up to 3 Radeon Instinct MI25 Vega with NCU accelerators. That allows it to deliver up to 75 teraflops of FP16 compute performance.

Multiple servers can be combined to increase compute performance. In his demo, Ben Sander used two of these Supermicro servers to obtain 150 teraflops of computing performance.

 

The Inventec K888 G3

The Inventec K888 G3 is a 2U, 2-processor server that fits up to 4 Radeon Instinct MI25 Vega with NCU accelerators. This allows it to deliver up to 100 teraflops of FP16 compute performance.

In this example, the Inventec K888 is powered by four FirePro S9300 X2 cards instead. Each of these FirePro S9300 X2 cards deliver slightly more FP16 compute performance than the Radeon Instinct MI25 Vega, [adrotate banner=”5″]

 

The Falconwitch PS1816

The Falconwitch PS1816 is a 2U, 24-bay server that boasts a total of 288 PCIe lanes. This allows it to support up to sixteen Radeon Instinct MI25 Vega with NCU accelerators to deliver 400 teraflops of FP16 compute performance.

If that’s not enough, there is an Inventec Radeon Instinct 42U rack that features six of these Falconwitch PS1816 servers and an additional four Radeon Instinct MI25 Vega with NCU accelerators. That is a total of 120 Radeon Instinct MI25 accelerators, delivering 3,000 teraflops or 3 petaflops of FP16 compute performance! This is literally, the mother of all Radeon Instinct servers!

 

Raja Koduri Introducing The First Radeon Instinct Servers

For those who missed our complete coverage of Radeon Instinct, here is the video of Radeon Technologies Group Senior Vice President and Chief Architect, Raja Koduri introducing the first Radeon Instinct servers.

For more information on the Radeon Instinct accelerators, and MIOpen deep learning library, please take a look at our article – The Complete AMD Radeon Instinct Tech Briefing!

 

Support Tech ARP!

If you like our work, you can help support our work by visiting our sponsors, participating in the Tech ARP Forums, or even donating to our fund. Any help you can render is greatly appreciated!