Tag Archives: AMD Radeon Instinct MI25

The AMD EPYC 7000 Series Processor Tech Report

AMD just launched the EPYC 7000 series processors. Formerly known as AMD Naples, they boast up to 32 AMD Zen processor cores and challenge the Intel Xeon’s dominance in the datacenter. After a sneak peek at their Computex 2017 press conference, AMD has finally revealed the full details. Let’s take a look!

 

The AMD EPYC 7000 Series Processor

Like Ryzen, the EPYC processor is based on their new Zen microarchitecture. Designed for servers, the EPYC processor offers up to 32 processor cores, an integrated high-speed DDR4 memory controller and a new high-speed coherent interconnect. AMD now reveals that it also comes with an embedded security subsystem.

Here is a summary of the EPYC processor’s key features :

  • Supports up to 32 AMD Zen cores, each capable of handling 2 simultaneous threads.
  • Integrated DDR4 memory controller with 8 memory channels, each supporting two DIMMs. That’s a total of 16 DIMMs per processor socket, allowing for up to 2 TB of memory. A 2-socket server will support up to 32 DIMMs, with a total memory capacity of 4 TB.
  • The EPYC processor is fully integrated with high-speed I/O including 128 PCIe 3.0 lanes.
  • Because the EPYC is a complete SoC, there is no need for a separate chipset, reducing cost and power consumption.
  • In a 2-socket system, two EPYC processors communicate with each other via the AMD Infinity Fabric coherent interconnect. Even though each EPYC processor has 128 PCIe 3.0 lanes, it is limited to 64 PCIe 3.0 lanes in a 2-socket design. The 2P EPYC system will therefore have a total of 128 PCIe 3.0 lanes, not 256.
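The memory and PCIe figures in the list above follow from simple per-socket arithmetic. Here is a minimal Python sketch of that arithmetic. The constant and function names are our own, and the 1 TB = 1024 GB conversion is an assumption made only to derive a per-DIMM capacity.

```python
# Figures taken from the feature list above; all names are illustrative.
MEMORY_CHANNELS_PER_SOCKET = 8
DIMMS_PER_CHANNEL = 2
MAX_MEMORY_PER_SOCKET_TB = 2
PCIE_LANES_PER_SOCKET_1P = 128
PCIE_LANES_PER_SOCKET_2P = 64  # per-socket limit quoted for 2-socket designs


def platform_limits(sockets: int) -> dict:
    """Return the DIMM, memory and PCIe 3.0 totals for a 1- or 2-socket system."""
    if sockets not in (1, 2):
        raise ValueError("this sketch only covers 1-socket and 2-socket systems")
    dimms = sockets * MEMORY_CHANNELS_PER_SOCKET * DIMMS_PER_CHANNEL
    memory_tb = sockets * MAX_MEMORY_PER_SOCKET_TB
    lanes_per_socket = (
        PCIE_LANES_PER_SOCKET_1P if sockets == 1 else PCIE_LANES_PER_SOCKET_2P
    )
    return {
        "dimms": dimms,
        "memory_tb": memory_tb,
        "pcie_lanes": sockets * lanes_per_socket,
        # Assumes 1 TB = 1024 GB; implied DIMM size needed to reach the maximum.
        "gb_per_dimm": memory_tb * 1024 // dimms,
    }


if __name__ == "__main__":
    for sockets in (1, 2):
        print(sockets, "socket(s):", platform_limits(sockets))
```

Both configurations end up with 128 PCIe 3.0 lanes, and reaching the quoted maximum memory would take 128 GB DIMMs under the conversion assumed here.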

In addition, AMD revealed the first EPYC family of processors – the EPYC 7000 Series, with nine processors divided into four segments. They also introduced three EPYC processor variants designed for single socket servers.

  • EPYC 7601, 7551, 7551P and 7501 processors have 32 cores and a peak TDP of 170-180 W.
  • EPYC 7451, 7401 and 7401P processors have 24 cores and a peak TDP of 170-180 W.
  • EPYC 7351, 7351P, 7301 and 7281 processors have 16 cores and a peak TDP of 170 W.
  • EPYC 7251 processor has 8 cores and a peak TDP of 120 W.

 

The AMD EPYC 7000 Series Presentation

If you have the time, check out this 103-minute presentation on the EPYC 7000 Series processors by Dr. Lisa Su and Forrest Norrod from AMD, as well as their partners. Dr. Su also comes out at the end to reveal the AMD Radeon Instinct MI25 accelerator, which we covered several months earlier.


 

Support Tech ARP!

If you like our work, you can help support our work by visiting our sponsors, participating in the Tech ARP Forums, or even donating to our fund. Any help you can render is greatly appreciated!

EPYC 7000 Series Processor Performance

AMD also revealed their internal benchmarks for the EPYC 7000 series processors :

Single Socket AMD EPYC 7601 Server

  • SPECint_rate2006 : 1200
  • SPECfp_rate2006 : 943

Two Socket AMD EPYC 7601 Server

  • SPECint_rate2006 : 1390
  • SPECfp_rate2006 : 1330

AMD compared the two-socket performance of the EPYC 7601 against the Intel Xeon E5-2699A v4, showing that it beat the Xeon by 47% in integer compute and by 75% in floating point compute.

 

Price Performance Advantage

AMD also took pains to point out the price-performance advantage the EPYC 7000 Series processors have over their Intel Xeon rivals, in both the 2-socket and single-socket segments.

 

Data Security Advantage

Other than raw computing performance and support for a truckload of memory, EPYC processors also offer an integrated hardware security subsystem. This allows the EPYC processors to deliver full memory encryption and secure multi-tenancy for data domain with no application impact.

 

Compatibility & Support

Finally, AMD wants everyone to know that the EPYC is an x86 processor and readily supports all x86 applications in the market. They have also performed extensive tests with industry partners like Microsoft, VMware and Red Hat.

They have also lined up considerable support from their server partners to deliver AMD EPYC solutions to customers worldwide.


AMD EPYC 7000 Series Processor Press Release

AUSTIN, Texas — June 20, 2017 — AMD (NASDAQ: AMD), and a global ecosystem of server partners, today marked a new era in the datacenter with the launch of AMD EPYC 7000 series high-performance datacenter processors. AMD was joined by multiple customers and partners at the global launch event in presenting a wide array of systems, performance demonstrations, and customer testimonials. The innovative, record-setting EPYC design, with up to 32 high-performance “Zen” cores and an unparalleled feature set, delivers greater performance than the competition across a full range of integer, floating point, memory bandwidth, and I/O benchmarks and workloads.

“With our EPYC family of processors, AMD is delivering industry-leading performance on critical enterprise, cloud, and machine intelligence workloads,” said Lisa Su, president and CEO, AMD. “EPYC processors offer uncompromising performance for single-socket systems while scaling dual-socket server performance to new heights, outperforming the competition at every price point. We are proud to bring choice and innovation back to the datacenter with the strong support of our global ecosystem partners.”

The world’s largest server manufacturers introduced products based on EPYC 7000-series processors at today’s launch, including HPE, Dell, ASUS, Gigabyte, Inventec, Lenovo, Sugon, Supermicro, Tyan, and Wistron. Primary hypervisor and server operating system providers Microsoft, Red Hat, and VMware showcased optimized support for EPYC, while key server hardware ecosystem partners Mellanox, Samsung Electronics, and Xilinx were also featured in EPYC-optimized platforms.

Cloud Datacenter and Enterprise Customers

Datacenter and cloud service providers also welcomed EPYC to the market today. Members of the “Super 7” datacenter services providers, including Baidu and Microsoft Azure, as well as 1&1, Bloomberg, Dropbox and LexisNexis, all voiced their support at launch.

Record-Setting EPYC Performance

The excitement around EPYC is driven by multiple record-setting server benchmarks achieved by EPYC-powered one-socket and two-socket systems.

AMD EPYC processors set several performance records, including:

  • Two-Socket Server
    • EPYC 7601-based system scored 2360 on SPECint_rate2006, higher than any other two-socket system score
  • One-Socket Server
    • EPYC 7601-based system scored 1200 on SPECint_rate2006, higher than any other mainstream one-socket x86-based system score
    • EPYC 7601-based system scored 943 on SPECfp_rate2006, higher than any other one-socket system score

All EPYC processors combine innovative security features, enterprise class reliability, and support a full feature-set. An EPYC 7601 CPU-based one-socket system shifts expectations for single socket server performance, helping lower total-cost-of-ownership (TCO), providing up to 20% CapEx savings compared to the Intel Xeon E5-2660 v4-based two-socket system. At every targeted price point for two-socket processors, EPYC outperforms the competition, with up to 70% more performance in the eight hundred dollar price band and up to 47% more performance at the high-end of the market of four thousand dollars or more.

EPYC Product Overview

  • A highly scalable System on Chip (SoC) design ranging from 8-core to 32-core, supporting two high-performance threads per core.
  • Industry-leading memory bandwidth across the line-up, with 8 channels of memory on every EPYC device. In a two-socket server, support for up to 32 DIMMS of DDR4 on 16 memory channels, delivering up to 4 terabytes of total memory capacity.
  • Unprecedented support for integrated, high-speed I/O with 128 lanes of PCIe 3 on every product
  • A highly-optimized cache structure for high-performance, energy efficient compute
  • AMD Infinity Fabric coherent interconnect linking EPYC CPUs in a two-socket system
  • Dedicated security hardware

 

The Official Presentation Slides


The Complete AMD Radeon Instinct Tech Briefing Rev. 3.0

The AMD Tech Summit held in Sonoma, California from December 7-9, 2016 was not only very exclusive, it was highly secretive. The first major announcement we have been allowed to reveal is the new AMD Radeon Instinct heterogeneous computing platform.

In this article, you will hear from AMD what the Radeon Instinct platform is all about. As usual, we have a ton of videos from the event, so it will be as if you were there with us. Enjoy! 🙂

Originally published @ 2016-12-12

Updated @ 2017-01-11 : Two of the videos were edited to comply with the NDA. Now that the NDA on AMD Vega has been lifted, we replaced the two videos with their full, unedited versions. We also made other changes, including adding links to the other AMD Tech Summit articles.

Updated @ 2017-01-20 : Replaced an incorrect slide, and a video featuring that slide. Made other small updates to the article.

 

The AMD Radeon Instinct Platform Summarised

For those who want the quick low-down on AMD Radeon Instinct, here are the key takeaway points :

  • The AMD Radeon Instinct platform is made up of two components – hardware and software.
  • The hardware components are the AMD Radeon Instinct accelerators built around the current Polaris and the upcoming Vega GPUs.
  • The software component is the AMD Radeon Open Compute (ROCm) platform, which includes the new MIOpen open-source deep learning library.
  • The first three Radeon Instinct accelerator cards are the MI6, MI8 and MI25 Vega with NCU.
  • The AMD Radeon Instinct MI6 is a passively-cooled inference accelerator with 5.7 TFLOPS of FP16 processing power, 224 GB/s of memory bandwidth, and a TDP of <150 W. It will come with 16 GB of GDDR5 memory.
  • The AMD Radeon Instinct MI8 is a small form-factor (SFF) accelerator with 8.2 TFLOPS of processing power, 512 GB/s of memory bandwidth, and a TDP of <175 W. It will come with 4 GB of HBM memory.
  • The AMD Radeon Instinct MI25 Vega with NCU is a passively-cooled training accelerator with 25 TFLOPS of processing power, support for 2X packed math, a High Bandwidth Cache and Controller, and a TDP of <300 W.
  • The Radeon Instinct accelerators will all be built exclusively by AMD.
  • The Radeon Instinct accelerators will all support MxGPU SRIOV hardware virtualisation.
  • The Radeon Instinct accelerators are all passively cooled.
  • The Radeon Instinct accelerators will all have large BAR (Base Address Register) support for multiple GPUs.
  • The upcoming AMD Zen “Naples” server platform is designed to support multiple Radeon Instinct accelerators through a high-speed network fabric.
  • The ROCm platform is not only open source, it will support a multitude of standards in addition to MIOpen.
  • The MIOpen deep learning library is open source, and will be available in Q1 2017.
  • The MIOpen deep learning library is optimised for Radeon Instinct, allowing for 3X better performance in machine learning.
  • AMD Radeon Instinct accelerators will be significantly faster than NVIDIA Titan X GPUs based on the Maxwell and Pascal architectures.
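To put the three accelerators in the list above side by side, here is a rough Python sketch that divides each card's quoted peak throughput by its quoted TDP ceiling. Because AMD quotes the TDPs as upper limits (<150 W, <175 W, <300 W), the results are only lower bounds on peak efficiency, based on marketing figures rather than measurements. The class and field names are our own.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Accelerator:
    name: str
    peak_tflops: float   # peak throughput quoted in the list above
    tdp_ceiling_w: int   # the "<N W" TDP limit quoted in the list above

    def min_gflops_per_watt(self) -> float:
        """Lower bound on peak efficiency, since the real TDP is below the ceiling."""
        return self.peak_tflops * 1000 / self.tdp_ceiling_w


ACCELERATORS = [
    Accelerator("Radeon Instinct MI6", 5.7, 150),
    Accelerator("Radeon Instinct MI8", 8.2, 175),
    Accelerator("Radeon Instinct MI25", 25.0, 300),
]

if __name__ == "__main__":
    for card in ACCELERATORS:
        print(f"{card.name}: at least {card.min_gflops_per_watt():.1f} GFLOPS per watt")
```

On these figures, the MI25 would offer roughly twice the peak throughput per watt of the MI6.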

In the subsequent pages, we will give you the full low-down on the Radeon Instinct platform, with the following presentations by AMD :


We also prepared the complete video and slides of the Radeon Instinct tech briefing for your perusal :


Why Is Heterogeneous Computing Important?

Dr. Lisa Su kicked things off with an inside look at her two-year journey as AMD President and CEO. Then she revealed why heterogeneous computing is an important part of AMD’s future. She also mentioned the success of the recently-released Radeon Software Crimson ReLive Edition.

 

Here Are The New AMD Radeon Instinct Accelerators!

Next, Raja Koduri, Senior Vice President and Chief Architect of the Radeon Technologies Group, officially revealed the new AMD Radeon Instinct accelerators.

 

The MIOpen Deep Learning Library For Radeon Instinct

MIOpen is a new deep learning library optimised for Radeon Instinct. It is open source and will become part of the Radeon Open Compute (ROCm) platform. It will be available in Q1 2017.


 

The Performance Advantage Of Radeon Instinct & MIOpen

MIOpen is optimised for Radeon Instinct, offering 3X better performance in machine learning. It allows the Radeon Instinct accelerators to be significantly faster than NVIDIA Titan X GPUs based on the Maxwell and Pascal architectures.


The Radeon Instinct MI25 Training Demonstration

Raja Koduri roped in Ben Sander, Senior Fellow at AMD, to show off the Radeon Instinct MI25 running a training demo.

 

The Radeon Instinct MI8 Visual Inference Demonstration

The visual inference demo is probably much easier to grasp, as it is visual in nature. AMD used the Radeon Instinct MI8 in this example.

 

The Radeon Instinct On The Zen “Naples” Platform

The upcoming AMD Zen “Naples” server platform is designed to support multiple AMD Radeon Instinct accelerators through a high-speed network fabric.


 

The First Radeon Instinct Servers

This is not a vapourware launch. Raja Koduri revealed the first slew of Radeon Instinct servers that will hit the market in H1 2017.

 

The Radeon Open Compute (ROCm) Platform Discussion

To illustrate the importance of heterogeneous computing on Radeon Instinct, Greg Stoner (ROCm Senior Director at AMD) hosted a panel of AMD partners and early adopters of the Radeon Open Compute (ROCm) platform.


Closing Remarks On Radeon Instinct

Finally, Raja Koduri concluded the launch of the Radeon Instinct Initiative with some closing remarks on the recent Radeon Software Crimson ReLive Edition.

 

The Complete AMD Radeon Instinct Tech Briefing

This is the complete AMD Radeon Instinct tech briefing. Our earlier video was edited to comply with the AMD Vega NDA (which has now expired).


 

The Complete AMD Radeon Instinct Tech Briefing Slides

Here are the Radeon Instinct presentation slides for your perusal.

 


The First AMD Radeon Instinct Servers Revealed!

When AMD launched Radeon Instinct at the 2016 AMD Tech Summit in Sonoma earlier this month, they showed off several servers that will be powered by the new Radeon Instinct accelerators. These Radeon Instinct servers can now deliver up to 3 petaflops (3,000 TFLOPS) of FP16 compute performance using those Radeon Instinct accelerators.

Most of the performance boost comes from the combination of the new Vega GPU architecture, which allows for 2X packed FP16 math ops; and the new AMD MIOpen deep learning library.

After the launch event, we were given the opportunity to look inside two of these servers – the Supermicro 1028GQ-TRT and the Inventec K888 G3. Both of these servers will ship with multiple Radeon Instinct MI25 Vega with NCU accelerators, allowing them to deliver up to 100 TFLOPS of FP16 compute performance.

We also had a look at the Falconwitch PS1816 server which can host a whopping 16 Radeon Instinct MI25 Vega with NCU accelerators to deliver 400 teraflops of FP16 compute performance!

 

The Supermicro 1028GQ-TRT

This is the server Ben Sander used to demonstrate the training capability of the Radeon Instinct MI25 accelerator at the 2016 AMD Tech Summit.

The Supermicro 1028GQ-TRT is a 1U server that fits up to 3 Radeon Instinct MI25 Vega with NCU accelerators. That allows it to deliver up to 75 teraflops of FP16 compute performance.

Multiple servers can be combined to increase compute performance. In his demo, Ben Sander used two of these Supermicro servers to obtain 150 teraflops of computing performance.

 

The Inventec K888 G3

The Inventec K888 G3 is a 2U, 2-processor server that fits up to 4 Radeon Instinct MI25 Vega with NCU accelerators. This allows it to deliver up to 100 teraflops of FP16 compute performance.

In this example, the Inventec K888 is powered by four FirePro S9300 X2 cards instead. Each of these FirePro S9300 X2 cards delivers slightly more FP16 compute performance than the Radeon Instinct MI25 Vega,

 

The Falconwitch PS1816

The Falconwitch PS1816 is a 2U, 24-bay server that boasts a total of 288 PCIe lanes. This allows it to support up to sixteen Radeon Instinct MI25 Vega with NCU accelerators to deliver 400 teraflops of FP16 compute performance.

If that’s not enough, there is an Inventec Radeon Instinct 42U rack that features six of these Falconwitch PS1816 servers and an additional four Radeon Instinct MI25 Vega with NCU accelerators. That is a total of 120 Radeon Instinct MI25 accelerators, delivering 3,000 teraflops or 3 petaflops of FP16 compute performance! This is literally the mother of all Radeon Instinct servers!
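The compute figures quoted for these servers are simply the number of accelerators multiplied by the 25 TFLOPS peak FP16 rating of each Radeon Instinct MI25. Here is a short Python sketch of that multiplication, using the accelerator counts stated in this article. The names are our own, and the results are theoretical peaks, not measured performance.

```python
# Peak FP16 rating of one Radeon Instinct MI25, as quoted by AMD.
MI25_PEAK_FP16_TFLOPS = 25

# Accelerator counts as stated in this article.
ACCELERATORS_PER_SYSTEM = {
    "Supermicro 1028GQ-TRT": 3,
    "Inventec K888 G3": 4,
    "Falconwitch PS1816": 16,
    "Inventec 42U rack": 120,
}


def peak_fp16_tflops(accelerators: int) -> int:
    """Theoretical peak FP16 throughput for a given number of MI25 accelerators."""
    return accelerators * MI25_PEAK_FP16_TFLOPS


if __name__ == "__main__":
    for system, count in ACCELERATORS_PER_SYSTEM.items():
        print(f"{system}: {count} x MI25 = {peak_fp16_tflops(count)} TFLOPS")
```

The same function covers Ben Sander's two-server demo: two Supermicro servers hold six accelerators, or 150 TFLOPS in total.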

 

Raja Koduri Introducing The First Radeon Instinct Servers

For those who missed our complete coverage of Radeon Instinct, here is the video of Radeon Technologies Group Senior Vice President and Chief Architect, Raja Koduri introducing the first Radeon Instinct servers.

For more information on the Radeon Instinct accelerators, and MIOpen deep learning library, please take a look at our article – The Complete AMD Radeon Instinct Tech Briefing!

 
