Nvidia’s Arm-Powered Grace CPU Debuts, Claims 10X More Performance Than x86 Servers

(Image credit: Nvidia)

Nvidia introduced its Arm-based Grace CPU architecture that the company will use to power two new AI supercomputers. Nvidia says its new chips deliver 10X more performance than today’s fastest servers in AI and HPC workloads. 

The new Grace CPU architecture pairs unspecified “next-generation” Arm Neoverse CPU cores with LPDDR5x memory that delivers 500 GBps of throughput, along with a 900 GBps NVLink connection to an unspecified GPU on leading-edge devices. Nvidia also revealed a new roadmap (below) that shows a “Grace Next” CPU coming in 2025 and a new “Ampere Next Next” GPU arriving in mid-2024.

Notably, Nvidia named the Grace CPU architecture after Grace Hopper, a famous computer scientist. Nvidia is also rumored to be working on its chiplet-based Hopper GPUs, which would make for an interesting pairing of CPU and GPU codenames that we could see more of in the future. 

Nvidia’s pending Arm acquisition, which is still winding its way through global regulatory bodies, has led to plenty of speculation that we could see Nvidia-branded Arm-based CPUs. Nvidia CEO Jensen Huang confirmed that is a distinct possibility. While the first instantiation of the Grace CPU architecture doesn’t come as a general-purpose design in the socketed form factor we’re accustomed to (instead coming mounted on a board with a GPU), it is clear that Nvidia is serious about deploying its own Arm-based data center CPUs.

Nvidia hasn’t shared core counts or frequency information yet, which isn’t entirely surprising given that the Grace CPUs won’t come to market until early 2023. The company did specify that these are next-generation Arm Neoverse cores. Given what we know about Arm’s current public roadmap (slides below), these are likely the V1 Platform ‘Zeus’ cores, which are optimized for maximum performance at the cost of power and die area. 

(Image credits: Nvidia, Arm)

Chips based on the Zeus cores will come in either 7nm or 5nm flavors and offer a 50% increase in IPC over the current Arm N1 cores. Nvidia says its Grace CPU will have plenty of performance, with a projected score of 300+ in the SPECrate_2017_int_base benchmark. That’s respectable for a freshman effort, though AMD’s EPYC Milan chips, the current performance leaders in the data center, have posted results ranging from 382 to 424, putting Grace more on par with the 64-core AMD Rome chips. Given Nvidia’s “10X” performance claims relative to existing servers, it appears the company is referring primarily to GPU-driven workloads.

The Arm V1 platform supports all the latest high-end tech, like PCIe 5.0, DDR5, and either HBM2e or HBM3, along with the CCIX 1.1 interconnect. It appears that, at least for now, Nvidia is utilizing its own NVLink instead of CCIX to connect its CPU and GPU.

As we can see above, the first versions of the Nvidia Grace CPU will come mounted in a BGA package (meaning they won’t be socketed parts like traditional x86 server chips), flanked by what appear to be eight packages of LPDDR5x memory. Nvidia says LPDDR5x ECC memory provides twice the bandwidth and 10X better power efficiency than standard DDR4 memory subsystems.

Nvidia’s next-generation NVLink, which it hasn’t shared many details about yet, connects the CPU to the adjacent GPU at a 900 GBps transfer rate (14X faster, per Nvidia), outstripping the data transfer rates traditionally available from a CPU to a GPU by 30X. The company also claims the new design can transfer data between CPUs at twice the rate of standard designs, removing a key bottleneck between the various compute elements, like CPUs, GPUs, and system memory.

(Image credit: Nvidia)

The graphics above outline Nvidia’s primary problem: feeding its GPUs with enough bandwidth in a modern system. The first slide shows the bandwidth limitation of 64 GBps from memory to GPU in an x86 CPU-driven system, with the limitations of PCIe throughput (16 GBps) exacerbating the low throughput and ultimately limiting how much system memory the GPU can fully utilize. The second slide shows throughput with the Grace CPUs: With four NVLinks, CPU-to-GPU throughput is boosted to 500 GBps, while memory-to-GPU throughput increases 30X to 2,000 GBps.
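
To put those numbers in perspective, here’s a rough back-of-the-envelope sketch of what the quoted bandwidths mean for moving a large dataset to the GPU. This is our own illustration using the figures from Nvidia’s slides; the 512GB working-set size is an arbitrary assumption, not a number Nvidia has quoted.

#include <cstdio>

// Transfer-time math using the bandwidth figures from Nvidia's slides.
// The 512GB working set is a hypothetical value chosen for illustration.
int main() {
    const double working_set_gb = 512.0;   // hypothetical dataset size
    const double pcie_gbps      = 16.0;    // PCIe link in the x86 system
    const double x86_mem_gbps   = 64.0;    // memory-to-GPU path, x86 system
    const double nvlink_gbps    = 500.0;   // four NVLinks, Grace system
    const double grace_mem_gbps = 2000.0;  // memory-to-GPU path, Grace system

    printf("PCIe (16 GBps):        %6.2f s\n", working_set_gb / pcie_gbps);
    printf("x86 memory (64 GBps):  %6.2f s\n", working_set_gb / x86_mem_gbps);
    printf("NVLink (500 GBps):     %6.2f s\n", working_set_gb / nvlink_gbps);
    printf("Grace mem (2000 GBps): %6.2f s\n", working_set_gb / grace_mem_gbps);
    // 2,000 / 64 = 31.25, which is where the roughly 30X figure comes from.
    printf("Memory-to-GPU speedup: %.0fX\n", grace_mem_gbps / x86_mem_gbps);
    return 0;
}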

The NVLink implementation also provides cache coherency, which brings the system and GPU memory (LPDDR5x and HBM) under the same memory address space to simplify programming. Cache coherency also reduces data movement between the CPU and GPU, thus increasing both performance and efficiency. This addition allows Nvidia to offer functionality similar to AMD’s pairing of EPYC CPUs with Radeon Instinct GPUs in the Frontier exascale supercomputer, and to Intel’s combination of Ponte Vecchio graphics cards with Sapphire Rapids CPUs in Aurora, another world-leading exascale system.
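
To illustrate what a single shared address space means for programmers, here’s a minimal sketch using CUDA’s existing unified-memory API (cudaMallocManaged). It’s our own example of the programming model that hardware cache coherency accelerates, not code Nvidia has published for Grace.

#include <cstdio>
#include <cuda_runtime.h>

// GPU kernel: increment every element in place.
__global__ void increment(float *data, int n) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) data[i] += 1.0f;
}

int main() {
    const int n = 1 << 20;
    float *data;

    // A single allocation visible to both CPU and GPU -- no separate host
    // and device buffers, and no explicit cudaMemcpy staging between them.
    cudaMallocManaged(&data, n * sizeof(float));

    for (int i = 0; i < n; ++i) data[i] = 1.0f;   // CPU writes the data...
    increment<<<(n + 255) / 256, 256>>>(data, n); // ...the GPU updates it...
    cudaDeviceSynchronize();
    printf("data[0] = %.1f\n", data[0]);          // ...and the CPU reads 2.0

    cudaFree(data);
    return 0;
}

On today’s PCIe-attached GPUs the CUDA driver services this model by migrating pages in software; coherent high-bandwidth links like NVLink should let the same code run without those migration penalties.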

Nvidia says this combination of features will reduce the time it takes to train GPT-3, the world’s largest natural language AI model, on Selene, the world’s current fastest AI supercomputer at 2.8 AI exaflops, from fourteen days to two, roughly a 7X speedup.

(Image credit: Nvidia)

Nvidia also revealed a new roadmap that it says will dictate its cadence of updates over the next several years, with GPUs, CPUs (Arm and x86), and DPUs all co-existing and evolving on a steady schedule. Huang said the company would advance each architecture every two years, with a possible “kicker” generation in between, which will likely consist of smaller advances in process technology rather than new architectures.

(Image credit: Nvidia)

The US Department of Energy’s Los Alamos National Laboratory will build a Grace-powered supercomputer. The system will be built by HPE’s Cray division and will come online in 2023, but the DOE hasn’t shared many details about it yet.

The Grace CPU will also power what Nvidia touts as the world’s most powerful AI-capable supercomputer, the Alps system that will be deployed at the Swiss National Supercomputing Centre (CSCS). Alps will primarily serve European scientists and researchers when it comes online in 2023, tackling workloads like climate modeling, molecular dynamics, and computational fluid dynamics.

Given Nvidia’s interest in purchasing Arm, it’s natural to expect the company to begin broadening its relationships with existing Arm customers. To that end, Nvidia will also bring support for its GPUs to Amazon Web Services’ powerful Graviton2 Arm chips, a key addition as AWS’s adoption of the Arm architecture has driven broader uptake of Arm for cloud workloads.