News

Pascal GTX 1080 Async Compute Explored

Last week, a report came out suggesting that Pascal may include improved asynchronous compute support. However, Nvidia also claimed asynchronous compute support with Maxwell but that proved to be less than optimal solution. From the leak of the GTX 1080 slide deck, we’re now able to glean a few more details about what Nvidia has meant about async compute support and how it has been improved with Pascal.

Async compute basically means working on a graphics workload at the same time as a compute one, making the most of the GPU’s resources. This can work on a number of levels, either at the GPU level or SM/CU level. Maxwell only worked at a GPU level, assigning each SM to either a graphics or compute task. Scheduling was done in the shader and in order add/switch tasks, the previous task had to finish or be preempted/stopped. Furthermore, Maxwell only had static partitioning, so graphics and compute tasks scheduling at the same time had to both finish and weren’t able to dynamically reallocate resources if one task finished first. This led to GCN leading when it came to async compute.

Pascal brings a number of changes. First off, the promised improved preemption has come. Pascal will be able to offer more fine-grained control, entering a new task between pixels or instructions. This will allow for better and likely faster preemption. The next change is dynamic load balancing. This allows Pascal to reallocate resources dedicated to either graphics or compute on the fly. This GPU level means that once a graphics/compute task is finished, the idle SMs can now be added to those working on another graphics/compute task, speeding up completion. This should allow for much better async compute performance compared to Maxwell.

Even with all of these additions, Pascal still won’t quite match GCN. GCN is able to run async compute at the SM/CU level, meaning each SM/CU can work on both graphics and compute at the same time, allowing even better efficiency. Nonetheless, Pascal is finally offering a solution with hardware scheduled async compute, bringing Nvidia closer to AMD. Either way, with both Nvidia and AMD working on async compute, developers are more likely to take notice and make sure our GPUs are fully utilized.

Samuel Wan

Samuel joined eTeknix in 2015 after becoming engrossed in technology and PC hardware. With his passion for gaming and hardware, tech writing was the logical step to share the latest news with the world. When he’s not busy dreaming about the latest hardware, he enjoys gaming, music, camping and reading.

Disqus Comments Loading...

Recent Posts

Ducky One 3 Classic Fullsize USB RGB Mechanical Gaming Keyboard Cherry

The One 3 series features Ducky's all new QUACK Mechanics design philosophy which focuses on…

3 hours ago

Ducky Keyboard Coiled Cable V2 Cotton Candy

Coiled cable with long straight section connected by 5-pin aviation head USB-A to USB-C cable…

3 hours ago

Philips 24″ EVNIA 24M2N3201A/00 1920×1080 Fast IPS 180Hz 1ms Gaming Monitor

0.5 ms ultra-fast speed for crisp images and smooth gameplay SmartImage game mode optimised for…

3 hours ago

OcUK Gaming Claymore

CaseKolink Observatory Z Mesh ARGB Super Midi Tower Case - BlackPower Supply1000W 80+ Gold Rated…

3 hours ago

OcUK Gaming Radiance Ember – Intel Core i5, RTX 4070Ti – Powered By Asus Gaming PC

CaseAsus TUG Gaming GT 502 Gaming Case - BlackPower Supply850W 80Plus Modular Gold Rated PCIE5.0…

3 hours ago

AOC Agon 34″ AG346UCD 3440×1440 QD-OLED 175hz Curved Ultrawide Gaming Monitor

LightingLightingYesSpeakersSpeakersYesDimensionsLength / Depth294.3 mmWidth811.7 mmHeight551.2 mmWeight10.8 kgStandards / SpecificationsAdaptive Sync Technology (G-Sync / Freesync)adaptiveSyncColourPrimary ColourBlackDisplayDisplay…

3 hours ago