News

Nvidia releases CUDA Toolkit Version 4.1

NVIDIA today released a new version of its CUDA parallel computing platform, which will make it easier for computational biologists, chemists, physicists, geophysicists, other researchers, and engineers to advance their simulations and computational work by using GPUs.

The new NVIDIA CUDA parallel computing platform features three key enhancements that make parallel programing with GPUs easier, more accessible and faster. These include:

– Re-designed Visual Profiler with automated performance analysis, providing an easier path to application acceleration
– New compiler, based on the widely-used LLVM open-source compiler infrastructure, delivering up to 10 percent speed up in application performance
– Hundreds of new imaging and signal processing functions, doubling the size of the NVIDIA Performance Primitives (NPP) library

“The new visual profiler is amazing,” said Joshua Anderson, lead developer of the HOOMD-blue open source molecular dynamics project. “With just a few clicks, it performs an automated performance analysis of your application, highlights likely problem areas, and then provides links to best-practice suggestions on improving them. It makes it quick and easy for virtually all developers to accelerate a broad range of applications.”

“The LLVM complier gave me an almost immediate 10 percent performance speed up, just by recompiling my existing real-time financial risk analysis code,” said Gilles Civario, senior software architect at the Irish Centre for High-End Computing. “I can only imagine the additional performance gains I can achieve with additional tuning using the new CUDA release.”

Among the new features of the latest CUDA parallel computing platform release (available free of charge on the NVIDIA developer web site here) are:

New Visual Profiler – Easiest path to performance optimization

The new Visual Profiler makes it easy for developers at all experience levels to optimize their code for maximum performance. Featuring automated performance analysis and an expert guidance system that delivers step-by-step optimization suggestions, the Visual Profiler identifies application performance bottlenecks and recommends actions, with links to the optimization guides. Using the new Visual Profiler, performance bottlenecks are easily identified and actionable.

LLVM Compiler – Instant 10 percent increase in application performance

LLVM is a widely-used open-source compiler infrastructure featuring a modular design that makes it easy to add support for new programming languages and processor architectures. Using the new LLVM-based CUDA compiler, developers can achieve up to 10 percent additional performance gains on existing GPU-accelerated applications with a simple recompile.

In addition, LLVM’s modular design allows third-party software tool developers to provide a custom LLVM solution for non-NVIDIA processor architectures, enabling CUDA applications to run across NVIDIA GPUs, as well as those from other vendors. (CUDA to be available on AMD cards? Intepret this how you like)

New Image, Signal Processing Library Functions – “Drop-in” Acceleration with NPP Library

NVIDIA has doubled the size of its NPP library, with the addition of hundreds of new image and signal processing functions. This enables virtually any developer using image or signal processing algorithms to easily gain the benefit of GPU acceleration, with the simple addition of library calls into their application. The updated NPP library can be used for a wide variety of image and signal processing algorithms, ranging from basic filtering to advanced workflows.

Source: Press Release

Ryan Martin

Disqus Comments Loading...

Recent Posts

Corsair 45″ 45WQHD240 UltraWide Quad HD 240Hz FreeSync OLED HDR Flexible Gaming Monitor

Set the curve with the CORSAIR XENEON FLEX 45WQHD240 OLED Bendable UltraWide Gaming Display, built…

17 mins ago

MSI NVIDIA GeForce RTX 4090 24GB GAMING X TRIO Ada Lovelace Graphics Card

Say hello to the future of graphics, with the MSI GeForce RTX 4090 GAMING X…

19 mins ago

Gaming PC with NVIDIA GeForce RTX 3050 and Intel Core i5 12400F

This Scan Gamer RTX features the 8GB NVIDIA GeForce RTX 3050 graphics card featuring new…

22 mins ago

MSI MAG Z790 TOMAHAWK WiFi + INTEL i7-14700K + MSI MAG CORELIQUID E360 AIO Bundle

The MAG series fights alongside gamers in pursuit of honor. With added military-inspired elements in…

25 mins ago

Logitech G733 LIGHTSPEED Wireless Gaming Headset 7.1Ch Virtual Surround PC/MAC/Console

Wireless gaming headset designed for performance and comfort. Outfitted with all the surround sound, voice…

27 mins ago

NZXT H6 Flow RGB Black Compact Dual-Chamber Tempered Glass PC Case

The H6 Flow's innovative compact design emphasizes GPU cooling with a strategically angled front corner,…

51 mins ago