nVidia CUDA 2.3 Beta: Closed Testing

09:47

The source is once again the official nVidia forum, in the post "CUDA Toolkit and SDK 2.3 betas available to registered developers", which reads as follows:

The CUDA Toolkit and SDK 2.3 betas are now available to registered developers. They include the following features:

  • The CUFFT Library now supports double-precision transforms and includes significant performance improvements for single-precision transforms as well. See the CUDA Toolkit release notes for details.
  • The CUDA-GDB hardware debugger is now available for all supported Linux platforms and is included in the CUDA Toolkit installer.
  • GPUs in an SLI group are now enumerated individually, so you can achieve multi-GPU performance even when SLI is enabled for graphics.
  • New support for fp16 <-> fp32 conversion intrinsics allows storage of data in fp16 format with computation in fp32. Use of fp16 format is ideal for applications that require higher numerical range than 16-bit integer but less precision than fp32, and it reduces memory space and bandwidth consumption.
  • The CUDA SDK has been updated to include:
    • A new pitchLinearTexture code sample that shows how to efficiently texture from pitch linear memory.
    • A new PTXJIT code sample illustrating how to use cuModuleLoadDataEx() to load PTX source from memory instead of loading a file.
    • Two new code samples for Windows, showing how to use the NVCUVID library to decode MPEG-2, VC-1, and H.264 content and pass frames to OpenGL or Direct3D for display.
    • Updated code samples showing how to properly align CUDA kernel function parameters so the same code works on both x32 and x64 systems.
  • The Visual Profiler (packaged separately) includes several enhancements:
    • All memory transfer API calls are now reported.
    • Support for profiling multiple contexts per GPU.
    • Synchronized clocks for requested start time on the CPU and start/end times on the GPU for all kernel launches and memory transfers.
    • Global memory load and store efficiency metrics for GPUs with compute capability 1.2 and higher.
  • The CUDA Driver for MacOS is now packaged separately from the CUDA Toolkit.
  • Support for major Linux distros, MacOS X, and Windows:
    • Fedora 10, RHEL 4.7 & 5.3, SLED 10.2 & 11.0, OpenSUSE 11.1, and Ubuntu 8.10 & 9.04
    • Windows XP/Vista/7 with Visual Studio 8 (VC2005) and 9 (VC2008)
    • MacOS X 10.5.6 and later (32-bit)
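
To make the fp16 point in the list above a bit more concrete, here is a minimal host-side sketch of what an fp16 <-> fp32 conversion does. It is plain C++ rather than the toolkit's own device intrinsics, and `float_to_half` / `half_to_float` are hypothetical helper names made up for this illustration. It assumes the IEEE 754 binary16 layout (1 sign bit, 5 exponent bits, 10 mantissa bits) and the default round-to-nearest-even mode.

```cpp
#include <cmath>
#include <cstdint>
#include <limits>

// Hypothetical helper (not a CUDA API): decode IEEE 754 binary16 bits into a float.
float half_to_float(std::uint16_t h) {
    const std::uint32_t sign = (h >> 15) & 0x1u;
    const std::uint32_t exponent = (h >> 10) & 0x1Fu;
    const std::uint32_t mantissa = h & 0x3FFu;
    float value;
    if (exponent == 0) {
        // Zero or subnormal: mantissa counts units of 2^-24.
        value = std::ldexp(static_cast<float>(mantissa), -24);
    } else if (exponent == 31) {
        // All-ones exponent encodes infinity (mantissa 0) or NaN.
        value = mantissa ? std::numeric_limits<float>::quiet_NaN()
                         : std::numeric_limits<float>::infinity();
    } else {
        // Normal: (1024 + mantissa) * 2^(exponent - 15 - 10).
        value = std::ldexp(static_cast<float>(mantissa | 0x400u),
                           static_cast<int>(exponent) - 25);
    }
    return sign ? -value : value;
}

// Hypothetical helper (not a CUDA API): encode a float as binary16 bits,
// rounding to nearest even (assumes the default floating-point rounding mode).
std::uint16_t float_to_half(float f) {
    const std::uint32_t sign = std::signbit(f) ? 0x8000u : 0u;
    const float a = std::fabs(f);
    if (std::isnan(a)) {
        return static_cast<std::uint16_t>(sign | 0x7E00u);  // quiet NaN
    }
    if (a >= 65520.0f) {
        // At or above the midpoint past the largest finite half (65504): infinity.
        return static_cast<std::uint16_t>(sign | 0x7C00u);
    }
    if (a < std::ldexp(1.0f, -14)) {
        // Below the smallest normal half: round to a multiple of 2^-24.
        const std::uint32_t q =
            static_cast<std::uint32_t>(std::nearbyint(std::ldexp(a, 24)));
        return static_cast<std::uint16_t>(sign | q);
    }
    int e = 0;
    const float m = std::frexp(a, &e);  // a = m * 2^e, with m in [0.5, 1)
    // Scale the significand to an integer in [1024, 2048]; 2048 means the
    // rounding carried into the next exponent, which the addition below handles.
    const std::uint32_t q =
        static_cast<std::uint32_t>(std::nearbyint(std::ldexp(m, 11)));
    const std::uint32_t bits =
        (static_cast<std::uint32_t>(e - 1 + 15) << 10) + (q - 1024u);
    return static_cast<std::uint16_t>(sign | bits);
}
```

With these helpers, a value such as 40000.0f, which does not fit in a signed 16-bit integer, survives a round trip through 16 bits unchanged, while 0.1f comes back as roughly 0.09998, which is the range-versus-precision trade-off the announcement describes.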

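As for what Heresy personally sees as the highlights: the main one is probably the SLI improvement, and the video decoder sample, which had disappeared for a while, appears to be back as well. Other pieces such as CUFFT, CUDA-GDB, and the Visual Profiler also look considerably improved.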

That said, Heresy hasn't been doing much with CUDA lately, so none of this affects Heresy much. :p
