NVIDIA Unveils Next Generation CUDA GPU Architecture – Codenamed “Fermi”

nvidia-logoNVIDIA Corp. introduced its next generation CUDA™ GPU architecture, codenamed “Fermi”. An entirely new ground-up design, the “Fermi”™ architecture is the foundation for the world’s first computational graphics processing units (GPUs), delivering breakthroughs in both graphics and GPU computing.

“NVIDIA and the Fermi team have taken a giant step towards making GPUs attractive for a broader class of programs,” said Dave Patterson, director Parallel Computing Research Laboratory, U.C. Berkeley and co-author of Computer Architecture: A Quantitative Approach. “I believe history will record Fermi as a significant milestone.”

Fermi Architecture

Presented at the company’s inaugural GPU Technology Conference, in San Jose, California, “Fermi” delivers a feature set that accelerates performance on a wider array of computational applications than ever before. Joining NVIDIA’s press conference was Oak Ridge National Laboratory who announced plans for a new supercomputer that will use NVIDIA® GPUs based on the “Fermi” architecture. “Fermi” also garnered the support of leading organizations including Bloomberg, Cray, Dell, HP, IBM and Microsoft.

“It is completely clear that GPUs are now general purpose parallel computing processors with amazing graphics, and not just graphics chips anymore,” said Jen-Hsun Huang, co-founder and CEO of NVIDIA. “The Fermi architecture, the integrated tools, libraries and engines are the direct results of the insights we have gained from working with thousands of CUDA developers around the world. We will look back in the coming years and see that Fermi started the new GPU industry.”

Nvidia Parallel DataCache

As the foundation for NVIDIA’s family of next generation GPUs namely GeForce®, Quadro® and Tesla® − “Fermi” features a host of new technologies that are “must-have” features for the computing space, including:

  • C++, complementing existing support for C, Fortran, Java, Python, OpenCL and DirectCompute.
  • ECC, a critical requirement for datacenters and supercomputing centers deploying GPUs on a large scale
  • 512 CUDA Cores™ featuring the new IEEE 754-2008 floating-point standard, surpassing even the most advanced CPUs
  • 8x the peak double precision arithmetic performance over NVIDIA’s last generation GPU. Double precision is critical for high-performance computing (HPC) applications such as linear algebra, numerical simulation, and quantum chemistry
  • NVIDIA Parallel DataCache™ – the world’s first true cache hierarchy in a GPU that speeds up algorithms such as physics solvers, raytracing, and sparse matrix multiplication where data addresses are not known beforehand
  • NVIDIA GigaThread™ Engine with support for concurrent kernel execution, where different kernels of the same application context can execute on the GPU at the same time (eg: PhysX® fluid and rigid body solvers)
  • Nexus – the world’s first fully integrated heterogeneous computing application development environment within Microsoft Visual Studio

Images, technical whitepapers, presentations, videos and more on “Fermi” can all be found at: www.nvidia.com/fermi

Source: Nvidia

Related posts

Toshiba debuts AL13SX hard drives

Toshiba debuts AL13SX hard drives

The Japanese tech company Toshiba has presented new hard drives that belong to the AL13SX line. The new company products come with capacities of 300, 450 and 600 GB, 2.5-inch chassis and platter speed of 15 000 rpm. The line includes support for the 512n industry standard, 4K and 512e...

Circle

Circle

When you see a game called Circle what would you think? We did not know what to expect so we decided to try this game. Well, what can we say? We’ll start by saying that Circle is an arcade game that requires you to tap all the time like games such as Flappy Bird. In fact this time you do not...

Intel releases new Atom x5 processors

Intel releases new Atom x5 processors

The US chip maker Intel has released new Atom x5 8000 processors for mobile devices. There are three new energy-efficient chips – the Atom x5-Z8330 and Atom x5-Z8350 belong to the Cherry Trail chip family (14 nm), while the Atom x5-E8000 belongs to the 14 nm Braswell family. All three new...

Leave a comment