NVIDIA Opengl in 2012 Mark Kilgard

NVIDIA OpenGL in 2012

Mark Kilgard

• Principal System Software Engineer

– OpenGL driver and API evolution

– Cg (“C for graphics”) shading language

– GPU-accelerated path rendering

• OpenGL Utility Toolkit (GLUT) implementer

• Author of OpenGL for the X Window System • Co-author of Cg T u torial

Outline

• OpenGL’s importance to NVIDIA

• OpenGL API improvements & new features

– OpenGL 4.2 – Direct3D interoperability

– GPU-accelerated path rendering

– Kepler Improvements

• Bindless Textures

• Linux improvements & new features

• Cg 3.1 update

NVIDIA’s OpenGL Leverage

GeForce

Cg
Parallel Nsight
Quadro
Tegra
OptiX

Example of Hybrid Rendering with OptiX

OpenGL (Rasterization)
OptiX (Ray tracing)

Parallel Nsight Provides OpenGL Profiling

Configure

Application

Trace Settings

Magnified trace options shows specific OpenGL (and Cg) tracing options

Trace of mix of OpenGL and CUDA shows glFinish & OpenGL draw calls

Only Cross Platform 3D API
OpenGL 3D Graphics API

• cross-platform • most functional

• peak performance

• open standard • inter-operable • well specified & documented • 20 years of compatibility

OpenGL Spawns Closely Related Standards

Congratulations: WebGL officially approved, February 2012

“The web is now 3D enabled”

Buffer and
Event

Interop

OpenGL 4 – DirectX 11 Superset

• Interop with a complete compute solution

– OpenGL is for graphics – CUDA / OpenCL is for compute

• Shaders can be saved to and loaded from binary blobs

– Ability to query a binary shader, and save it for reuse later

• Flow of content between desktop and mobile

– All of OpenGL ES 2.0 capabilities available on desktop – EGL on Desktop in the works

– WebGL bridging desktop and mobile

• Cross platform

– Mac, Windows, Linux, Android, Solaris, FreeBSD

– Result of being an open standard

Increasing Functionality of OpenGL

Tessellation and Compute Features

4.X

Geometry Shaders

3.X

Vertex and Fragment Shaders

2.X

Fixed Function

1.X

Classic OpenGL State Machine

• From 1991-2007

* vertex & fragment processing got programmable 2001 & 2003

[source: GL 1.0 specification, 1992]

Complicated from inception

OpenGL 3.x Conceptual Processing Flow (pre-2010)

uniform/

parameters buffer objects

primitive topology,

Legend

transformed vertex data

Geometric primitive assembly &
Vertex assembly
Vertex processing

vertices primitive

batch type,

programmable operations

processing

transformed vertex attributes

pixels

fragments

filtered texels

buffer data

vertex data

point, line, and polygon fragments

fixed-function operations

geometry texture

fetches

Transform

feedback

vertex buffer transform feedback buffer objects

pixels in framebuffer object textures

objects

stenciling, depth testing, blending, accumulation

primitive batch type, vertex indices, vertex attributes

vertex

texture

fetches

texture buffer objects buffer data,

unmap buffer

Texture mapping
Fragment processing
Raster operations
Framebuffer

fragment texture fetches

Command parser
Buffer store

pixel pack buffer objects map buffer, get buffer data

image and bitmap fragments

texture image specification

pixel

pixel image or texture image specification

unpack buffer objects

Pixel packing

OpenGL 3.2

pixels to pack

image

rectangles, bitmaps

Image primitive

processing

Pixel unpacking
Pixel processing

unpacked pixels copy pixels, copy texture image

Patch assembly & processing
Control point processing
Patch tessellation generation
Patch evaluation processing

transformed control points

transformed

patch, bivariate

domain transformed

patch

patch control points patch

tessellation texture fetches

patch topology, evaluated patch vertex primitive topology,

Legend

transformed vertex data

Geometric primitive assembly &
Vertex assembly
Vertex processing

patch data

primitive batch type,

programmable operations

vertices

processing

transformed vertex attributes

pixels

point, line,

and polygon

fragments

vertex data

fragments

filtered texels

buffer data

fixed-function

operations

geometry texture fetches

Transform feedback

vertex buffer objects transform feedback buffer

pixels in framebuffer object textures stenciling, depth testing, blending, accumulation

objects

primitive batch type, vertex indices, vertex attributes

vertex

texture fetches

texture buffer

Texture mapping
Fragment processing
Raster operations

buffer data, unmap objects

Framebuffer

fragment texture fetches

pixel pack buffer

objects

buffer

Command parser
Buffer store

map buffer, get buffer data

image and bitmap fragments

texture image specification

pixel

Pixel packing

pixel image or texture image

specification

unpack buffer objects

OpenGL 4.2

pixels to pack image

rectangles, bitmaps

Image primitive processing
Pixel unpacking
Pixel processing

copy pixels, copy texture image unpacked pixels

Accelerating OpenGL Innovation

NOW!

2012
2011
2010

2006

2008

2009

2004

2005

2007

DirectX 11

DirectX 10.0

DirectX 10.1

DirectX 9.0c

It’s

cooking!

OpenGL 3.3
+
OpenGL 3.1

OpenGL 2.0

OpenGL 2.1

OpenGL 3.0

OpenGL 3.2

OpenGL 4.0

OpenGL 4.1

• OpenGL increased pace of innovation

- Expect 8 new spec versions in four years

- Actual implementations following specifications closely

OpenGL 4.2

• OpenGL 4.2 is a superset of DirectX 11 functionality

- While retaining backwards compatibility

NVIDIA Opengl in 2012 Mark Kilgard

Efficient Supersampling Antialiasing for High-Performance Architectures

Msalvi Temporal Supersampling.Pdf

Discontinuity Edge Overdraw Pedro V

Developer Tools Showcase

Manycore GPU Architectures and Programming, Part 1

1.2 Molecular Dynamics Simulations

Study of Supersampling Methods for Computer Graphics Hardware Antialiasing

Dense and Sparse Parallel Linear Algebra Algorithms on Graphics Processing Units

Sampling & Anti-Aliasing

Reviewer's Guide

Lecture: Manycore GPU Architectures and Programming, Part 1

Aliasing and Supersampling D.A