
GPU-ACCELERATED APPLICATIONS

Accelerated computing has revolutionized a broad range of industries with over four hundred applications optimized for GPUs to help you accelerate your work.

CONTENTS

Computational Finance
Climate, Weather and Ocean Modeling
Data Science & Analytics
Deep Learning and Machine Learning
Public Sector
Manufacturing/AEC: CAD and CAE
    COMPUTATIONAL FLUID DYNAMICS
    COMPUTATIONAL STRUCTURAL MECHANICS
    DESIGN AND VISUALIZATION
    ELECTRONIC DESIGN AUTOMATION
Media and Entertainment
    ANIMATION, MODELING AND RENDERING
    COLOR CORRECTION AND GRAIN MANAGEMENT
    COMPOSITING, FINISHING AND EFFECTS
    EDITING
    ENCODING AND DIGITAL DISTRIBUTION
    ON-AIR GRAPHICS
    ON-SET, REVIEW AND STEREO TOOLS
    WEATHER GRAPHICS
Medical Imaging
Oil and Gas
Research: Higher Education and Supercomputing
    COMPUTATIONAL CHEMISTRY AND BIOLOGY
    NUMERICAL ANALYTICS
    PHYSICS
    SCIENTIFIC VISUALIZATION
Safety & Security

Computational Finance

| APPLICATION | DESCRIPTION | SUPPORTED FEATURES | MULTI-GPU SUPPORT |
| --- | --- | --- | --- |
| Aon Benfield Pathwise™ | Specialized platform for real-time hedging, valuation, pricing and risk management | Spreadsheet-like modeling interfaces, Python-based scripting environment and Grid middleware | Yes |
| Altimesh’s Hybridizer C# | Multi-target C# framework for data parallel computing. | C# with translation to GPU or Multi-Core Xeon | Yes |
| Elsen Accelerated Computing Engine (TM) | Secure, accessible, and accelerated back-testing, scenario analysis, risk analytics and real-time trading designed for easy integration and rapid development. | Web-like API with Native bindings for Python, , Scala, C. Custom models and data streams are easy to add. | Yes |
| Global Valuation Esther | In-memory risk analytics system for OTC portfolios with a particular focus on XVA metrics and balance sheet simulations. | High quality models not admitting closed form solutions, efficient solvers based on full matrix linear algebra powered by GPUs and Monte Carlo algorithms. | Yes |
| Hanweck Associates | Real-time options analytical engine (Volera) | Real-time options analytics engine | Yes |
| MiAccLib 2.0.1 | Accelerated libraries encompassing high speed multi-algorithm search engines, a data security engine and video analytics engines for text processing, encryption/decryption and video surveillance respectively. | Text Processing: Exact Match, Approximate/Similarity Text, Wild Card, MultiKeyword and MultiColumnMultiKeyword, etc. Data Security: Accelerated Encryption/Decryption for AES-128. Video Analytics: Accelerated Intrusion Detection Algorithm. | Yes |
| MISYS Global Risk | Regulatory compliance and enterprise wide risk transparency package. | Risk analytics | Yes |
| Murex MACS Analytics Library | Analytics library for modeling valuation and risk for derivatives across multiple asset classes. | Market standard models for all asset classes paired with the most efficient resolution methods (Monte Carlo simulations and Partial Differential Equations) | Yes |
| Numerical Algorithms Group (NAG) | Random number generators, Brownian bridges, and PDE solvers. | Monte Carlo and PDE solvers | Single only |
| * Numerix | Numerix introduced GPU support for Forward Monte Carlo simulation for Capital Markets and Insurance. | Equity/FX basket models with Black-Scholes/Local Vol models for individual equities and FX. Algorithms: AAD (Automatic Algebraic Differential). New approaches to AAD to reduce time to market for fast Price Greeks and XVA Greeks. | Yes |
| QuantAlea’s Alea.cuBase F# | F# package enabling a growing set of F# capability to run on a GPU | F# for GPU accelerators | Yes |
| RMS | Catastrophic risk modeling for FSI (earthquakes, hurricanes, terrorism, infectious diseases) | Risk analytics | Yes |
| SciComp, Inc | Derivative pricing (SciFinance) | Monte Carlo and PDE pricing models | Single only |
| SunGard - Adaptiv Analytics | A flexible and extensible engine for fast calculations of a wide variety of pricing and risk measures on a broad range of asset classes and derivatives. | Existing model code in C# supported transparently, with minimal code changes. Supports multiple backends including CUDA and OpenCL. Switches transparently between multiple GPUs and CPUs depending on the deal support and load factors. | Yes |
| Synerscope - Synerscope | Visual big data exploration and insight tools | Graphical exploration of large network datasets including geo-spatial and temporal components. | Single only |
| Xcelerit SDK | Software Development Kit (SDK) to boost the performance of Financial applications (e.g. Monte-Carlo, Finite-difference) with minimum changes to existing code. | C++ programming language, cross-platform (back-end generates CUDA and optimized CPU code), supports Windows and operating systems. | Yes |

* Indicates new application
POPULAR GPU‑ACCELERATED APPLICATIONS CATALOG | May17

Climate, Weather and Ocean Modeling

| APPLICATION | DESCRIPTION | SUPPORTED FEATURES | MULTI-GPU SUPPORT |
| --- | --- | --- | --- |
| ACME-Atmosphere | Global atmospheric component model for ACME global coupled climate model | Dynamics only | Yes |
| COSMO | Regional numerical weather prediction and climate model | Radiation only | Yes |
| * GALES | Regional numerical weather prediction model | Full model | Yes |

Data Science & Analytics

| APPLICATION | DESCRIPTION | SUPPORTED FEATURES | MULTI-GPU SUPPORT |
| --- | --- | --- | --- |
| * Altair PBS Professional® | Workload management software | HPC & data center management | Yes |
| BIDMach | The fastest machine learning library available. Holds the record for many common machine learning algorithms. Both BIDMach and its sister library BIDMat originated at UC Berkeley. | Written in Scala and supports Scala and Java interfaces. Supports linear regression, logistic regression, SVM, LDA, K-Means and other operations. | Yes |
| * BlazingDB | GPU-accelerated relational database for data warehousing scenarios available for AWS and on-premise deployment. | Modern data warehousing application supporting petabyte scale applications. | Yes |
| Blazegraph | The first and fastest GPU-accelerated platform for graph analytics. It provides high-level graph database APIs with transparent GPU acceleration for graph query. It delivers graph analytics at over 32 billion traversed edges per second. | Support for RDF/SPARQL and Tinkerpop/Blueprints stack. Scala-based graph analytic and machine learning application language. Ease of integration into Spark and Hadoop. Support for GPU cluster deployment. | Yes |
| * Capio | In-house and Cloud-based Speech Recognition technologies | Real-time and offline (batch) speech recognition, Exceptional accuracy for transcription of conversational speech, Continuous Learning (System becomes more accurate as more data is pushed to the platform) | Yes |
| * Datalogue | Deep learning powered pipelines that automatically ingest data in any format from any , delivering ready to use data for enterprise analytics, BI and data governance workflows. | Automated ontology mapping and detection (including PII and other types of sensitive information); Field standardization; Semi-structured field parsing. | Yes |
| * Deepgram | Deepgram increases your company’s revenue by analyzing your audio data. We use AI to transcribe, spot keywords, and get insights from phone calls, video footage, and online media. | Keyword and phrase search, Speech transcription, Speech analytics for compliance, Topic modeling | Yes |
| * Graphistry | The fastest graph visualization and analysis solution for very large amounts of data. Graphistry is able to present millions of events on a graph within seconds. | Able to show billions of individual connections. Support for CSV, Spark and . | Yes |
| * Gridspace | Voice analytics to turn your streaming speech audio into useful data and service metrics. Instrument your contact / call center and work communications today with powerful deep learning-driven voice analytics | Speech-to-text transcription, Compliance, Call grading, Call topic modelling, Customer service enhancement, Customer churn prediction | Yes |
| Gunrock | Gunrock is a library for graph processing on the GPU. Gunrock achieves a balance between performance and expressiveness by coupling high performance GPU implementations with a high-level programming model that requires minimal GPU programming knowledge. | Direction-optimizing BFS, SSSP, PageRank, Connected Components, Betweenness-centrality | Yes |
| * Intelligent Voice | Intelligent Voice takes your company’s phone calls (and email and IM) and turns them into smart data using World’s Fastest Speech to Text Engine | Keyword and phrase search, Speech transcription, Speech analytics for compliance, Topic modeling | Yes |
| Jedox | Helps with portfolio analysis, management consolidation, liquidity controlling, cash flow statements, profit center accounting, treasury management, customer value analysis and many more applications, all accessible in a powerful web and mobile application or Excel environment. | This database holds all relevant data in GPU memory and is thus an ideal application to utilize the Tesla K40’s 12 GB on-board RAM. Scale that up with multiple GPUs and keep close to 100 GB of compressed data in GPU memory on a single server system for fast analysis, reporting and planning. | Yes |
| Kinetica | In-memory relational database built to leverage the power of GPUs and to process massive amounts of data extremely fast. Full suite of geospatial application capabilities. | Query against Big Data in real time. SQL support. No pre-indexing allows for complex, ad-hoc query chains. Interactively explore large, streaming data sets. | Yes |
| MapD Technologies | MapD is a GPU-powered data exploration platform that combines a database and visual analytics platform to deliver millisecond performance for at-scale data challenges that run to the billions of rows. With speed-ups of 100x to 1,000x over even the fastest CPU-powered solutions, organizations can tackle problems that were previously considered too large, complex or lengthy. | MapD in-memory, column store, relational database supports standard SQL queries and was built from the ground up to take advantage of the parallelism of GPUs. Similarly, the Immerse visual analytics front-end takes advantage of the GPU in novel ways to render billions of rows with millisecond latency, even across challenging tasks such as point maps. | Yes |
| * PolyAnalyst | General purpose corporate-level data & text mining system. Great set of data exploration methods for solution of a wide range of data analysis problems. Primarily targeted at work with big data from retail, banking, insurance, manufacturing and other data-rich business domains. | Practically all popular algorithms are implemented. Decision tree, naive Bayes, SVM, neural networks, logistic regression, bagging and boosting methods, linear and non-linear regression, various methods for time series analysis, k-means, density-based clustering, Kohonen maps, factor analysis, and many others. GPU cluster support is planned in next versions. | Yes |
| * Polymatica | Business analytics platform for fast analytical processing of Big Data using Data Mining algorithms and Machine Learning methods. Polymatica is built on OLAP-in-GPU-memory technologies with full support of GPU acceleration in OLAP ad-hoc operations and Data Mining calculations. | OLAP, Business Intelligence, Data Discovery, , Multidimensional data analysis, Visual analytical work, Interactive dashboards. | Yes |
| Sqream DB | GPU accelerated SQL database engine for big data analytics. Sqream speeds SQL analytics by 100X by translating SQL queries into highly parallel algorithms run on the GPU. | Up to 100TB of raw data can be stored and queried in a standard 2U server. Inserts and analyzes hundreds of billions of records in seconds. No indexes required. No changes to SQL code or data science paradigms required. | Yes |
| * SynerScope | Big data visualization and data discovery platform for data analytics, cyber public sector and on IoT scenarios | Real-time Interaction with data | Yes |
| * Tanay ZX Lib (Fuzzy Logic) | Financial analytics and data mining library | Monte Carlo simulations, pricing of vanilla and exotic options, fixed income analytics, data mining. | Yes |

Deep Learning and Machine Learning

| APPLICATION | DESCRIPTION | SUPPORTED FEATURES | MULTI-GPU SUPPORT |
| --- | --- | --- | --- |
| ANACONDA | Anaconda is the leading Open Data Science platform powered by Python, the fastest growing data science language. It is a free, high-performance Python & R distribution with 1000+ curated packages. | Anaconda has been downloaded over 15M times and is used for AI & ML data science workloads using TensorFlow, Theano, Keras, Caffe, Neon, Lasagne, NLTK, spaCy. Anaconda’s Numba is a revolutionary Python-to-GPU compiler that compiles easy-to-read Python code to many-core and GPU architectures. Also includes single-line install of key deep learning packages for GPUs. | Yes - For Deep Learning Packages and Numba |
| ANACONDA Enterprise | Anaconda Enterprise takes Anaconda to the next level and makes it easy, secure, and manageable to scale powerful analytics workflows from the to the server and then scaled out to your cluster, while also incorporating collaboration, publishing, security, and Hadoop-optimized deployment. | Anaconda Enterprise opens up the full capabilities of your GPU or multi-core processor to the Python programming language. Common operations like linear algebra, random number generation, FFT and Monte Carlo simulation run faster, and take advantage of multiple cores. Identify and remedy performance bottlenecks easily with data, code and in-notebook profilers. Includes bindings to CUDA libraries: cuBLAS, cuFFT, cuSPARSE, cuRAND, and sorting algorithms from the CUB and Modern GPU libraries. | Yes |
| BIDMach | GPU-accelerated classical machine learning library | Logistic regression, SVM, LDA, SFA, NMF, ICA, random forests, clustering, word2vec | Yes |
| * Bons.ai | Bons.ai is an platform which abstracts away the low-level, inner workings of machine learning systems to empower more developers to integrate richer intelligence models into their work. | Easy to use programming interface | Yes |
| Caffe | The Caffe deep learning framework makes implementing state-of-the-art deep learning easy. | Process over 40M images per day with a single NVIDIA K40 or GPU. | Single only |
| Caffe* Parallel | This is a faster framework for deep learning, forked from BVLC/caffe (master branch). It allows data-parallel training via MPI. | Using the GPU cluster for processing mass image data | Yes |
| Chainer | DL framework that makes the construction of neural networks (NN) flexible and intuitive. | Dynamic NN construction, which makes debugging easier. CPU/GPU-agnostic coding, which is promoted by CuPy, a partially NumPy-compatible multidimensional array library for CUDA. Data-dependent NN construction, which fully exploits the control flows of Python without magic. | Yes |
| Clarifai | Clarifai brings a new level of understanding to visual content through deep learning technologies. Clarifai uses GPUs to train large neural networks to solve practical problems in advertising, media, and search across a wide variety of industries. | GPU-based training and inference. Recognizes and indexes images with predefined classifiers, or with custom classifiers. | Yes |
| CNTK | Microsoft’s Computational Network Toolkit (CNTK) is a unified computational network framework that describes deep neural networks as a series of computational steps via a directed graph. | Supports many applications, including Speech Recognition, Machine Translation, Image Recognition, Image Captioning, Text Processing and Relevance, Language Understanding, Language Modeling | Yes |
| * Cylance | Advanced machine learning end point malware detection solution | End point malware detection built using GPU deep learning technology. | Yes |

| APPLICATION | DESCRIPTION | SUPPORTED FEATURES | MULTI-GPU SUPPORT |
| --- | --- | --- | --- |
| * DeepBench | The primary purpose of DeepBench is to benchmark operations that are important to deep learning on different hardware platforms. | DeepBench consists of a set of basic operations (dense matrix multiplies, convolutions and communication) as well as some recurrent layer types. Both forward and backward operations are tested. This first version of the benchmark will focus on training performance in 32-bit floating-point arithmetic. | Yes |
| Deeplearning4j | Deeplearning4j is the most popular deep learning framework for the JVM, and includes all major neural nets such as convolutional, recurrent (LSTMs) and feedforward. | Integrates with Hadoop and Spark to run distributed. Java and Scala APIs. Composable framework that facilitates building your own nets. Includes ND4J, the NumPy for Java. | Yes |
| * DeepInstinct | Zero day end point malware detection | Zero-day threats & APT attack detection on endpoints, servers and mobile devices. | Yes |
| Dextro | Dextro’s API uses deep learning systems to analyze and categorize videos in real-time. | Object and scene detection, Machine transcription for audio, Motion and movement detection. | Yes |
| * H2O | H2O is a popular machine learning platform which offers GPU-accelerated deep learning by integrating popular deep learning frameworks. | Supports TensorFlow, Caffe and MXNet | Yes |
| IntelligentVoice | Far more than a transcription tool, this speech recognition software learns what is important in a telephone call, extracts information and stores a visual representation of phone calls to be combined with text/instant messaging and E-mail. Intelligent Voice’s search and alert makes it possible to tackle issues before they arise, address data security concerns and monitor physical to data. | Advanced Speech Recognition across large data sets, JumpTo Technology, for data visualisation, E-Discovery, extraction from phone calls, IM & Email defining key phrases and emotional analysis. Compliance, defining key conversations and interactions | Yes |
| * Keras | Keras is a minimalist, highly modular neural networks library, written in Python, and capable of running on top of either TensorFlow or Theano. Keras was developed with a focus on enabling fast experimentation. | cuDNN version depends on the version of TensorFlow and Theano installed with Keras. Supported Interfaces: Python | Yes |
| Labellio | The world’s easiest deep learning web service for computer vision, which allows everyone to build their own image classifier with only a web browser. | Neural net fine-tuning for image data, data crawling, data browsing as well as drag-and-drop style data cleansing backed by AI support. | Yes |
| MatConvNet | CNNs for MathWorks MATLAB, allows you to use MATLAB GPU support natively rather than writing your own CUDA code | Building Blocks, Simple CNN wrapper, DagNN wrapper, cuDNN implemented | Yes |
| * Meson | Netflix’s general purpose workflow orchestration and scheduling framework built to manage ML pipelines that execute workloads across heterogeneous systems. | It manages the lifecycle of several ML pipelines that build, train and validate personalization algorithms that drive video recommendations. | Yes |
| MetaMind | Provides a deep learning API for image recognition and text . Uses either prebuilt, public, or custom classifiers. | GPU-based training and inference. Recognizes images and analyzes text, creates and trains classifiers with tooling for uploading and managing datasets. | Yes |
| * MXNet | MXNet is a deep learning framework designed for both efficiency and flexibility that allows you to mix the flavors of symbolic programming and imperative programming to maximize efficiency and productivity. | MXNet supports cuDNN v5 for GPU acceleration. | Yes |

| APPLICATION | DESCRIPTION | SUPPORTED FEATURES | MULTI-GPU SUPPORT |
| --- | --- | --- | --- |
| Neon | Neon is a fast, scalable, easy-to-use Python based deep learning framework that has been optimized down to the assembler level. Neon features a rich set of example and pre-trained models for image, video, text, deep reinforcement learning and speech applications. | Supported Interfaces: Python, R, C++, Julia | Yes |
| * PaddlePaddle | PaddlePaddle (PArallel Distributed Deep LEarning) is an easy-to-use, efficient, flexible and scalable deep learning platform, originally developed by Baidu scientists and engineers for the purpose of applying deep learning to many products at Baidu. | Optimized math operations through SSE/AVX intrinsics, BLAS libraries (e.g. MKL, ATLAS, cuBLAS) or customized CPU/GPU kernels. Highly optimized recurrent networks which can handle variable-length sequences without padding. Optimized local and distributed training for models with high dimensional sparse data. | Yes |
| TensorFlow | Google’s TensorFlow is an open source software library for numerical computation using data flow graphs. Nodes in the graph represent mathematical operations, while the graph edges represent the multidimensional data arrays (tensors) communicated between them. | TensorFlow is flexible, portable and performant, creating an open standard for exchanging research ideas and putting machine learning in products. | Yes |
| Theano | Theano is a symbolic expression compiler that powers large-scale computationally intensive scientific investigations. | Abstract expression graphs for transparent GPU acceleration. | Yes |
| Torch7 | Torch7 is an interactive development environment for machine learning and computer vision. | Computational back-ends for multicore GPUs. | Single only |
| Trakomatic OSense, Otrack | Video Analytics Solution for retail, supermarkets, shopping mall and banking. | People detection & tracking, Crowd density estimation, Gender classification and age estimation, Person re-identification. | Yes |
| * UETorch | It provides an embedded Torch environment within the powerful Unreal Engine 4. This allows one to have deep learning models directly interact with the game world, and paves the way for powerful research. An example of doing AI research using UETorch is for a neural network to learn physics and intuition about the real world. | Game interaction and physics, CUDA-optimized deep learning and neural networks. cuDNN supported. | Yes |

Public Sector

| APPLICATION | DESCRIPTION | SUPPORTED FEATURES | MULTI-GPU SUPPORT |
| --- | --- | --- | --- |
| Comprimato JPEG2000 Codec | A high performing, GPU powered, JPEG2000 encoder and decoder SDK which can be integrated into almost any application. | Very large image processing, specific area decoding, multi resolution/quality decoding supporting all GEOSPATIAL image formats (e.g. NITF, BIIF). Mobile and embedded platform friendly. | Yes |
| DigitalGlobe - Advanced Ortho Series | Geospatial visualization | Image orthorectification | Yes |
| Elcomsoft | High-performance distributed password recovery software with NVIDIA GPU acceleration and scalability to over 10,000 . | GPU acceleration for password recovery, 10-100x speedup for password recovery. | Yes |
| Esri ArcGIS for Desktop (ArcMap and ArcGIS Pro) - Spatial Analyst and 3D Analyst | Determines the raster locations visible to a set of observer features, using geodesic methods. | Viewshed2 transforms the elevation surface into a geocentric 3D coordinate system and runs 3D sightlines to each transformed cell center. | Yes |
| Eternix - Blaze Terra | Geospatial visualization | 3D visualization of geospatial data | Yes |
| GeoWeb3d Desktop | Geospatial visualization | 3D visualization of geospatial data | Yes |
| Harris ENVI | Image Processing and Analytics | Image orthorectification, Image transformation, atmospheric correction, Panchromatic co-occurrence texture filter | Yes |
| Herta Security - BioSurveillance NEXT, BioFinder | Real time facial recognition and forensic alerts against multiple watchlists. | Supports crowded scenes, difficult lighting, faster than real-time analysis, partial face concealment. | Yes |
| Intergraph Motion Video Analyst | Video filters and mosaic’ing - Geo-fuses FMV analytics with intelligence data. | Full motion video ortho mosaic processing, de-hazing algorithms. | Single only |
| Intuvision Panoptes 3.0 | Video analytics | Object recognition and change detection | Yes |
| LuciadLightspeed | Geospatial visualization and analysis | Geospatial situational awareness | Single only |
| Manifold Systems | Full-featured GIS, vector/raster processing & analysis | Manifold surface tools | Yes |
| MotionDSP - Ikena ISR | Real-time full motion video (FMV) and wide-area motion imagery (WAMI) enhancement and computer-vision-based analytics software for intelligence analysts | Real-time super-resolution-based video enhancement on live streams, geospatial visualization, target detection and tracking, and fast 2-D mapping | Yes |
| NerVve Visual Search Solution (NVSS) | Video/Image Live and Forensic Search | Video and image content search | Yes |
| OpCoast SNEAK | Electromagnetic signals propagation modeling for complex urban and terrain environments. | , DTED and remote sensing inputs. | Yes |
| PCI Geomatics GXL | Image processing | Image orthorectification and additional image processing | Yes |
| Skyline Software - Terrabuilder PhotoMesh | PhotoMesh integrates a GPU-based, fast algorithm, able to automatically build 3D models from simple photographs. PhotoMesh revolutionizes the use of geospatial data by fully automating the generation of high-resolution, textured, 3D mesh models from standard 2D images. | 3D model building from imagery; building texture generation. | Yes |
| SocetGXP - BAE Systems | The Automatic Spatial Modeler (ASM) is designed to generate 3-D point clouds with accuracy similar to LiDAR, which can extract 3-D objects from stereo images. ASM can extract dense 3-D point clouds from stereo images, and extract accurate building edges and corners from stereo images with high resolution, large overlaps, and high dynamic range. | Automated 3D feature extraction | Yes |
| SynerScope | Big data visualization and data discovery, for combining Analytics on Analytics with IoT compute-at-the-edge smart sensors. | Real-time Interaction with data | Single only |

Manufacturing/AEC: CAD and CAE

COMPUTATIONAL FLUID DYNAMICS

| APPLICATION | DESCRIPTION | SUPPORTED FEATURES | MULTI-GPU SUPPORT |
| --- | --- | --- | --- |
| Altair AcuSolve | General purpose CFD software | Linear equation solver | Yes |
| ANSYS - Fluent | General purpose CFD software | Radiation heat transfer model, linear equation solver | Yes |
| ANSYS - Polyflow | CFD software for the analysis of polymer and glass processing | Direct Solvers | Yes |
| Autodesk - Moldflow | Plastic mold injection software | Linear equation solver | Single only |
| CPFD Barracuda-VR and Barracuda | Fluidized bed modeling software | Linear equation solver, particle calculations | Single only |
| DHI - MIKE 21 | 2D hydrological modelling of coast and sea | Hydrodynamics; Advection-dispersion; sand and mud transport; coupled modelling; particle tracking; oil spill; ecological modelling; agent based modelling; various wave models. | Yes |
| DHI - MIKE FLOOD | 1D & 2D urban, coastal, and riverine flood modelling | Hydrodynamics | Yes |

| APPLICATION | DESCRIPTION | SUPPORTED FEATURES | MULTI-GPU SUPPORT |
| --- | --- | --- | --- |
| FluiDyna - Culises for OpenFOAM | Solver library for general purpose CFD software | Linear equation solvers | Yes |
| FluiDyna nanoFluidX | Meshless CFD solver (Smoothed Particle Hydrodynamics, SPH) | Single/multi-phase flows, thermal, moving/rotating geometries, inlet/outlet boundary conditions | Yes |
| FluiDyna ultraFluidX | Lattice-Boltzmann-based CFD solver for ground transportation aerodynamics | Single-phase flows, isothermal, integrated volume mesh generation, local refinement, LES turbulence modeling | Yes |
| * HiFUN - by Sandl | High Resolution Flow Solver on Unstructured Meshes. State-of-the-art Euler/RANS solver. Super scalability on massively parallel HPC platforms. The code is ported using OpenACC directives for NVIDIA GPU. | HiFUN imbibes most recent CFD technologies; many of them home grown. HiFUN exhibits highly scalable parallel performance with its ability to scale up to several thousand processors on massively parallel platforms. Capable of handling complex geometries and flow physics arising in high lift flows. | Yes |
| midas NFX(CFD) | General purpose CFD software based on FEM | Linear equation solver (Iterative Solver and AMG Preconditioner) | Single only |
| Numeca Fine/Turbo | Software product: a structured, multi-block, multi-grid CFD solver targeting the turbo machinery industry | Multi-grid solver | Yes |
| Prometech - Particleworks | Particle-based CFD software | Implicit and explicit solvers | Yes |
| * Realflow - DYVERSO | 3D modeling, animation, and rendering | Fluid solver (DY-SPH, DY-PBD) | Single only |
| Turbostream Ltd. | CFD software for turbomachinery flows | Explicit solver | Yes |
| Vratis Speed IT FLOW | Incompressible single-phase CFD software | Finite volume solver | Single only |
| Vratis SpeedIT for OpenFOAM | Solver library for general purpose CFD software | Linear equation solvers | Yes |
| * Zeus Numerix | Simulation of flow around buildings | Discrete computational technique | Yes [underway] |

Research CFD Developments

| APPLICATION | DESCRIPTION | SUPPORTED FEATURES | MULTI-GPU SUPPORT |
| --- | --- | --- | --- |
| DualSPHysics | SPH-based CFD software | SPH model | Yes |
| * ELBE | Lattice Boltzmann Method (LBM) flow solver | LBM solver | Yes |
| FEFLO (GMU - Lohner) | General purpose CFD software for compressible and incompressible flows | Implicit and explicit solver | Yes |
| GIN3D (Boise St - Senocak) | General purpose CFD software for incompressible flows | Implicit solver | Yes |
| HiFiLES (Stanford - Jameson) | General purpose CFD software for compressible flows. | Explicit solver | Yes |
| HiPSTAR (University of Southampton - Sandberg) | CFD software for compressible reacting flows | Explicit solver | Yes |
| * INCOMP3D | Fully implicit 3D incompressible flow solver | Linear solver | Yes |
| JENRE, Propel (NRL) | CFD software for compressible flows | Explicit solver | Yes |
| NASA FUN3D | General purpose CFD software | Linear equation solver | Single only |
| PyFR (Imperial College - Vincent) | General purpose CFD software for compressible flows. | High-order FR solver | Yes |
| S3D (Sandia and Oak Ridge NL) | Direct numerical solver (DNS) for turbulent combustion | Chemistry model | Yes |

COMPUTATIONAL STRUCTURAL MECHANICS

| APPLICATION | DESCRIPTION | SUPPORTED FEATURES | MULTI-GPU SUPPORT |
| --- | --- | --- | --- |
| Altair OptiStruct | Industry proven, modern structural analysis solver and solution for structural design and optimization. | Direct solvers | Single only |
| Altair RADIOSS Implicit | Simulation and analysis tool for structural mechanics | Iterative solvers | Yes |
| ANSYS - Mechanical | Simulation and analysis tool for structural mechanics | Direct and iterative solvers | Yes |
| Dassault Systèmes SIMULIA Abaqus/Standard | Simulation and analysis tool for structural mechanics | Direct sparse solver | Yes |
| Dassault Systèmes SIMULIA 3DEXPERIENCE | Realistic simulation solution (Uses Abaqus Standard for GPU computing). | Direct sparse solver | Single only |
| Impetus Afea | Predicts large deformations of structures and components exposed to extreme loading conditions. | Non-linear Explicit Finite-Element Solver | Yes |
| LS-DYNA Implicit | Simulation and analysis tool for structural mechanics | Linear equation solver | Yes |
| midas GTS NX | Simulation tool for geo-technical analysis | Linear equation solver (Multi Frontal Solver) | Single only |
| midas NFX(Structural) | Simulation and analysis tool for structural mechanics | Linear equation solver (Multi Frontal Solver) | Single only |
| MSC - Marc | Simulation and analysis tool for structural mechanics | Direct sparse solver | Yes |
| MSC Nastran | Simulation and analysis tool for structural mechanics | Direct sparse solver | Yes |
| Rocky DEM | Discrete Element Modeling (DEM)-based particle simulation software. | Explicit DEM solver (dry/sticky contact rheologies), 1-way & 2-way coupling with ANSYS Fluent and ANSYS Mechanical. | Single only |
| Siemens NX Nastran | Simulation and analysis tool for structural mechanics. | Linear equation solver | Single only |

DESIGN AND VISUALIZATION

| APPLICATION | DESCRIPTION | SUPPORTED FEATURES | MULTI-GPU SUPPORT |
| --- | --- | --- | --- |
| Allegorithmic Substance Designer | Material edition, market reference for procedural texture creation. | Iray rendering including textures/substances and bitmap texture export to render in any Iray powered compatible with MDL. | Yes + NVIDIA VCA |
| Allegorithmic Substance Painter | Intuitive interactive 3D painting software with physics and particle support. | Iray rendering to enhance all artwork released with the software | Yes |
| Autodesk - AutoCAD | 2D and 3D CAD design, drafting, modeling, architectural drawing, and engineering software. Supports OpenGL. Native DWG™ support. GRID support. | Surface, mesh, and solid modeling tools, model documentation tools, parametric drawing capabilities. Native DWG™ support. | Single only |
| Autodesk - AutoCAD Design Suite | AutoCAD 2014 software, plus tools to create, capture, connect, and showcase designs. | 2D/3D display of designs, interactive 3D presentation with realistic materials, rendering-ray tracing. | Single only |
| Autodesk - 3ds Max | 3D animation creative toolset for modeling, animation, simulation, and rendering for product and building designs. | 3D modeling, mesh and surface modeling, improved Nitrous viewport performance, Iray rendering. | Yes |
| Autodesk - Inventor | 3D mechanical design, documentation, and product simulation. | Uses BIM for intelligent building components to improve design accuracy. | Single only |
| * Autodesk - ReMake | ReMake is a solution for converting reality captured with photos or scans into high-definition 3D meshes. These meshes can be cleaned up, fixed, edited, scaled, measured, re-topologized, decimated, aligned, compared and optimized for downstream workflows entirely in ReMake. | Generation of 3D meshed models from laser scans or photos of an object. GPU accelerated photogrammetry process from 2D to 3D. 3D model display accelerated by GPUs for smooth navigation of converted models in all display modes. | Yes |

Autodesk - Revit | Building Information Modeling (BIM) for architecture, engineering, and construction. | Modeling (BIM) to design, build, and maintain higher-quality, more energy-efficient buildings. GRID support. | Single only
* Autodesk - Stingray | The Stingray engine includes 3D game creation tools, design visualization, real-time 3D rendering, and support. Stingray has great workflows with 3ds Max, Maya, and Maya LT. | Fully featured viewing technology accelerated by GPUs for core graphics display as well as complete VR workflows. | Yes
Autodesk - VRED | VRED™ 3D visualization software helps automotive designers and engineers create product presentations, design reviews, and virtual prototypes. Use Digital Prototyping to quickly visualize ideas and evaluate designs. | Enhanced geometry behavior, Automotive product interoperability, Navigation in a scene, Import Alias layer structure, Asset Manager improvements, Integrated file converter, Analytic rendering modes, Gap Analysis tool, Oculus Rift support, Animation module, Multiple rendering modes, Subsurface scattering, Displacement mapping | Yes
Cast Software - WYSIWYG | The WYSIWYG software products, designed specifically for lighting professionals, offer a range of solutions to meet the needs of designers, assistants, electricians, console operators, teachers, and students. | The speed of wysiwyg’s Shaded Views depends entirely on the GPU; the GPU has an easier time rendering ten risers consolidated into one mesh than rendering them as individual risers. wysiwyg also supports NVIDIA SLI technologies. | Yes
* Chaos Group - V-Ray RT | GPU renderer | CUDA interactive GPU rendering | Yes
Dassault Systèmes - 3DEXPERIENCE CATIA | R2017x highly accelerated and improved real-time engine with native VR support and optimized GPU scaling. | Load and render smoothly your large assembly models with Substance support for gamelike experience with native professional CAD data. Experience your CAD model design in VR with no data transformation. | Single only
Dassault Systèmes - CATIA Live Rendering | Realistic 3D Rendering on full CATIA 3D CAD model | Physically Based Rendering with no data preparation thanks to native NVIDIA Iray Photoreal integration and interactive realistic rendering using NVIDIA Iray IRT. | Yes + NVIDIA Quadro VCA
Dassault Systèmes - 3DEXCITE DeltaGen | Redefines high-end 3D visualization and realtime interaction. This latest version gives users a broad suite of robust new features to truly revolutionize processes and help increase visual quality, speed, and flexibility. | Interactive ray tracing and global illumination. Integration with Siemens TeamCenter. Cluster support Realtime & Offline Production Process Integration and scene building. Scene Analysis, Xplore DeltaGen, SDK for DeltaGen. | Yes
Dassault Systèmes - SOLIDWORKS | Covers all aspects of product development process with a seamless, integrated workflow: design, verification, sustainable design, communication and data management. Real time photorealistic renderings with SOLIDWORKS Visualize, an Iray-based application. | High performance in Shaded, Shaded w/ Edges, and RealView modes, FSAA for sharp edges, Order Independent Transparency | Single only
Dassault Systèmes - SOLIDWORKS Visualize | Easy to use photorealistic rendering software | Iray-based ray-tracing, animation support, network rendering. | Yes + NVIDIA Quadro VCA
* ESI Group - IC.IDO | 3D immersive virtual prototyping solution with real-time physics simulation | High performance optimized OpenGL pipeline built on NV Pro Pipeline | Yes
NVIDIA Iray | A ready-to-integrate, physically-based, photorealistic rendering solution. | Iray Interactive; Iray Photoreal; Iray Cluster. Fast interactive ray tracing; Physically-based, global-illumination rendering; Distributed cluster rendering. | Yes
* Optis VRXPERIENCE | Professional VR experience for training and validation | Run your professional CAD data with haptics feedback powered by PhysX and accurate light simulation powered by Optis SPEOS (SPEOS powered by CUDA soon) | No
Otoy - Octane Render | GPU renderer | GPU rendering | Yes

PTC - Creo Parametric | Parametric design solution suite. | Anti-aliasing, better lighting and enhanced shaded-with-edges mode. Immersive design environment with realistic materials. GRID Support. Support for enhanced line display generated with GPU support. | Single only
Siemens PLM Software NX and Teamcenter | Product lifecycle management solutions from design to simulation to production to service. | Design software, NX, and PLM viewer applications, TcVis and Active Workspace. GRID support. | Single only
Top Systems T- CAD | 3D and 2D parametric design, simulation, photorealistic rendering. | High performance visualization, real time photorealistic rendering | Yes

ELECTRONIC DESIGN AUTOMATION
APPLICATION | DESCRIPTION | SUPPORTED FEATURES | MULTI-GPU SUPPORT
Altair FEKO | 3D EM modeling and simulation | FDTD solver, MoM solver, CMA Solver | Yes; Single for FDTD solver
ANSYS - HFSS | Simulation tool for modeling 3-D full-wave electromagnetic fields in high-frequency and high-speed electronic components. | Transient solver | Yes
ANSYS - Nexxim | Circuit simulation engine for RF/analog/mixed-signal IC design; IBIS-AMI analysis speedup with GPU computing. | AMI analysis | Single only
ANSYS - Savant | Simulation tool for installed antenna performance and antenna-to-antenna coupling. | High-frequency solver | Yes
CST STUDIO SUITE® and CST MICROWAVE STUDIO® | Accurate and efficient computational solution for 3D simulation of electromagnetic devices in a wide range of frequencies. | Transient Solver, Integral Equation Solver, Asymptotic Solver, Multilayer Solver | Yes
* CST STUDIO SUITE® and CST MPHYSICS® STUDIO | Multiphysics simulation including thermal, CFD and mechanical capabilities. Tightly integrated with CST’s electromagnetic solvers. | Conjugated Heat Transfer Solver | Yes
* D2S CDP | GPU-Acceleration of real-time in-line enhancement of manufacturing equipment | Simulation-based processing | Yes
D2S TrueMask® MDP | GPU-accelerated simulation and data preparation for mask writing | Simulation-based processing | Yes
* D2S TrueModel® | GPU-accelerated simulation and geometric checking of curvilinear shapes | Simulation-based processing | Yes
JMAG | FEA software for electromechanical design. Fast solver / High quality mesh / Advanced modeling technologies. | EM transient solver, EM time harmonic solver, EM static solver | Yes
KeySight - ADS | Simulation tool for design of RF, microwave and high speed digital circuits. | Transient Convolution simulation with BSIM4 models | Single only
KeySight - EMPro | Modeling and simulation environment for analyzing 3D EM effects of high speed and RF/Microwave components. | FDTD solver | Yes
* Lucernhammer-Serenity | EM simulation (RCS solver) tool | MoM | Yes
Remcom - XFdtd | 3D EM modeling and simulation | FDTD solver | Yes
* Remcom - Xstream | 3D EM simulation | FDTD solver | Yes
* Remcom - Wireless InSite | Uses OptiX 3.8 for ray tracing and propagation prediction | X3D ray tracer | Yes
SPEAG - SEMCAD-X | 3D EM modeling and simulation | FDTD solver | Yes
* VSim for Electromagnetics | Physics simulation and modeling software for EM | FDTD | Single only
* WIPL-D | 2D EM Simulation tool | Frequency domain method of moments | Yes (max. 3 GPUs)
* ZMT Zurich MedTech AG - Sim4Life | 3D EM & Acoustic modeling and simulation | FDTD & Acoustic solvers | Yes

Media and Entertainment

ANIMATION, MODELING AND RENDERING
APPLICATION | DESCRIPTION | SUPPORTED FEATURES | MULTI-GPU SUPPORT
3DAliens - Glu3d | SPH fluid simulation | Faster simulation | Single only
AAA Studio - FurryBall | GPU renderer | CUDA and DirectX GPU rendering | Single only
Autodesk - 3ds Max + NVIDIA iray | 3D modeling, animation, and rendering | Iray interactive, photorealistic and physically correct rendering | Yes
Autodesk - Maya | 3D modeling, animation, and rendering | Increased model complexity, larger scenes | Yes
Autodesk - Motion Builder | Character animation and motion capture | Increased model complexity at interactive rates | Single only
Autodesk - Mudbox | 3D sculpting | Increased model complexity at interactive rates | Single only
Blastcode - Kilton/Megaton | Physics-based simulation plug-in | Faster simulation | Single only
Cebas - moskitoRender | GPU renderer | CUDA-based GPU rendering | Yes
Chaos Group - V-Ray RT | GPU renderer | CUDA interactive GPU rendering | Yes
Jawset - TurbulenceFD | Physics-based simulation plug-in | GPU simulation using CUDA | Single only
Maxon - | 3D modeling, animation, and rendering | Increased model complexity at interactive rates | Single only
NewTek - Lightwave | 3D modeling, animation, and rendering | Increased model complexity at interactive rates | Single only
* Next Limit - Maxwell | GPU renderer | CUDA-accelerated rendering | Yes
Otoy - Octane Render | GPU renderer | GPU rendering | Yes
Pixologic - Sculptris | 3D sculpting | Increased model complexity at interactive rates | Single only
Redshift - Renderer | GPU-accelerated, biased renderer | CUDA-based GPU final-frame rendering | Yes
Side Effects - Houdini | 3D simulation and rendering | GPU simulation using OpenCL | Single only
The Foundry - Mari | 3D paint | Increased model complexity at interactive rates | Single only
The Foundry - Modo | 3D modeling, animation and rendering | Increased model complexity, larger scenes | Single only

COLOR CORRECTION AND GRAIN MANAGEMENT
APPLICATION | DESCRIPTION | SUPPORTED FEATURES | MULTI-GPU SUPPORT
Adobe - SpeedGrade CC | Color grading | Real-time grading and finishing with Lumetri Deep Color Engine. | Single only
ARRI - RAW Converter | RAW de-bayering and primary color grading | CUDA-accelerated de-bayering and grading | Single only
Assimilate - Scratch | Color grading and finishing | Accelerated de-bayering for real-time digital finishing | Single only
Blackmagic Design - DaVinci Resolve | Color grading and editing | Real-time color correction and de-noising | Yes
Canon - Cinema RAW SDK | RAW de-bayering | GPU-accelerated de-bayering | Single only
Cinnafilm - Dark Energy | Application and plug-in for image enhancement | Image de-noising and restoration | Yes
Digital Vision - Nucoda | Color grading | De-bayering for color correction | Single only
Fastvideo - Fast CinemaDNG | CUDA software for extremely fast RAW video & photo processing with benchmark option | High quality GPU-based RAW video processing, up to 160 fps speed, more than 4K resolution, sophisticated (wavelet) realtime denoising (pre and post bayer), all standard color correction features and monitoring options, export to 16-bit TIF or 10-bit ProRes | Yes
Fastvideo - GPU Debayer | High performance GPU debayer | High performance debayer on CUDA | Yes

* FilmLight - Baselight | Color grading | Real-time color correction | Yes
Marquise Technologies - Rain | Color grading | CUDA-based real-time color correction | Single only
Red Digital Cinema - REDCINE-X PRO | Primary color grading | CUDA-accelerated de-bayering and grading | Single only
Red Giant - Magic Bullet Looks | Color and finishing tools | Faster effects | Single only
Snell Advanced Media - Pablo Rio | Color grading and finishing | Real-time color correction | Yes
SGO - Mistika | Color grading and finishing | Real-time color correction and finishing | Single only
The Foundry - COLORWAY | Color grading | Accelerated color grading | Single only
The Pixel Farm PFClean | Image restoration and remastering | CUDA-based image processing acceleration | Single only
Wavelet Beam - Grain and Noise Reducer | Video noise reduction | CUDA-accelerated grain and noise reduction | Yes

COMPOSITING, FINISHING AND EFFECTS
APPLICATION | DESCRIPTION | SUPPORTED FEATURES | MULTI-GPU SUPPORT
Adobe - After Effects CC | Motion graphics and effects | 3D ray tracing engine based on NVIDIA OptiX | Yes
Autodesk - Flame Premium | Finishing and color grading | Integrated toolset for 3D VFX, editorial, and color grading | Yes
Blackmagic Design - Fusion | Effects and compositing | Faster effects | Single only
Boris FX - Continuum Complete | Visual effects plug-in | Faster effects | Single only
Boris FX - Monsters GT | Visual effects plug-in | Faster effects | Single only
Boris FX - Sapphire | Visual effects plug-in | Faster effects | Single only
CoreMelt - Complete | Visual effects plug-in | Faster effects | Single only
Neat Video - Open FX | Video noise reduction plug-in | Faster effects | Single only
NewBlueFX - Video Essentials | Video effects plug-in | Faster effects | Single only
Pixelan - FilmTouch | Video effects plug-in | Faster effects | Single only
Re:Vision Effects - Twixtor | Visual effects plug-in | Faster effects | Single only
Red Giant - Effects Suite | Visual effects plug-in | Faster effects | Single only
ROBUSKEY | Chroma keyer plug-in | Faster effects | Single only
SGO - Mamba FX | High-end compositing | Faster keying, tracking, painting and restoration | Single only
The Foundry - HIERO | Shot management, conform and review timeline | Better interactivity | Single only
The Foundry - NUKE, NUKEX and NUKE Studio | Compositing tools with 3D tracker | Faster effects | Single only
Video Copilot - Element 3D | 3D object based particle system | Faster effects | Yes
Video Copilot - Twitch | Video effects plug-in for After Effects | Faster effects | Single only

EDITING
APPLICATION | DESCRIPTION | SUPPORTED FEATURES | MULTI-GPU SUPPORT
* Adobe - Illustrator CC | Digital design | Accelerated canvas for faster pan and zoom. Optimized for NVIDIA based on NV Path Rendering | Single only
* Adobe - Lightroom CC | Photo editing | Faster photo edits throughout entire Develop module | Single only

* Adobe Media Encoder | Video editing | Faster output rendering based on Mercury Playback Engine | Yes
Adobe - Photoshop CC | Image editing | Over 30 effects for smoother image manipulation in Mercury Graphics Engine | Single only
Adobe - Premiere Pro CC | Video editing | Real-time video editing & accelerated output rendering based on Mercury Playback Engine | Yes
Apple - Final Cut Pro | Video editing | Faster effects | Single only
Autodesk - Smoke | Finishing and editing | Faster effects | Single only

Avid - Media Composer | Video editing | Faster video effects, unique stereo 3D capabilities | Single only
EditShare - Lightworks | Video editing | Faster effects | Single only
Grass Valley - Edius Pro | Video editing | Faster effects | Single only
Imagine Communications - Velocity | Video editing | Faster effects | Single only
Magix - Vegas Pro | Video editing | Faster video effects and encoding | Single only
Snell Advanced Media - Qube | Broadcast video editing | Faster video effects, unique stereo 3D capabilities | Single only
- Catalyst Browse, Prepare and Edit | Video editing | Faster effects, transitions and encoding | Single only

ENCODING AND DIGITAL DISTRIBUTION
APPLICATION | DESCRIPTION | SUPPORTED FEATURES | MULTI-GPU SUPPORT
ArcVideo - Core | Video processing and transcoding | Accelerated transcoding and encoding | Yes
ArcVideo - Live | High-density, real-time video processing and encoding. | Accelerated broadcast encoding with NVIDIA CUDA and NVENC. | Yes
Cinnafilm - Tachyon | Standards conversion | Video processing and frame rate conversion | Yes
Comprimato - JPEG2000 Codec | JPEG2000 encoding and decoding for DCP, IMF, video editing, broadcast contribution, and archiving. | Faster than real-time UltraHD / 4K, lossy and mathematically lossless, high bit-depth (HDR), performance scalable, GPU accelerated. | Yes
Dalet - Amberfin | Transcoding and video quality analysis | GPU-accelerated video processing and encoding | Single only
Elemental - Elemental Live | Live streaming video processing and encoding | Video encoding and video processing | Yes
Elemental - Elemental Server | File-based video processing and encoding | Video encoding and video processing | Yes
ERLAB - Multiplatform Transcoder | Video processing and encoding software | Pre-processing encoding, decoding, post-processing and delivery | Single only
Fastvideo - GPU Image Processing SDK | Full image processing pipeline on CUDA | Full image processing pipeline on GPU for real-time imaging applications: Flat Field correction, Demosaicing, Denoising, Color correction, LUT, Resize, Sharp, OpenGL output, JPEG, JPEG2000, Raw Bayer, H.264 encoding | Yes
Fastvideo - H.264 encoder | H.264 encoding on GPU | NVENC accelerated video encoding | Yes
Fastvideo - SDK | JPEG, JPEG2000, Raw Bayer codecs | Fast JPEG, JPEG2000, Raw Bayer encoding and decoding on CUDA | Yes
Interra - Baton | Video quality analysis | GPU accelerated video quality assessment | Single only
isovideo - Viarte | Video standards conversion | CUDA-accelerated video processing and encoding | Yes
METUS - Ingest | Video recording, transcoding, and streaming software. | CUDA accelerated video recording, encoding and broadcast transcoding | Single only
Root6 - Content Agent | Automated transcoding and workflow management | GPU-accelerated video processing and encoding | Yes

Sorenson Media - Squeeze | Video transcoding application and plug-in | Video encoding and video processing | Yes
Snell Advanced Media - Alchemist on Demand | Video standards conversion | GPU-accelerated video processing and encoding | Yes
Tektronix - Aurora | Automated video quality measurement | GPU-accelerated video quality assessment | Single only
Telestream - Vantage Lightspeed | Video transcoding and processing | Video encoding and video processing | Yes
Wowza - Streaming Engine Transcoder | H.264 video encoding | NVENC accelerated video encoding | Single only

ON-AIR GRAPHICS
APPLICATION | DESCRIPTION | SUPPORTED FEATURES | MULTI-GPU SUPPORT
Brainstorm - eStudio | Virtual sets and motion graphics | Real-time rendering | Single only
ChyronHego - GS2 Graphics Engine | On-air graphics | Real-time rendering | Single only
ChyronHego - Mosaic | On-air graphics | Real-time rendering | Single only
Cinegy - Type | On-air graphics | Real-time rendering | Single only
Dalet - Cube | On-air graphics | Real-time rendering | Single only
Grass Valley - Vertigo | On-air graphics | Real-time rendering | Single only
Imagine Communications - Nexio Channelbrand | On-air graphics | Real-time rendering | Yes
Imagine Communications - Nexio G8 | On-air graphics | Real-time rendering | Single only
Imagine Communications - Nexio TitleOne | On-air graphics | Real-time rendering | Single only
Monarch - Brodcaast Dscript 3D | 3D on-air graphics | Real-time rendering | Single only
Monarch - Virtuoso | Virtual sets and motion graphics | Real-time rendering | Single only
Pixel Power - Clarity | On-air graphics | Real-time rendering | Single only
RT Software - tOG | On-air graphics | Real-time rendering | Single only
Vizrt - Viz Engine | On-air graphics and virtual sets | Real-time rendering | Single only
Wasp3D - CG | On-air graphics and virtual sets | Real-time rendering | Single only

ON-SET, REVIEW AND STEREO TOOLS
APPLICATION | DESCRIPTION | SUPPORTED FEATURES | MULTI-GPU SUPPORT
Autodesk - RV | Review and approval of 4K content | Real-time | Single only
3ality Technica - Intellicam | 3D stereo camera adjustment | CUDA-based 3D imaging | Single only
Binocle3D - Disparity Killer | 3D stereoscopic workflow | CUDA-based 3D imaging | Single only
Blackmagic Design - Dimension | 3D stereoscopic workflow | Real-time | Single only
BlueFish - Fluid 4K Review | Review and approval of 4K content | Real-time video review | Single only
Colorfront - On-Set Dailies | Review, color grading and transcoding on set | Real-time | Yes
Lightcraft - Previzion | On-set virtual production | Real-time, virtual set production | Single only
MTI Film - Cortex Dailies | Review, color grading and transcoding on set | CUDA accelerated grading and transcoding | Single only
The Pixel Farm - PFTrack | 3D scene creation and tracking | CUDA-accelerated tracking | Yes

WEATHER GRAPHICS
APPLICATION | DESCRIPTION | SUPPORTED FEATURES | MULTI-GPU SUPPORT
Accuweather - Cinemative HD | Weather graphics | Real-time | Single only
Accuweather - Storyteller | Weather graphics | Real-time | Single only
ChyronHego - Metacast | Weather graphics | Real-time | Single only
MeteoGraphics - MeteoEarth | Weather graphics | Real-time | Single only
WSI - Max Weather | Weather graphics | Real-time | Single only

Medical Imaging
APPLICATION | DESCRIPTION | SUPPORTED FEATURES | MULTI-GPU SUPPORT
PowerGrid | Advanced MRI reconstruction modeling | Discrete Fourier Transform | Yes

Oil and Gas
APPLICATION | DESCRIPTION | SUPPORTED FEATURES | MULTI-GPU SUPPORT
Acceleware AxRTM, AxKTM | Seismic processing | RTM, Kirchhoff, control source, electromagnetism, forward modeling. | Yes
BRS Labs AISight for SCADA | Proactive integrity management and real-time precursor alerts for enhanced SCADA operations in oil and gas. | 24/7 real-time analysis and alerting scaling to thousands of sensors across remote and geographically dispersed locations including historical analysis and trend reports. | Yes
CGG - GeoVation | Seismic processing | Multiple algorithms (RTM, etc.) | Yes
CGG - InsightEarth | Seismic interpretation | Horizon orientation attributes; automated fault extraction, 3D Curvature Attributes. | Yes
Echelon Stoneridge Technology | Reservoir simulator | Fully GPU-accelerated reservoir model, including dual-perm, dual porosity, pressure varying perm and porosity. Eclipse compatible input deck. | Yes
Esri ArcGIS for Desktop (ArcMap and ArcGIS Pro) - Spatial Analyst and 3D Analyst | Determines the raster surface locations visible to a set of observer features, using geodesic methods. | Viewshed2 transforms the elevation surface into a geocentric 3D coordinate system and runs 3D sightlines to each transformed cell center. | Yes
ffA Geoteric | Seismic interpretation | Attributes calculations, geobodies extraction | Yes
ffA SEA3D Pro | Seismic interpretation | Attributes calculations, geobodies extraction | Yes
ffA SVI Pro | Seismic interpretation | Attributes calculations, geobodies extraction | Yes
GeoMage Multifocusing | Seismic processing | Advanced seismic imaging technologies and services, as well as interpretation, geological modeling, and reservoir characterization. | Yes
* Giant Gray - Graydient S (SCADA) | Machine learning anomaly detection for large scale industrial data. | Proactive integrity management and real-time precursor alerts for enhanced SCADA operations in oil and gas. 24/7 real-time analysis and alerting scaling to thousands of sensors across remote and geographically dispersed locations. | Yes
HUE Headwave Suite | Seismic interpretation | Attributes calculations, Volume Rendering | Yes
HUE HUEspace | Seismic interpretation | Interpretation development platform | Yes
OpenGeo Solutions OpenSeis | Seismic processing | Spectral Decomposition | Yes
Panorama Tech | Seismic processing, Modeling | Multiple algorithms (RTM, etc.) | Yes

Paradigm Echos RTM | Seismic processing | RTM algorithm | Yes
Paradigm Geophysical VoxelGeo | Seismic interpretation | Volume Rendering, Horizon Flattening | Yes
Paradigm SKUA | Reservoir modeling | Faults, Horizons and Flow Simulation Grid | Yes
PumaFlow IFP | Reservoir simulation | GPU-accelerated linear solver | Yes
Ridgeway Kite Simulator | Reservoir simulation | Fully GPU-accelerated reservoir model, including surface facilities and multiple realization history matching. | Yes
Roxar RMS | Reservoir modeling | Multi GPU capabilities via HUEspace | Yes
Schlumberger Omega2 RTM | Seismic processing | Multiple algorithms (RTM, etc.) | Yes
Seismic City Prestack Interpretation | Seismic processing | Multiple algorithms (RTM, etc.) | Yes
SpectraSeis | Seismic processing | Full elastic wave-equation imaging and analysis of microseismic fracture data. | Yes
Stoneridge Technologies GAMPACK | Reservoir simulation | GPU Algebraic MultiGrid Package | Yes
Tsunami A2011 | Seismic processing/Imaging package | RTM processing | Yes
Tsunami RTM | Seismic processing | RTM algorithm | Yes

Research: Higher Education and Supercomputing

COMPUTATIONAL CHEMISTRY AND BIOLOGY

Bioinformatics
APPLICATION | DESCRIPTION | SUPPORTED FEATURES | MULTI-GPU SUPPORT
Arioc | High-throughput read alignment with GPU-accelerated exploration of the seed-and-extend search space. | Single-end alignment, paired-end alignment; output in SAM or database-ready binary formats; multiple GPU implementation | Yes
BarraCUDA | Sequence mapping software | Alignment of short sequencing reads, alignment of indels with gap openings and extensions. | Yes
BEAGLE-lib | BEAGLE is a high-performance library that can perform the core calculations at the heart of most Bayesian and Maximum Likelihood phylogenetics packages. It can make use of highly-parallel processors such as those in graphics cards (GPUs) found in many PCs. | Evaluation of likelihood for sequence evolution on trees and arbitrary models (e.g. nucleotide, amino acid, codon). Speed-ups (over CPU only version): nucleotide model = up to 25x, codon model = up to 50x. | Yes
* BioEM | GPU-accelerated computing of Bayesian inference of electron microscopy images | BioEM can use CUDA for the cross-correlation step, which essentially consists of an image multiplication in Fourier space and a Fourier back-transformation. | Yes
Campaign | An open-source library of GPU-accelerated data clustering algorithms and tools. | K-means (and Kps-means, a K-means variant for GPUs with parallel sorting for improved performance), K-medoids, K-centers (a K-medoids variant in which medoids are placed only once according to a heuristic), Hierarchical clustering and Self-organizing map. | Single only
* cryoSPARC | Enables rapid, unbiased structure discovery of proteins and molecular complexes from cryo-EM data. | Ab-initio reconstruction, heterogeneous reconstruction, and high-speed high-resolution refinement of 3D protein structures implemented on GPUs; lean memory usage: 768x768x768 box size on a 12GB GPU for refinement; multiple simultaneous jobs on multiple GPUs | Yes

CUDASW++ | Open source software for Smith-Waterman protein database searches on GPUs. | Parallel search of Smith-Waterman database. | Yes
CUSHAW | Parallelized short read aligner | Parallel, accurate long read aligner for large genomes | Yes
G-BLASTN | GPU-accelerated nucleotide alignment tool based on the widely used NCBI-BLAST. | Blastn and megablast modes of NCBI-BLAST | Single only
GPU-Blast | Local search with fast k-tuple heuristic | Protein alignment according to BLASTP | Single only
* Huygens | Realize amazing deconvolution results within seconds using high-end NVIDIA GPU cards and the powerful Huygens deconvolution algorithms. The unique brick-splitting possibility is also available in the GPU mode, enabling you to deconvolve very large files on the GPU, even with cards with limited video-RAM. | Deconvolution of volumetric images and time series from widefield, confocal, light sheet, super-resolution STED microscopes and more; chromatic aberration and cross-talk correction, image stabilization and stitching; visualization, tracking, colocalization and object analysis; multi-GPU and cluster support | Yes
mCUDA-MEME | Ultrafast scalable motif discovery algorithm based on MEME. | Scalable motif discovery algorithm based on MEME. | Yes
* Microvolution | Microvolution’s method starts with the proven Richardson-Lucy algorithm that is used by most software programs. Other vendors take mathematical shortcuts to speed up iterations, resulting in imprecise images after deconvolution. Microvolution takes no shortcuts. Our software delivers accurate images, up to 200 times faster. | 3D deconvolution for fluorescence microscopy. Written for use only on GPUs | Yes
MUMmer GPU | High-throughput local sequence alignment program | Aligns multiple query sequences against reference sequence in parallel. | TBD
NVBIO | NVBIO is an open source C++ library of reusable components designed to accelerate bioinformatics applications using CUDA. | Data structures, algorithms, and utility routines useful for building complex computational genomics applications on CPU-GPU systems. | Yes
NVBowtie | A largely complete implementation of the Bowtie2 aligner on top of NVBIO. | Good coverage of Bowtie2 features and comparable quality results. | Yes
PEANUT | Read mapper for DNA or RNA sequence reads to a known reference genome. | Achieves supreme sensitivity and speed compared to current state of the art read mappers like BWA MEM, Bowtie2 and RazerS3. PEANUT can report either only the best hits or all hits. | Single only
REACTA | A modified version of GCTA with improved computational performance, support for Graphics Processing Units (GPUs), and additional features. The purpose of REACTA is to quantify the contribution of genetic variation to phenotypic variation for complex traits. | GRM creation, REML analysis, Regional Heritability (including multi-GPU). | Yes
* RELION-2 | RELION (for REgularised LIkelihood OptimisatioN, pronounced rely-on) is a stand-alone computer program that employs an empirical Bayesian approach to refinement of (multiple) 3D reconstructions or 2D class averages in electron cryo-microscopy (cryo-EM). | Both image classification and high-resolution refinement have been accelerated up to 40-fold, and template-based particle selection has been accelerated almost 1000-fold on desktop hardware. Reduced memory requirements; high-resolution cryo-EM structure determination in a matter of days on a single | Yes
SeqNFind | SeqNFind® is a powerful tool suite that addresses the need for complete and accurate alignments of many small sequences against entire genomes utilizing a unique hardware/software cluster system for facilitating bioinformatics research in Next Generation sequencing and genomic comparisons. | Hardware and software for reference assembly, blast, SW, HMM, de novo assembly. | Yes

SOAP3 | GPU-based software for aligning short reads with a reference sequence. It can find all alignments with k mismatches, where k is chosen from 0 to 3. | Short read alignment tool that is not heuristic based; reports all answers. | Yes
SOAP3-dp | SOAP3-dp: Ultra-fast GPU-based tool for short read alignment via index-assisted dynamic programming. | Burrows-Wheeler Transform, Dynamic Programming. | Yes
UGene | Open source Smith-Waterman for SSE/CUDA, Suffix array based repeats finder and dotplot. | Fast short read alignment. | Yes
WideLM | Fits numerous linear models to a fixed design and response. | Parallel linear regression on multiple similarly-shaped models. | Yes

Molecular Dynamics
APPLICATION | DESCRIPTION | SUPPORTED FEATURES | MULTI-GPU SUPPORT
* ACEMD | GPU simulation of molecular mechanics force fields, implicit and explicit solvent. 610 ns/day (DHFR) | MD engine written for GPUs; Support & CHARMM force fields; Support unbiased simulations via HTMD; Support biased MD via PLUMED | Yes
AMBER | Suite of programs to simulate molecular dynamics on biomolecule. | PMEMD Explicit Solvent and GB Implicit Solvent | Yes
CHARMM | MD package to simulate molecular dynamics on biomolecule. | Implicit (5x), Explicit (2x) Solvent via OpenMM, now ported natively to GPUs. | Yes
DESMOND | High-speed molecular dynamics simulations of biological systems. | The code uses novel parallel algorithms and numerical techniques to achieve high performance and accuracy. | Yes
ESPResSo | Highly versatile software package for performing and analyzing scientific Molecular Dynamics many-particle simulations of coarse-grained atomistic or bead-spring models as they are used in soft-matter research in physics, chemistry and molecular biology. | Hydrodynamic / Electrokinetic forces, P3M electrostatics. | Yes
Folding@Home | A distributed computing project that studies protein folding, misfolding, aggregation, and related diseases. | Powerful distributed computing molecular dynamics system; implicit solvent and folding. | Yes
* Genesis | | Powerful parallelization for hybrid (CPU+GPU) systems; Full electrostatics with PME; Large (1-100 million atoms) biological systems | Yes
* GPUgrid.net | Distributed computing project with thousands of GPUs for molecular simulations. | High-throughput all- biomolecular simulations; Protein folding and binding | Yes
GROMACS | Simulation of biochemical molecules with complicated bond interactions. | Implicit (5x), Explicit (2x) Solvent | Yes
HALMD | Large-scale simulations of simple and complex liquids. | Simple fluids and binary mixtures (pair potentials, high-precision NVE and NVT, dynamic correlations). | Single only
HOOMD-Blue | Particle dynamics package written from the ground up for GPUs. | Written for use only on GPUs | Yes
* HTMD | Python environment for simulation-based molecular discovery | Available via Conda and GitHub; Support ACEMD, PMEMD, NAMD, GROMACS; AMBER and CHARMM force fields; Adaptive sampling, Markov State Models, visualization, protein preparation and ligand parameterization |
LAMMPS | Classical molecular dynamics package | Lennard-Jones, Gay-Berne, Tersoff, and dozens more potentials | Yes

* Indicates new application POPULAR GPU‑ACCELERATED APPLICATIONS CATALOG | May17 | 19 MELD OpenMM plugin written for GPUs OpenMM plugin written for GPUs. Yes Integrative approach to combine physics and information Orders of magnitude faster protein folding than brute force MD NAMD Designed for high-performance simulation Full electrostatics with PME and most Yes of large molecular systems. simulation features; 100M atom capable. OpenMM Library and application for molecular Implicit and explicit solvent, custom forces Yes dynamics for HPC with GPUs. PolyFTS Classical molecular simulation code Uses auxiliary fields as the fundamental Single only for studying polymer self-assembly and simulation degrees of freedom, Uses cuFFT thermodynamics. extensively (~ 80%), CUDA code is ~20%, Multi CPU or single GPU per job, 1x = Ivy Bridge E5-2690 CPU all 10 cores, 3-8X on K40 or K80 (utilizing 1/2 of the K80). * SOP-GPU SOP-GPU package, where SOP stands for Langevin dynamics simulations using Single only the Self Organized Polymer Model fully the coarse-grained Self Organized implemented on a GPU, is a scientific Polymer (SOP) model, Multiple software package designed to perform simulation trajectories can be performed Langevin Dynamics Simulations of simultaneously on a single GPU, Calpha the mechanical or thermal unfolding, and Calpha-Cbeta models are supported, and mechanical indentation of large Simulations of protein forced unfolding, biomolecular systems in the experimental Novel simulations of nanoindentation subsecond (millisecond-to-second) in silico, Support for hydrodynamic timescale. interactions, Up to ~100 ms of simulation time per day, Systems of up to 1,000,000 amino-acids (on GPUs with 6GB or great memory).

Quantum Chemistry

| APPLICATION | DESCRIPTION | SUPPORTED FEATURES | MULTI-GPU SUPPORT |
|---|---|---|---|
| Abinit | Finds the total energy, charge density and electronic structure of systems made of electrons and nuclei within DFT. | Local Hamiltonian, non-local Hamiltonian, LOBPCG algorithm, diagonalization/orthogonalization. | Yes |
| ACES III | Takes best features of parallel implementations of quantum chemistry methods for electronic structure. | Integrating scheduling GPU into SIAL programming language and SIP runtime environment. | Yes |
| ADF | Density Functional Theory (DFT) software package that enables first-principles electronic structure calculations. | GGAs only, energies, forces and Hessians; ~1.5-2x faster | Yes |
| BigDFT | Implements density functional theory by solving the Kohn-Sham equations describing the electrons in a material. | DFT; Daubechies wavelets, part of Abinit | Yes |
| CASTEP [In development] | CASTEP is a leading code for calculating the properties of materials from first principles. Using density functional theory, it can simulate a wide range of materials properties including energetics, structure at the atomic level, vibrational properties, electronic response properties etc. | TBD | Yes |
| CP2K | Program to perform atomistic and molecular simulations of solid state, liquid, molecular and biological systems. | DBCSR (sparse matrix multiply library) | Yes |
| GAMESS-UK | The general purpose ab initio molecular electronic structure program for performing SCF-, DFT- and MCSCF-gradient calculations. | (ss\|ss) type integrals within calculations using Hartree-Fock ab initio methods and density functional theory. Supports organics and inorganics. | Yes |
| GAMESS-US | Computational chemistry suite used to simulate atomic and molecular electronic structure. | Libqc with Rys Quadrature Algorithm, Hartree-Fock, MP2 and CCSD. | Yes |
| Gaussian | Predicts energies, molecular structures, and vibrational frequencies of molecular systems. | Joint NVIDIA, PGI and Gaussian collaboration. | Yes |
| GPAW | Real-space grid DFT code written in C and Python | Electrostatic Poisson equation, orthonormalizing of vectors, residual minimization method (RMM-DIIS). | Yes |
| gWL-LSMS | Materials code for investigating the effects of temperature on magnetism. | Generalized Wang-Landau method | Yes |
| LATTE | Density matrix computations | CU_BLAS, SP2 Algorithm | Yes |
| LSDalton | Linear-scaling HF and DFT code suitable for large molecular systems, now also with some CCSD capabilities | (T) correction to the CCSD energy; RI-MP2 energy/gradient (in development); CCSD energy (in development); GPU-based ERI generator (in development) | Yes |
| MOLCAS | Methods for calculating general electronic structures in molecular systems in both ground and excited states. | CU_BLAS. Additional GPU support coming in Version 8 | Single only |
| MOPAC2012 | Semiempirical Quantum Chemistry | Pseudodiagonalization, matrix manipulation, full diagonalization, and density matrix assembling via Magma libraries. | Single only |
| NWChem | Calculations | Triples part of Reg-CCSD(T), CCSD and EOMCCSD task schedulers. | Yes |
| Octopus | Used for ab initio virtual experimentation and quantum chemistry calculations. | Full GPU support for ground-state, real-time calculations; Kohn-Sham Hamiltonian, orthogonalization, subspace diagonalization, Poisson solver, time propagation. | TBD |
| * ONETEP | ONETEP (Order-N Electronic Total Energy Package) is a linear-scaling code for quantum-mechanical calculations based on density-functional theory. | Scales to 1,000s of GPUs; Core FFT box operations accelerated; All features utilise these core operations but may introduce further bottlenecks resulting in lower speedups | Yes |
| PETot | First principles materials code that computes the behavior of the electron structures of materials. | Density functional theory (DFT) plane wave pseudopotential calculations. | Yes |
| * PWMat | The fastest plane wave pseudopotential code for density functional theory simulations based on GPU. | It can perform extremely fast plane wave DFT calculations based on GPU machines and a mixed single precision and double precision algorithm. It deploys the state-of-the-art electronic structure calculation methods with many new features and algorithm innovations. It performs ab initio material science simulations, designed for both theoretical and experimental groups. | Yes |
| Q-CHEM | Computational chemistry package designed for HPC clusters. | Various features including RI-MP2 | Single only |
| * QMCPACK | Solves the many-body Schrödinger equation for electronic structures using a quantum Monte Carlo method. | Main features | Yes |
| * Quantum Espresso/PWscf | An integrated suite of computer codes for electronic structure calculations and materials modeling at the nanoscale. | PWscf package: linear algebra (matrix multiply), explicit computational kernels, 3D FFTs. | Yes |
| QUICK | QUICK is a GPU-enabled ab initio quantum chemistry software package. | Running Hartree-Fock and DFT energy on GPU, supports s, p, d, f orbitals on energy calculation, HF gradient with s, p, d orbital support, GPU-based ERI generator. | Yes |
| TeraChem | Quantum chemistry software designed to run on NVIDIA GPU. | Full GPU-based solution; Performance compared to GAMESS CPU version. | Yes |
| * VASP | Complex package for performing ab-initio quantum-mechanical molecular dynamics (MD) simulations using pseudopotentials or the projector-augmented wave method and a plane wave basis set. | Hybrid Hartree-Fock DFT functionals including exact exchange. | Yes |

Visualization and Docking

| APPLICATION | DESCRIPTION | SUPPORTED FEATURES | MULTI-GPU SUPPORT |
|---|---|---|---|
| Amira® | A multifaceted software platform for visualizing, manipulating, and understanding Life Science and bio-medical data. | 3D visualization of volumetric data and surfaces | Single only |
| BINDSURF | A virtual screening methodology that uses GPUs to determine protein binding sites. | Allows fast processing of large ligand databases | Single only |
| BUDE | Molecular docking program | Empirical Free Energy Force field | Single only |
| * Core Hopping | Schrödinger’s Core Hopping program not only provides the traditional ligand-based methods for exploring different scaffolds, but also offers a receptor-based method that will accurately account for detailed ligand-receptor interactions of compounds containing novel cores. | GPU accelerated Application | TBA |
| FastROCS | Molecule shape comparison application | Real-time shape similarity searching/comparison | Yes |
| Interactive Molecule Visualizer | Experimental interactive molecule visualizer based on a ray-tracing engine. | Targeting high quality images and ease of interaction, IMV uses the latest GPU computing acceleration techniques, combined with natural user interfaces such as Kinect and Wiimotes. | Single only |
| Molegro Virtual Docker 6 | Method for performing high accuracy flexible molecular docking. | Energy grid computation, pose evaluation and guided differential evolution. | Single only |
| * PaPaRa 2.0 | A Vectorized Algorithm for Probabilistic Phylogeny-Aware Alignment Extension. | Up to 15-fold run time improvements by deploying SIMD vector intrinsics to accelerate the alignment kernel. | Single only |
| PIPER Protein Docking | Protein-protein docking program | Molecule docking | TBD |
| PyMol | User-sponsored molecular visualization system on an open-source foundation | Increased real-time rendering performance. Lines: 460% increase; Cartoons: 1246% increase; Surface: 1746% increase; Spheres: 753% increase; Ribbon: 426% increase | Single only |
| VEGA ZZ | Molecular Modeling Toolkit | Virtual logP, molecular surface values | Single only |
| VMD | Visualizing and analyzing large bio-molecular systems in 3-D graphics | High quality rendering, large structures (100M atoms), analysis and visualization tasks, multiple GPU support for display of molecular orbitals | Yes |

NUMERICAL ANALYTICS

| APPLICATION | DESCRIPTION | SUPPORTED FEATURES | MULTI-GPU SUPPORT |
|---|---|---|---|
| Accelereyes - ArrayFire | Comprehensive GPU function library | Hundreds of functions for math, signal/image processing, statistics, and more. Available for C, C++, Fortran, and other languages | Yes |
| HiPLAR | High Performance Linear Algebra in R | Supports GPU and multi-core platforms, compatible with legacy R code, no new data types or operators, auto-tuning, support for R Matrix package. | Yes (for algebra functions via Magma 1.5 or later) |
| Mathematica Wolfram | A symbolic technical computing language and development environment. | Development environment for CUDA and OpenCL. GPU acceleration for Wolfram Finance Platform. | Yes |
| * MathWorks - MATLAB | GPU acceleration for MATLAB (high-level technical computing language). | Support for 200+ of most used MATLAB functions (incl. Signal Processing, Image Processing, Communications Systems, etc). | Yes |
| NMath Premium | GPU-accelerated math and statistics for .NET, automatically detects the presence of a CUDA-enabled GPU at runtime and seamlessly redirects appropriate computations to it. | Automatically offloads computations to the GPU. | Single only |

PHYSICS

| APPLICATION | DESCRIPTION | SUPPORTED FEATURES | MULTI-GPU SUPPORT |
|---|---|---|---|
| AWP | The Anelastic Wave Propagation, AWP-ODC, independently simulates the dynamic rupture and wave propagation that occurs during an earthquake. Dynamic rupture produces friction, traction, slip, and slip rate information on the fault. The moment function is constructed from this fault data and used to initialize wave propagation. | 3D Finite Difference Computation | Single only |
| BQCD | Lattice quantum chromodynamics application, used for nuclear and high energy physics calculations. | Wilson-clover fermion linear solver | Yes |
| CASTRO | A multicomponent compressible hydrodynamic code for astrophysical flows including self-gravity, nuclear reactions and radiation. CASTRO uses an Eulerian grid and incorporates adaptive mesh refinement (AMR). The approach uses a nested hierarchy of logically-rectangular grids with simultaneous refinement in both space and time. | Gravitational Field Solver | Yes |
| Changa | Astrophysics code that performs collisionless N-body simulations. It can perform cosmological simulations with periodic boundary conditions in comoving coordinates or simulations of isolated stellar systems. | Gravitational Model has been accelerated using CUDA | Single only |
| Chemora | Chemora is a system for performing simulations of systems described by differential equations running on accelerated computational clusters. | Chemora embeds the equations’ computational kernels into dynamically compiled loop nests shaped for input size and GPU structure. | Yes |
| Chroma | Lattice Quantum Chromodynamics (LQCD) | Wilson-clover fermions, Krylov solvers, Domain-decomposition | Yes |
| CPS | Lattice quantum chromodynamics application, used for nuclear and high energy physics calculations. | Wilson, domain-wall and Möbius fermion linear solvers | Yes |
| * CST STUDIO SUITE® and CST PARTICLE STUDIO® | Self-consistent simulation of charged particles in electromagnetic fields. | Particle-in-Cell Solver | Yes |
| ENZO | 3D block-structured AMR code for cosmological structure formation. | Accelerated magnetohydrodynamics solvers | Yes |
| GTC | Simulates microturbulence and transport in magnetically confined fusion plasma. | Electron push and shift (accounting for >80% of run time) | Yes |
| GTC-P | A development code for optimization of plasma physics. Full science and data sets are included, but in a simplified form to allow performance testing and tuning. | Optimized with CUDA. OpenACC development underway | Yes |
| GTS | Simulates microturbulence and the motion of charged particles and interactions in fusion plasma. | Push and shift for both electron and ion dynamics | Yes |
| HACC | Simulates N-Body Astrophysics | This code has been optimized with CUDA and runs in full production mode. | Yes |
| MAESTRO | A low Mach number stellar hydrodynamics code that can be used to simulate long-time, low-speed flows that would be prohibitively expensive to model using traditional compressible code. | Gravitational Field Solver | Yes |
| MILC | Lattice Quantum Chromodynamics (LQCD) codes simulate how elementary particles are formed and bound by the “strong force” to create larger particles like protons and neutrons. | Staggered fermions, Krylov solvers, Gauge-link fattening. | Yes |
| OSIRIS | Simulates Plasma Physics including Laser interaction | 2 dimensions of the particle push have been optimized with CUDA. Additional optimization is being planned with OpenACC. | Yes |
| PIConGPU | A relativistic Particle-in-Cell code that describes the dynamics of a plasma by computing the motion of electrons and ions subject to the Maxwell-Vlasov equation. | Simulation of laser-wakefield acceleration of electrons. | Yes |
| PPM | Piecewise parabolic method, a higher-order extension of Godunov’s method which uses spatial interpolation and allows for a steeper representation of discontinuities, particularly contact discontinuities. | Turbulent, compressible mixing of gases in the context of stars near the ends of their lives and also in inertial confinement fusion. | Single only |
| QUDA | Library for Lattice QCD calculations using GPUs. | Supports the following fermion formulations: Wilson, Wilson-clover, Twisted mass, Improved staggered (asqtad or HISQ) and Domain wall. | Yes |
| RAMSES | Simulates astrophysical problems on different scales (e.g. star formation, galaxy dynamics, cosmological structure formation). | CUDA acceleration is applied for radiative transfer for reionization, and the hydrodynamic solver using AMR. | Yes |
| XGC | Simulates edge effects for MHD plasma physics | The particle push portion has been optimized with CUDA and is being fully optimized with OpenACC and CUDA. | Yes |

SCIENTIFIC VISUALIZATION

| APPLICATION | DESCRIPTION | SUPPORTED FEATURES | MULTI-GPU SUPPORT |
|---|---|---|---|
| 3D Slicer | Medical visualization & segmentation | Rendering, image processing | Single only |
| CEI EnSight | Visualization and analysis application for CAE | Rendering | Yes |
| FluoRender (SCI, U of Utah) | Interactive rendering tool for confocal microscopy data visualization. | Multi-channel volume rendering | Single only |
| GPULib for IDL | Data analysis application | Analysis tasks | Single only |
| * GVDB | GPU framework for OpenVDB data structures that integrates with OptiX | Volumetric rendering of 3D voxels for full volume rendering, hole filling, and user defined operations. | Single only |
| HVR (LCSE, U of Minnesota) | Interactive volume rendering application | Volume rendering | Yes |
| ImageVis3D (SCI, U of Utah) | Simple, scalable, and interactive volume rendering application. | Out-of-core volume rendering | Single only |
| * IndeX | Interactive or real-time volumetric visualization | Parallel distributed 3D rendering of dense or sparse volumes. Accurate ray casting or ray tracing at high resolution of full size datasets. Plug-in to ParaView also available. | Yes |
| IntelligentLight FieldView | Visualization application for CFD | Rendering | Single only |
| MathWorks - MATLAB | Data analysis and visualization application | Rendering and analysis tasks | Single only |
| ParaView | Scalable data analysis and visualization application | Rendering and analysis tasks | Yes |
| Seg3D (SCI, U of Utah) | Segmentation application for medical data | Rendering, image processing | Single only |
| * OptiX | OptiX API is a framework for high-performance ray tracing. | Programmable intersection, ray generation, shading, data payloads. | Yes |
| Visualization Toolkit (VTK) | Data analysis and visualization toolkit | Rendering | Single only |
| VisIt | Scalable data analysis and visualization application | Rendering and analysis tasks | Yes |
| vl3 (Argonne National Lab) | Large dataset visualization in cosmology, astrophysics, and biosciences fields. | Volume rendering of particles | Yes |
| VMD (U of Illinois, Urbana-Champaign) | Visualization and analysis of large bio-molecular systems in 3-D graphics. | High-quality rendering, large structures (100M atoms), analysis and visualization tasks, multiple GPU support for display of molecular orbitals. | Yes |

Safety & Security

| APPLICATION | DESCRIPTION | SUPPORTED FEATURES | MULTI-GPU SUPPORT |
|---|---|---|---|
| Cognika - Perseus | Real-Time Alerting and Visual Search for Fixed and PTZ Cameras. | Real-Time alerting on humans or vehicles; Content-based image search on humans or vehicles. | Yes |
| Genetec – Security Center 5.3 | GPU accelerated decode & rendering enables the display of more high-resolution streams from a single workstation, as well as enhancing video playback performance. | Offloads the workload of video stream decode and display rendering of multiple streams to GPU. | Single only |
| * Giant Gray – Graydient V (Video) | Machine learning anomaly detection for enhanced video analytics. | Proactive event detection and real-time alerts for safety, unauthorized access prevention, and loss prevention. 24/7 real-time analysis and alerting scaling to thousands of video streams across remote and geographically dispersed locations. | Yes |
| Herta Security - BioSurveillance NEXT, BioFinder | Real time facial recognition and forensic alerts against multiple watchlists. | Supports crowded scenes, difficult lighting, faster than real-time analysis, partial face concealment. | Yes |
| iCetana - iMotionFocus | Intelligent analysis of video on 1,000+ camera streams to significantly filter and reduce the camera streams requiring an operator view. | GPU accelerated machine learning to identify abnormal activity within video streams | Yes |
| intuVision - intuVision VA | Real-time alerts and data reporting; use cases include Security, Traffic, Retail and Parking. Analysis of video streams in real time and on archived video at up to 20x real-time speeds. | Robust and user trainable object classification for tracking. Using distributed architecture | Yes |
| IQrity Inc. - IQrity RTFace-300/600, IQrity LDFace-800 | Deep Learning facial recognition SDK with 25 bytes template for real-time identification applications and large-scale IdM solutions. | Real-time face detection, verification or suspect identification against multi-million datasets based on an artificial neural network. | Yes |
| Macroscop | Open-platform video management software for scalable IP video surveillance systems with advanced video analytics. | H264 decoding for CPU offload, zooming, image conversion shader from 24 to 32 bits delivering better color combination. | Yes |
| Mi-AccLib | Accelerated library for video analysis on video surveillance. | Accelerated Intrusion Detection Algorithm. | Yes |
| MotionDSP - Ikena Forensic, Ikena Spotlight | Real-time (render-less) super-resolution-based video enhancement and redaction software for forensic analysts and law enforcement professionals | Multi-filter, render-less video reconstruction (super-resolution, stabilization, light/color correction), and automatic tracking for redaction of video from body cameras, CCTV and other sources. | Yes |
| NEC NeoFace® Watch | Face recognition for real-time video surveillance and offline search compared against multiple watch-lists. | Detects & recognizes multiple faces simultaneously in crowds and variable lighting, scales to more cameras, larger face databases. | Yes |
| Nervve - Visual Search Solution (NVSS) | High speed visual search and analysis | Uses images instead of keywords to search for objects or scenes of interest within video and imagery. Reliant solely on pixel data with no training, keywords, or tags required. | Yes |
| Network Optix - Nx Witness | IP video management system designed for auto discovering, managing, recording, analyzing and searching thousands of video streams at the same time. | GPU accelerated conversion of YUV images to RGB, drawing and scaling YUV images in desktop client, dewarping fisheye (circular or panamorph) live or recorded video streams | Yes |
| * OpenALPR | Automatic license plate recognition software applied to video streams from IP cameras. | High accuracy license plate character recognition spanning North America, Europe, United Kingdom, Australia, Korea, Singapore and Brazil. APIs and source code available for embedded applications and web services. | Yes |
| Smilart Platform | Real-time face recognition in cooperative and uncooperative scenarios adaptable for a multitude of applications to detect, identify or verify people and objects. | Critical core segments written in CUDA allowing for unlimited parallelization and transparent clustering. | Yes |
| * VOCORD FaceControl | Detects and recognizes the faces of people freely passing by cameras, providing an instant alert to people on a watchlist, recognizes age and gender, counts people by faces, tags newcomers and regular visitors. The system uses deep neural network algorithms and performs recognition with extremely high accuracy in field applications. | Non-cooperative biometrical facial recognition system, operating “on-the-go”. | Yes |

For more information on GPU-accelerated applications, please visit www.nvidia.com/teslaapps


© 2017 NVIDIA Corporation. All rights reserved. NVIDIA, the NVIDIA logo, and CUDA are trademarks and/or registered trademarks of NVIDIA Corporation. All company and product names are trademarks or registered trademarks of the respective owners with which they are associated. Features, pricing, availability, and specifications are all subject to change without notice. May17