Hindawi International Journal of Computer Games Technology Volume 2019, Article ID 1792304, 15 pages https://doi.org/10.1155/2019/1792304

Research Article Real-Time Large Crowd Rendering with Efficient Character and Instance Management on GPU

Yangzi Dong and Chao Peng

Department of Computer Science, University of Alabama in Huntsville, Huntsville, AL 35899, USA

Correspondence should be addressed to Yangzi Dong; [email protected]

Received 20 October 2018; Revised 26 January 2019; Accepted 3 March 2019; Published 26 March 2019

Academic Editor: Ali Arya

Copyright © 2019 Yangzi Dong and Chao Peng. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Achieving the efficient rendering of a large animated crowd with realistic visual appearance is a challenging task when players interact with a complex game scene. We present a real-time crowd rendering system that efficiently manages multiple types of character data on the GPU and integrates seamlessly with level-of-detail and visibility culling techniques. The character data, including vertices, triangles, vertex normals, texture coordinates, skeletons, and skinning weights, are stored as either buffer objects or textures in accordance with their access requirements at the rendering stage. Our system preserves the view-dependent visual appearance of individual character instances in the crowd and is executed with a fine-grained parallelization scheme. We compare our approach with existing crowd rendering techniques. The experimental results show that our approach achieves better rendering performance and visual quality. Our approach is able to render a large crowd composed of tens of thousands of animated instances in real time by managing each type of character data in a single buffer object.

1. Introduction

Crowd rendering is an important form of visual effects. In video games, thousands of computer-articulated polygonal characters with a variety of appearances can be generated to inhabit a virtual scene such as a village, a city, or a forest. Movements of the crowd are usually programmed through a crowd simulator [1–4] with given goals. To achieve a realistic visual approximation of the crowd, each character is usually tessellated with tessellation algorithms [5], which increase the character's mesh complexity to a sufficient level so that fine geometric details and smooth mesh deformations can be preserved in the virtual scene. As a result, the virtual scene may end up with a composition of millions of, or even hundreds of millions of, vertices and triangles. Rasterizing such a massive amount of vertices and triangles into pixels carries a high computational cost. Also, the amount of memory required to store them may be beyond the storage capability of graphics hardware. Thus, in the production of video games [6–9], advanced crowd rendering technologies are needed in order to increase the rendering speed and reduce memory consumption while preserving the crowd's visual fidelity.

To improve the diversity of character appearances in a crowd, a common method is to duplicate a character's mesh many times and then assign each duplication a different texture and a varied animation. Some advanced methods allow developers to modify the shape proportions of duplications and then retarget rigs and animations to the modified meshes [10, 11] or synthesize new motions [12, 13]. With the support of hardware-accelerated geometry-instancing and pseudo-instancing techniques [9, 14–16], the multiple data of a character, including vertices, triangles, textures, skeletons, skinning weights, and animations, can be cached in the memory of a graphics processing unit (GPU). Each time the virtual scene needs to be rendered, the renderer alters and assembles those data dynamically without the need of fetching them from CPU main memory. However, storing the duplications on the GPU consumes a large amount of memory and limits the number of instances that can be rendered. Furthermore, even though the instancing technique reduces the CPU-GPU communication overhead, it may suffer from the lack of dynamic mesh adaptation (e.g., continuous level-of-detail).

In this work, we present a rendering system which achieves a real-time rendering rate for a crowd composed of tens of thousands of animated characters. The system ensures full utilization of GPU memory and computational power through the integration of continuous level-of-detail (LOD) and View-Frustum Culling techniques. The size of memory allocated for each character is adjusted dynamically in response to the change of levels of detail as the camera's viewing parameters change. The scene of the crowd may end up with more than one hundred million triangles. Different from existing instancing techniques, our approach is capable of rendering all different characters through a single buffer object for each type of data. The system encapsulates the multiple data of each unique source character into buffer objects and textures, which can then be accessed quickly by shader programs on the GPU as well as maintained efficiently by a general-purpose GPU programming framework.

The rest of the paper is organized as follows. Section 2 reviews the previous works on crowd simulation and crowd rendering. Section 3 gives an overview of our system's rendering pipeline. In Section 4, we describe the fundamentals of continuous LOD and animation techniques and discuss their parallelization on the GPU. Section 5 describes how to process and store the source character's multiple data and how to manage instances on the GPU. Section 6 presents our experimental results and compares our approach with existing crowd rendering techniques. We conclude our work in Section 7.

2. Related Work

Simulation and rendering are two primary computing components in a crowd application. They are often tightly integrated as an entity to enable a special form of in situ visualization, which in general means data is rendered and displayed by a renderer in real time while a simulation is running and generating new data [17–19]. One example is the work presented by Hernandez et al. [20] that simulated a wandering crowd behavior and visualized it using animated 3D virtual characters on GPU clusters. Another example is the work presented by Perez et al. [21] that simulated and visualized crowds in a virtual city. In this section, we first briefly review some previous work contributing to crowd simulation. Then, more related to our work, we focus on acceleration techniques contributing to crowd rendering, including level-of-detail (LOD), visibility culling, and instancing techniques.

A crowd simulator uses macroscopic algorithms (e.g., continuum crowds [22], aggregate dynamics [23], vector fields [24], and navigation fields [25]) or microscopic algorithms (e.g., morphable crowds [26] and socially plausible behaviors [27]) to create crowd motions and interactions. Outcomes of the simulator are usually a successive sequence of time frames, and each frame contains arrays of positions and orientations in the 3D virtual environment. Each pair of position and orientation information defines the global status of a character at a given time frame. McKenzie et al. [28] developed a crowd simulator to generate noncombatant civilian behaviors which is interoperable with a simulation of modern military operations. Zhou et al. [29] classified the existing crowd modeling and simulation technologies based on the size and time scale of simulated crowds and evaluated them based on their flexibility, extensibility, execution efficiency, and scalability. Zhang et al. [30] presented a unified interaction framework on the GPU to simulate the behavior of a crowd at interactive frame rates in a fine-grained parallel fashion. Malinowski et al. [31] were able to perform large-scale simulations that resulted in tens of thousands of simulated agents.

Visualizing a large number of simulated agents using animated characters is a challenging computing task and worth an in-depth study. Beacco et al. [9] surveyed previous approaches for real-time crowd rendering. They reviewed and examined existing acceleration techniques and pointed out that LOD techniques have been used widely in order to achieve high rendering performance, where a far-away character can be represented with a coarse version of the character as the alternative for rendering. A well-known approach is using discrete LOD representations, which are a set of offline simplified versions of a mesh. At the rendering stage, the renderer selects a desired version and renders it without any additional processing cost at runtime. However, discrete LODs require too much memory for storing all simplified versions of the mesh. Also, as mentioned by Cleju and Saupe [32], discrete LODs could cause "popping" visual artifacts because of the unsmooth shape transitions between simplified versions. Dobbyn et al. [33] introduced a hybrid rendering approach that combines image-based and geometry-based rendering techniques. They evaluated the rendering quality and performance in an urban crowd simulation. In their approach, the characters in the distance were rendered with image-based LOD representations, and they were switched to geometric representations when they came within a closer distance. Although the visual quality seemed better than using discrete LOD representations, popping artifacts also occurred when the renderer switched content between image representations and geometric representations. Ulicny et al. [34] presented an authoring tool to create crowd scenes of thousands of characters. To provide users immediate visual feedback, they used low-poly meshes for source characters. The meshes were kept in OpenGL's display lists on the GPU for fast rendering.

Characters in a crowd are polygonal meshes. The mesh is rigged by a skeleton. Rotations of the skeleton's joints transform surrounding vertices, and subsequently the mesh can be deformed to create animations. While LOD techniques for simplifying general polygonal meshes have been studied maturely (e.g., progressive meshes [35], quadric error metrics [36]), not many existing works have studied how to simplify animated characters. Landreneau and Schaefer [37] presented mesh simplification criteria to preserve deforming features of animations on simplified versions of the mesh. Their approach was developed based on quadric error metrics, and they added simplification criteria with the consideration of vertices' skinning weights from the joints of the skeleton and the shape deviation between the character's rest pose and a deformed shape in animations. Their approach produced more accurate animations for dynamically simplified characters than many other LOD-based approaches, but it incurred a higher computational cost, so it may be challenging to integrate their approach into a real-time application. Willmott [38] presented a rapid algorithm to simplify animated characters. The algorithm was developed based on the idea of vertex clustering. The author mentioned the possibility of implementing the algorithm on the GPU. However, in comparison to algorithms based on progressive meshes, it did not produce well-simplified characters that preserve fine features of character appearance. Feng et al. [39] employed triangular-chart geometry images to preserve the features of both static and animated characters. Their approach achieved high rendering performance by implementing geometry images with multiple resolutions on the GPU. In their experiment, they demonstrated a real-time rendering rate for a crowd composed of 15.3 million triangles. However, there could be a potential LOD adaptation issue if the geometry images become excessively large. Peng et al. [8] proposed a GPU-based LOD-enabled system to render crowds along with a novel texture-preserving algorithm on simplified versions of the character. They employed a continuous LOD technique to refine or reduce mesh details progressively during the runtime. However, their approach was based on the simulation of single virtual humans. Instantiating and rendering multiple types of characters were not possible in their system. Savoy et al. [40] presented a web-based crowd rendering system that employed the discrete LOD and instancing techniques.

Visibility culling is another type of acceleration technique for crowd rendering. With visibility culling, a renderer is able to reject a character from the rendering pipeline if it is outside the view frustum or blocked by other characters or objects. Visibility culling techniques do not cause any loss of visual fidelity on visible characters. Tecchia et al. [41] performed efficient occlusion culling for a highly populated scene. They subdivided the virtual environmental map into a 2D grid and used it to build a KD-tree of the virtual environment. The large and static objects in the virtual environment, such as buildings, were used as occluders. Then, an occlusion tree was built at each frame and merged with the KD-tree. Barczak et al. [42] integrated GPU-accelerated View-Frustum Culling and occlusion culling techniques into a crowd rendering system. The system used a vertex shader to test whether or not the bounding sphere of a character intersects with the view frustum. A hierarchical Z-buffer image was built dynamically in a vertex shader in order to perform occlusion culling. Hernandez and Rudomin [43] combined View-Frustum Culling and LOD Selection. Desired detail levels were assigned only to the characters inside the view frustum.

Instancing techniques have been commonly used for crowd rendering. Their execution is accelerated by GPUs with graphics APIs such as DirectX and OpenGL. Zelsnack [44] presented coding details of the pseudo-instancing technique using the OpenGL shading language (GLSL). The pseudo-instancing technique requires per-instance calls sent to and executed on the GPU. Carucci [45] introduced the geometry-instancing technique which renders all vertices and triangles of a crowd scene through a geometry shader using one call. Millan and Rudomin [14] used the pseudo-instancing technique for rendering full-detail characters which were closer to the camera. The far-away characters with low details were rendered using impostors (an image-based approach). Ashraf and Zhou [46] used a hardware-accelerated method through programmable shaders to animate crowds. Klein et al. [47] presented an approach to render configurable instances of 3D characters for the web. They improved XML3D to store 3D content in a more efficient way in order to support an instancing-based rendering mechanism. However, their approach lacked support for multiple character assets.

3. System Overview

Our crowd rendering system first preprocesses source characters and then performs runtime tasks on the GPU. Figure 1 illustrates an overview of our system. Our system integrates View-Frustum Culling and continuous LOD techniques.

At the preprocessing stage, a fine-to-coarse progressive mesh simplification algorithm is applied to every source character. In accordance with the edge-collapsing criteria [8, 35], the simplification algorithm selects edges and collapses them by merging adjacent vertices iteratively, and then removes the triangles containing the collapsed edges. The edge-collapsing operations are stored as data arrays on the GPU. Vertices and triangles can be recovered by splitting the collapsed edges and are restored with respect to the order of applying coarse-to-fine splitting operations. Vertex normal vectors are used in our system to determine a proper shading effect for the crowd. A bounding sphere is computed for each source character. It tightly encloses all vertices in all frames of the character's animation. The bounding sphere is used during the runtime to test an instance against the view frustum. Note that bounding spheres may be of different sizes because the sizes of source characters may be different. Other data, including textures, UVs, skeletons, skinning weights, and animations, are packed into textures. They can be accessed quickly by shader programs and the general-purpose GPU programming framework during the runtime.

The runtime pipeline of our system is executed on the GPU through five parallel processing components. We use an instance ID in shader programs to track the index of each instance, which corresponds to the occurrence of a source character at a global location and orientation in the virtual scene. A unique source character ID is assigned to each source character, which is used by an instance to index back to the source character that it is instantiated from. We assume that the desired number of instances is provided by users as a parameter in the system configuration. The global positions and orientations of instances simulated by a crowd simulator are passed into our system as input. They determine where the instances should occur in the virtual scene. The component of View-Frustum Culling determines the visibility of instances. An instance is considered to be visible if its bounding sphere is inside or intersects with the view frustum.
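The sphere-against-frustum test used by the View-Frustum Culling component can be illustrated with the following Python sketch. This is our own hedged reconstruction, not the paper's shader code: the plane representation and the helper name `sphere_visible` are assumptions, with the frustum modeled as six inward-facing planes.

```python
# Sketch of the View-Frustum Culling test described above (hypothetical
# helper names; the paper's actual implementation runs on the GPU).
# A frustum is modeled as six inward-facing planes (nx, ny, nz, d) with
# unit normals, so a point p is inside a plane when dot(n, p) + d >= 0.

def sphere_visible(center, radius, planes):
    """Return True if the sphere is inside or intersects the frustum."""
    cx, cy, cz = center
    for nx, ny, nz, d in planes:
        # Signed distance from the sphere center to the plane.
        dist = nx * cx + ny * cy + nz * cz + d
        if dist < -radius:          # entirely outside this plane
            return False
    return True                     # inside or intersecting every plane

# Example: a unit-cube "frustum" centered at the origin (axis-aligned planes).
planes = [(1, 0, 0, 1), (-1, 0, 0, 1),
          (0, 1, 0, 1), (0, -1, 0, 1),
          (0, 0, 1, 1), (0, 0, -1, 1)]
print(sphere_visible((0.0, 0.0, 0.0), 0.5, planes))   # → True
print(sphere_visible((5.0, 0.0, 0.0), 0.5, planes))   # → False
```

An instance whose test returns False would have its desired vertex count set to zero by the subsequent LOD Selection component.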

[Figure 1 diagram: source character preprocessing on the CPU (fine-to-coarse simplification producing coarse-to-fine reordered vertices, triangles, edge collapses, vertex normals, skeletons, skinning weights, animations, bounding spheres, textures, and UVs) feeds buffer objects and textures in the GPU's global memory. The runtime pipeline on the GPU takes the global positions and orientations of instances and a primitive budget as input and runs the parallel processing components View-Frustum Culling, LOD Selection, LOD Mesh Generation, Animating Meshes, and Rendering.]

Figure 1: The overview of our system.

The component of LOD Selection determines the desired detail level of the instances. It is executed with an instance-level parallelization. A detail level is represented as the numbers of selected vertices and triangles, which are assembled from the vertex and triangle repositories. The component of LOD Mesh Generation produces LOD meshes using the selected vertices and triangles. The size of GPU memory may not be enough to store all the instances at their finest levels. Thus, a configuration parameter called the primitive budget is passed into the runtime pipeline as a global constraint to ensure that the generated LOD meshes fit into the GPU memory. The component of Animating Meshes attaches skeletons and animations to the simplified versions (LOD meshes) of the instances. At the end of the pipeline, the rendering component rasterizes the LOD meshes along with appropriate textures and UVs and displays the result on the screen.

4. Fundamentals of LOD Selection and Character Animation

During the step of preprocessing, the mesh of each source character is simplified by collapsing edges. As in existing work, the collapsing criteria in our approach preserve features at high-curvature regions [39] and avoid collapsing the edges on or crossing texture seams [8]. Edges are collapsed one-by-one. We utilized the same method presented in [48], which saves collapsing operations into an array structure suitable for the GPU architecture. The index of each array element represents the source vertex, and the value of the element represents the target vertex it merges into. By using the array of edge collapses, the repositories of vertices and triangles are rearranged in an increasing order of detail, so that, at runtime, the desired complexity of a mesh can be generated by selecting a successive sequence of vertices and triangles from the repositories. Then, the skeleton-based animations are applied to deform the simplified meshes. Figure 2 shows the different levels of detail of several source characters that are used in our work. In this section, we brief the techniques of LOD Selection and character animation.

4.1. LOD Selection. Let us denote N as the total number of instances in the virtual scene. A desired level of detail for an instance can be represented as the pair {vnum, tnum}, where vnum is the desired number of vertices and tnum is the desired number of triangles. Given a value of vnum, the value of tnum can be retrieved from the prerecorded edge-collapsing information [48]. Thus, the goal of LOD Selection is to determine an appropriate value of vnum for each instance, with consideration of the available GPU memory size and the instance's spatial relationship to the camera. If an instance is outside the view frustum, vnum is set to zero. For the instances inside the view frustum, we used the LOD Selection metric in [48] to compute vnum, as shown in

    vnum_i = N_v * w_i^(1/theta) / (sum_{j=1}^{N} w_j^(1/theta)),  where w_i = A_i * d_i^(-1).   (1)

Equation (1) is the same as the first-pass algorithm presented by Peng and Cao [48]. It originates from the model perception method presented by Funkhouser et al. [49] and was improved by Peng and Cao [48, 50] to accelerate the rendering of large CAD models. We found that (1) is also a suitable metric for large crowd rendering. In the equation, N_v refers to the total number of vertices that can be retained on the GPU, which is a user-specified value computed based on the available size of GPU memory. The value of N_v can be tuned to balance the rendering performance and visual quality. w_i is the weight computed from the projected area of the bounding sphere of the i-th instance on the screen (A_i) and the distance to the camera (d_i).
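The budget-distribution behavior of this first-pass metric can be sketched in Python as follows. This is our own illustration, not the paper's implementation: the function name `select_lod`, the tuple-based instance representation, and the truncation to integers are assumptions; the weight follows the description of (1) above.

```python
# Sketch of the LOD Selection metric of (1): distribute a global vertex
# budget N_v across instances in proportion to w_i^(1/theta), where the
# weight w_i is computed from the projected bounding-sphere area A_i and
# the camera distance d_i. Instances outside the view frustum get vnum = 0.

def select_lod(instances, vertex_budget, theta=3.0):
    """instances: list of (area, distance, visible) tuples.
    Returns the desired vertex count vnum for each instance."""
    weights = []
    for area, distance, visible in instances:
        w = (area / distance) if visible else 0.0   # w_i = A_i * d_i^(-1)
        weights.append(w ** (1.0 / theta) if w > 0.0 else 0.0)
    total = sum(weights)
    if total == 0.0:
        return [0] * len(instances)                 # everything culled
    return [int(vertex_budget * w / total) for w in weights]

# Example: two near instances, one far instance, one culled instance.
vnums = select_lod([(4.0, 2.0, True), (4.0, 2.0, True),
                    (1.0, 8.0, True), (4.0, 2.0, False)],
                   vertex_budget=10000)
print(vnums)   # near instances receive more vertices; the culled one gets 0
```

Note how the exponent 1/theta (theta = 3 in the paper) compresses the weight range, so distant instances still receive a nontrivial share of the budget rather than degenerating immediately.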


Figure 2: The examples showing different levels of detail for seven source characters. From top to bottom, the numbers of triangles are as follows: (a) Alien: 5,688, 1,316, 601, and 319; (b) Bug: 6,090, 1,926, 734, and 434; (c) Daemon: 6,652, 1,286, 612, and 444; (d) Nasty: 6,848, 1,588, 720, and 375; (e) Ripper Dog: 4,974, 1,448, 606, and 309; (f) Spider: 5,868, 1,152, 436, and 257; (g) Titan: 6,518, 1,581, 681, and 362.

theta is the perception parameter introduced by Funkhouser et al. [49]. In our work, the value of theta is set to 3.

With vnum and tnum, the successive sequences of vertices and triangles are retrieved from the vertex and triangle repositories of the source character. By applying the parallel triangle reformation algorithm [48, 50], the desired shape of the simplified mesh is generated using the selected vertices and triangles.

4.2. Animation. In order to create character animations, each LOD mesh has to be bound to a skeleton, along with skinning weights added to influence the movement of the mesh's vertices. As a result, the mesh is deformed by rotating the joints of the skeleton. As we mentioned earlier, each vertex may be influenced by a maximum of four joints. We want to note that the vertices forming the LOD mesh are a subset of the original vertices of the source character. No new vertex is introduced during the preprocess of mesh simplification. Because of this, we were able to use the original skinning weights to influence the LOD mesh. When the transformations defined in an animation frame are loaded on the joints, the final vertex position is computed by summing the weighted transformations of the skinning joints. Let us denote each of the four joints influencing a vertex v as joint_k, where k ∈ [0, 3]. The weight of joint_k on the vertex v is denoted as w_{joint_k}. Thus, the final position of the vertex v, denoted as P_v, can be computed as

    P_v = M * sum_{k=0}^{3} ( w_{joint_k} * T_{joint_k} * T_{joint_k}^(-1) * P̄_v ),  where sum_{k=0}^{3} w_{joint_k} = 1.   (2)

In (2), P̄_v is the vertex position at the time when the mesh is bound to the skeleton. When the mesh is first loaded without the use of animation data, the mesh is placed in the initial binding pose. When using an animation, the inverse of the binding pose needs to be multiplied by an animated pose. This is reflected in the equation, where T_{joint_k}^(-1) is the inverse of the binding transformation of the joint joint_k, and T_{joint_k} represents the transformation of the joint joint_k from the current frame of the animation. M is the transformation representing the instance's global position and orientation. Note that the transformations T_{joint_k}^(-1), T_{joint_k}, and M are represented in the form of 4×4 matrices. Each weight w_{joint_k} is a single float value, and the four weight values must sum to 1.

5. Source Character and Instance Management

Geometry-instancing and pseudo-instancing techniques are the primary solutions for rendering a large number of instances while allowing the instances to have different global transformations. The pseudo-instancing technique is used in OpenGL and calls instances' drawing functions one-by-one. The geometry-instancing technique has been included in DirectX since version 9 and in OpenGL since version 3.3. It advances the pseudo-instancing technique in terms of reducing the number of drawing calls. It supports the use of a single drawing call for instances of a mesh and therefore reduces the communication cost of sending call requests from CPU to GPU, which subsequently increases the performance. As regards data storage on the GPU, buffer objects are used for shader programs to access and update data quickly. A buffer object is a continuous memory block on the GPU and allows the renderer to rasterize data in a retained mode. In particular, a vertex buffer object (VBO) stores vertices, and an index buffer object (IBO) stores indices of the vertices that form triangles or other polygonal types used in our system.
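Equation (2) is standard linear blend skinning, which can be sketched in Python as follows. This is our own illustration with plain-list 4×4 matrix helpers; the function names and the translation-only example are assumptions, not the paper's GPU code.

```python
# Sketch of the linear blend skinning of (2): the final vertex position is
# the weighted sum, over up to four skinning joints, of the inverse binding
# transform followed by the animated joint transform, and finally the
# instance transform M is applied.

def mat_mul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(4)) for j in range(4)]
            for i in range(4)]

def mat_vec(m, v):
    return [sum(m[i][k] * v[k] for k in range(4)) for i in range(4)]

def translation(tx, ty, tz):
    return [[1, 0, 0, tx], [0, 1, 0, ty], [0, 0, 1, tz], [0, 0, 0, 1]]

IDENTITY = translation(0, 0, 0)

def skin_vertex(p_bind, joints, instance_m):
    """p_bind: binding-pose position (x, y, z, 1).
    joints: list of (weight, T_anim, T_bind_inverse); weights sum to 1."""
    result = [0.0, 0.0, 0.0, 0.0]
    for weight, t_anim, t_bind_inv in joints:
        posed = mat_vec(mat_mul(t_anim, t_bind_inv), p_bind)
        result = [r + weight * c for r, c in zip(result, posed)]
    return mat_vec(instance_m, result)        # apply instance transform M

# Example: one joint at rest, one joint translated 2 units along x,
# blended 50/50; the instance itself is placed at z = 10.
p = skin_vertex([1.0, 0.0, 0.0, 1.0],
                [(0.5, IDENTITY, IDENTITY),
                 (0.5, translation(2, 0, 0), IDENTITY)],
                translation(0, 0, 10))
print(p)  # → [2.0, 0.0, 10.0, 1.0]
```

In the paper's system this computation runs per vertex in a shader, with the joint matrices fetched from the animation texture described in Section 5.1.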

In particular, the geometry-instancing technique requires a single copy of the vertex data maintained in the VBO, a single copy of the triangle data maintained in the IBO, and a single copy of the distinct world transformations of all instances. However, if the source character has a high geometric complexity and there are many instances, the geometry-instancing technique may make the uniform data type in shaders hit its size limit, due to the large amount of vertices and triangles sent to the GPU. In such a case, the drawing call has to be broken into multiple calls.

There are two types of implementations for instancing techniques: static batching and dynamic batching [45]. The single-call method in the geometry-instancing technique is implemented with static batching, while the multicall methods in both the pseudo-instancing and geometry-instancing techniques are implemented with dynamic batching. In static batching, all vertices and triangles of the instances are saved into a VBO and an IBO. In dynamic batching, the vertices and triangles are maintained in different buffer objects and drawn separately. The implementation with static batching has the potential to fully utilize the GPU memory, while dynamic batching would underutilize the memory. The major limitation of static batching is the lack of LOD and skinning support. This limitation makes static batching unsuitable for rendering animated instances, though it has been proved to be faster than dynamic batching in terms of the performance of rasterizing meshes.

In our work, the storage of instances is managed similarly to the implementation of static batching, while individual instances can still be accessed similarly to the implementation of dynamic batching. Therefore, our approach can be seamlessly integrated with LOD and skinning techniques, while making use of a single VBO and IBO for fast rendering. This section describes the details of our contributions to character and instance management, including texture packing, UV-guided mesh rebuilding, and instance indexing.

5.1. Packing Skeleton, Animation, and Skinning Weights into Textures. Smooth deformation of a 3D mesh is a computationally expensive process because each vertex of the mesh needs to be repositioned by the joints that influence it. We packed the skeleton, animations, and skinning weights into 2D textures on the GPU, so that shader programs can access them quickly. The skeleton is the binding pose of the character. As explained in (2), the inverse of the binding pose's transformation is used during the runtime. In our approach, we stored this inverse into the texture as the skeletal information. For each joint of the skeleton, instead of storing individual translation, rotation, and scale values, we stored their composed transformation in the form of a 4×4 matrix. Each joint's binding pose transformation matrix takes four RGBA texels for storage. Each RGBA texel stores a row of the matrix, and each channel stores a single element of the matrix. Using OpenGL, the matrices are stored in the format GL_RGBA32F in the texture, which provides a 32-bit floating-point type for each channel of a texel. Let us denote the total number of joints in a skeleton as n. Then, the total number of texels to store the entire skeleton is 4n.

We used the same format as for storing the skeleton to store an animation. Each animation frame needs 4n texels to store the joints' transformation matrices. Let us denote the total number of frames in an animation as m. Then, the total number of texels for storing the entire animation is 4nm. For each animation frame, the matrix elements are saved into successive texels in row order. Here we want to note that each animation frame starts from a new row in the texture.

The skinning weights of each vertex are four values in the range [0, 1], where each value represents the influencing percentage of a skinning joint. For each vertex, the skinning weights require eight data elements, where the first four data elements are joint indices and the last four are the corresponding weights. In other words, each vertex requires two RGBA texels to store the skinning weights. The first texel is used to store the joint indices, and the second texel is used to store the weights.

5.2. UV-Guided Mesh Rebuilding. A 3D mesh is usually a seamless surface without boundary edges. The mesh has to be cut and unfolded into 2D flattened patches before a texture image can be mapped onto it. To do this, some edges have to be selected properly as boundary edges, from which the mesh can be cut. The relationship between the vertices of a 3D mesh and 2D texture coordinates can be described as a texture mapping function φ(x, y, z) → {(u_i, v_i)}. Inner vertices (those not on boundary edges) have a one-to-one texture mapping. In other words, each inner vertex is mapped to a single pair of texture coordinates. For the vertices on boundary edges, since boundary edges are the cutting seams, a boundary vertex needs to be mapped to multiple pairs of texture coordinates. Figure 3 shows an example that unfolds a cube mesh and maps it into a flattened patch in 2D texture space. In the figure, u_i stands for a point in the 2D texture space. Each vertex on the boundary edges is mapped to more than one point: φ(v0) = {u1, u3}, φ(v3) = {u10, u12}, φ(v4) = {u0, u4, u6}, and φ(v7) = {u7, u9, u13}.

In a hardware-accelerated renderer, texture coordinates are indexed from a buffer object, and each vertex should be associated with a single pair of texture coordinates. Since the texture mapping function produces more than one pair of texture coordinates for boundary vertices, we conduct a mesh rebuilding process that duplicates boundary vertices and maps each duplicate to a unique texture point. By doing this, although the number of vertices is increased due to the cuts along boundary edges, the number of triangles stays the same as in the original mesh.

In our approach, we initialized two arrays to store UV information. One array stores texture coordinates; the other array stores the indices of texture points with respect to the order of triangle storage. Algorithm 1 shows the algorithmic process to duplicate boundary vertices by looping through all triangles. In the algorithm, Verts is the array of original vertices storing 3D coordinates (x, y, z). Tris is the array of original triangles storing the sequence of vertex indices. Similar to Tris, TexTris is the array of indices of texture points in 2D texture space and represents the same triangular topology as the mesh. Note that the order of triangle storage
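The texel layout described above can be sketched as follows. This is a hedged illustration in Python; the helper names and the flat texel list (standing in for a 2D texture) are our own, not the paper's code.

```python
# Sketch of the texture packing layout described above: each 4x4 joint
# matrix occupies 4 RGBA texels (one matrix row per texel, one element per
# channel), so a skeleton of n joints takes 4n texels and an animation of
# m frames takes 4nm texels.

def pack_matrices(matrices):
    """Flatten a list of 4x4 matrices into a list of RGBA texel tuples."""
    texels = []
    for m in matrices:
        for row in m:                  # one matrix row -> one RGBA texel
            texels.append(tuple(row))  # (R, G, B, A) channels
    return texels

def skeleton_texel_count(n_joints):
    return 4 * n_joints

def animation_texel_count(n_joints, n_frames):
    return 4 * n_joints * n_frames

# Example: a 30-joint skeleton and a 24-frame animation clip.
identity = [[1, 0, 0, 0], [0, 1, 0, 0], [0, 0, 1, 0], [0, 0, 0, 1]]
texels = pack_matrices([identity] * 30)
print(len(texels))                     # → 120 texels (= 4 * 30)
print(animation_texel_count(30, 24))   # → 2880
```

A shader would reconstruct joint k of frame f by fetching the four texels starting at offset 4 * (f * n + k), consistent with the row-order layout described above.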


Figure 3: An example showing the process of unfolding a cube mesh and mapping it into a flattened patch in 2D texture space. (a) is the 3D cube mesh formed by 8 vertices and 6 triangles. (b) is the unfolded texture map formed by 14 pairs of texture coordinates and 6 triangles. Bold lines are the boundary edges (seams) used to cut the cube, and the vertices in red are boundary vertices that are mapped to multiple pairs of texture coordinates. In (b), ui stands for a point (ui, vi) in 2D texture space, and vi in the parentheses is the corresponding vertex in the cube.

RebuildMesh(
Input: Vori, Tori, TriNum, Norm, TexInd, TexPtNum;
Output: V′, T′, Norm′)

(1) checked[0 .. TexPtNum−1] ← false
(2) for each triangle i of TriNum do
(3)   for each vertex j in the ith triangle do
(4)     texInd_id ← TexInd[3∗i + j]
(5)     if checked[texInd_id] = false then
(6)       checked[texInd_id] ← true
(7)       vert_id ← Tori[3∗i + j]
(8)       for each coordinate k of the vertex do
(9)         V′[3∗texInd_id + k] = Vori[3∗vert_id + k]
(10)        Norm′[3∗texInd_id + k] = Norm[3∗vert_id + k]
(11)      end for
(12)    end if
(13)  end for
(14) end for
(15) T′ ← TexInd

Algorithm 1: UV-guided mesh rebuilding algorithm.

Note that the order of triangle storage for the mesh is the same as the order of triangle storage for the 2D texture patches. Norm is the array of vertex normal vectors. TriNum is the total number of original triangles, and TexPtNum is the number of texture points in 2D texture space.

After rebuilding the mesh, the number of vertices in V′ will be identical to the number of texture points TexPtNum, and the array of triangles (T′) is replaced by the array of indices of the texture points (TexInd).

5.3. Source Character and Instance Indexing. After applying the data packing and mesh rebuilding methods presented in Sections 5.1 and 5.2, the multiple data of a source character are organized into GPU-friendly data structures. The character's skeleton, skinning weights, and animations are packed into textures and are read-only in shader programs on the GPU. The vertices, triangles, texture coordinates, and vertex normal vectors are stored in arrays and retained on the GPU. During the runtime, based on the LOD Selection result (see Section 4.1), a successive subsequence of vertices, triangles, texture coordinates, and vertex normal vectors is selected for each instance and maintained as single buffer objects. As mentioned in Section 4.1, the simplified instances are constructed in a parallel fashion through a general-purpose GPU programming framework. Then, the framework interoperates with the GPU's shader programs and allows shaders to perform rendering tasks for the instances. Because continuous LOD and animated instancing techniques are assumed to be used in our approach, instances have to be rendered one by one, which is the same as the way of rendering animated instances in geometry-instancing and pseudo-instancing techniques.
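Algorithm 1 above can be transcribed directly into a runnable form. The following Python sketch is our transcription (not the paper's code), using flat arrays in the layout the paper describes: Vori holds (x, y, z) per vertex, Tori holds vertex indices per triangle, and TexInd holds texture-point indices in the same triangle order:

```python
# Sketch of Algorithm 1 (UV-guided mesh rebuilding): duplicate boundary
# vertices so the rebuilt vertex array lines up one-to-one with the
# texture points, while the triangle count stays unchanged.

def rebuild_mesh(v_ori, t_ori, norm, tex_ind, tri_num, tex_pt_num):
    checked = [False] * tex_pt_num
    v_new = [0.0] * (3 * tex_pt_num)
    norm_new = [0.0] * (3 * tex_pt_num)
    for i in range(tri_num):
        for j in range(3):
            tp = tex_ind[3 * i + j]          # texture-point index
            if not checked[tp]:
                checked[tp] = True
                v = t_ori[3 * i + j]         # original vertex index
                for k in range(3):           # copy position and normal
                    v_new[3 * tp + k] = v_ori[3 * v + k]
                    norm_new[3 * tp + k] = norm[3 * v + k]
    return v_new, norm_new, list(tex_ind)    # T' is simply TexInd

# Example: a quad split into two triangles; vertex 0 sits on a seam and
# appears as two texture points (0 and 4), so it gets duplicated.
v_ori = [0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 1.0, 0.0]
t_ori = [0, 1, 2, 0, 2, 3]
norm = [0.0, 0.0, 1.0] * 4
tex_ind = [0, 1, 2, 4, 2, 3]
v_new, norm_new, t_new = rebuild_mesh(v_ori, t_ori, norm, tex_ind, 2, 5)
assert v_new[12:15] == v_ori[0:3]            # boundary vertex duplicated
```

Afterward the rebuilt mesh has one vertex per texture point (5 here), while the triangle count (2) is unchanged, exactly as the text states.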


Figure 4: An example illustrating the data structures for storing vertices and triangles of three instances in the VBO and IBO, respectively. Those data are stored on the GPU, and all data operations are executed in parallel on the GPU. The VBO and IBO store data for all instances that are selected from the array of original vertices and triangles of the source characters. The vNum and tNum arrays are the LOD selection result.

However, our approach needs to construct the data within one execution call, rather than dealing with per-instance data.

Figure 4 illustrates the data structures for storing the VBO and IBO on the GPU. Based on the result of LOD Selection, each instance is associated with a vNum and a tNum (see Section 4.1) that represent the number of vertices and triangles selected based on the current view setting. We employed CUDA Thrust [51] to process the arrays of vNum and tNum using the prefix sum algorithm in a parallel fashion. As a result, each vNum[i] represents the offset of the vertex count prior to the ith instance, and the number of vertices for the ith instance is (vNum[i+1] − vNum[i]).

Algorithm 2 describes the vertex transformation process performed in parallel in the vertex shader. It transforms the instance's vertices to their destination positions while the instance is being animated. In the algorithm, chNum represents the total number of source characters. The inverses of the binding pose skeletons form a texture array denoted as invBind[chNum]. The skinning weights form a texture array denoted as skinWeights[chNum]. We used a walk animation for each source character, and the texture array of the animations is denoted as anim[chNum]. World is the global 4×4 transformation matrix of the instance in the virtual scene. This algorithm is developed based on the data packing formats described in Section 5.1. Each source character is assigned a unique source character ID, denoted as c_id in the algorithm. The drawing calls are issued per instance, so c_id is passed into the shader as an input parameter. The function GetLoc() computes the coordinates in the texture space to locate which texel to fetch. The input of GetLoc() includes the current vertex or joint index (id) that needs to be mapped, the width (w) and height (h) of the texture, and the number of texels (dim) associated with the vertex or joint. For example, to retrieve a vertex's skinning weights, dim is set to 2; to retrieve a joint's transformation matrix, dim is set to 4. In the function TransformVertices(), vertices of the instance are transformed in a parallel fashion by the composed matrix (transMatrix) computed from a weighted sum of matrices of the skinning joints. The function texture() takes a texture and the coordinates located in the texture space as input. It returns the values encoded in texels. The texture() function is usually provided in a shader programming framework. Different from the rendering of static models, animated characters change geometric shapes over time due to continuous pose changes in the animation. In the algorithm, f_id stands for the current frame index of the instance's animation. f_id is updated in the main loop during the execution of the program.

6. Experiment and Analysis

We implemented our crowd rendering system on a workstation with an Intel i7-5930K 3.50GHz CPU, 64GBytes of RAM, and an Nvidia GeForce GTX 980 Ti 6GB graphics card. The rendering system is built with Nvidia CUDA Toolkit 9.2 and OpenGL. We employed 14 unique source characters. Table 1 shows the data configurations of those source characters. Each source character is articulated with a skeleton and fully skinned and animated with a walk animation. Each one contains an unfolded texture UV set along with a texture image at the resolution of 2048 × 2048. We stored those source characters on the GPU. Also, all source characters have been preprocessed by the mesh simplification and animation algorithms described in Section 4. We stored the edge-collapsing information and the character bounding spheres in arrays on the GPU. In total, the source characters require 184.50MB of memory for storage. The size of the mesh data is much smaller than the size of the texture images. The mesh data requires 16.50MB of memory for storage, which is only 8.94% of the total memory consumed. At initialization of the system, we randomly assigned a source character to each instance.

6.1. Visual Fidelity and Performance Evaluations. As defined in Section 4.1, N is a memory budget parameter that determines the geometric complexity and the visual quality of the entire crowd. For each instance in the crowd, the corresponding bounding sphere is tested against the view frustum to determine its visibility. The value of N is distributed only across visible instances.

We created a walkthrough camera path for the rendering of the crowd. The camera path emulates a gaming navigation behavior and produces a total of 1,000 frames.
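The prefix-sum offset computation described above (performed with CUDA Thrust on the GPU in the paper) reduces, on the CPU side, to an exclusive scan over the per-instance counts. A Python sketch under that assumption:

```python
# Sketch: an exclusive prefix sum turns per-instance vertex counts into
# buffer offsets, so instance i occupies [v_num[i], v_num[i+1]) in the
# shared VBO. The paper does this in parallel with CUDA Thrust; here a
# sequential scan shows the same arithmetic.
from itertools import accumulate

def exclusive_scan(counts):
    # Result has one extra entry, so v_num[i+1] - v_num[i] is always valid.
    return [0] + list(accumulate(counts))

v_counts = [120, 80, 200]          # vertices chosen per instance by LOD
v_num = exclusive_scan(v_counts)   # [0, 120, 200, 400]
assert v_num[2] - v_num[1] == 80   # instance 1's vertex count recovered
```

The same scan is applied to tNum for the IBO; the final scan entry gives the total buffer size to allocate.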

GetLoc(
Input: w, h, id, dim;
Output: step_x, step_y, x, y)

(1) step_x ← 1/w
(2) step_y ← 1/h
(3) x ← dim ∗ (id % (w/dim)) ∗ step_x
(4) y ← (id/(w/dim)) ∗ step_y
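GetLoc maps a vertex or joint index to normalized texture coordinates, with dim texels packed per element, row by row. A direct Python transcription (ours) of the listing, useful for checking the layout arithmetic off-GPU:

```python
# Sketch: Python transcription of GetLoc. `dim` texels are packed per
# element (2 for skinning weights, 4 for a 4x4 joint matrix), so each
# texture row holds w // dim elements.

def get_loc(w, h, idx, dim):
    step_x, step_y = 1.0 / w, 1.0 / h
    per_row = w // dim                   # elements stored per texture row
    x = dim * (idx % per_row) * step_x   # first texel of this element
    y = (idx // per_row) * step_y        # row containing this element
    return step_x, step_y, x, y

# In an 8-texel-wide texture holding 4-texel matrices, element 2 starts
# a new row: x wraps back to 0 and y advances by one row.
assert get_loc(8, 4, 2, 4) == (0.125, 0.25, 0.0, 0.25)
```

The shader then fetches the element's texels at (k ∗ step_x + x, y) for k = 0 .. dim−1, matching Algorithm 2 below in the paper.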

TransformVertices(
Input: V′, invBind[chNum], skinWeights[chNum], anim[chNum], World, c_id, f_id;
Output: V″)

(1) for each vertex v_i in V′ in parallel do
(2)   Matrix4×4: transMatrix ← zeroMatrix
(3)   {step_x, step_y, x, y} ← GetLoc(skinWeights[c_id].width, skinWeights[c_id].height, i, 2)
(4)   h ← skinWeights[c_id].height
(5)   Vector4: joints ← texture(skinWeights[c_id], (0 ∗ step_x + x), y)
(6)   Vector4: weights ← texture(skinWeights[c_id], (1 ∗ step_x + x), y)
(7)   for each j in 4 do
(8)     Matrix4×4: invBindMat
(9)     {step_x, step_y, x, y} ← GetLoc(invBind[c_id].width, invBind[c_id].height, joints[j], 4)
(10)    for each k in 4 do
(11)      invBindMat[k] ← texture(invBind[c_id], (k ∗ step_x + x), y)
(12)    end for
(13)    {step_x, step_y, x, y} ← GetLoc(anim[c_id].width, anim[c_id].height, joints[j], 4)
(14)    frame_off ← f_id ∗ (jointNum/(anim[c_id].width/4)) ∗ step_y
(15)    Matrix4×4: jointMat
(16)    for each k in 4 do
(17)      jointMat[k] ← texture(anim[c_id], (k ∗ step_x + x), frame_off + y)
(18)    end for
(19)    transMatrix += weights[j] ∗ jointMat ∗ invBindMat
(20)  end for
(21)  V″[i] = ViewProjectionMatrix ∗ World ∗ transMatrix ∗ V′[i]
(22) end for

Algorithm 2: Transforming vertices of an instance in the vertex shader.
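The core of Algorithm 2 is a weighted sum of (animation pose matrix × inverse binding pose matrix) over the influencing joints, i.e., linear blend skinning. A CPU-side Python sketch (ours; plain lists stand in for the GPU texture fetches and the 4-influence limit):

```python
# Sketch of the per-vertex blend in Algorithm 2 (linear blend skinning).
# Matrices are row-major 4x4 lists; vertices are homogeneous 4-vectors.

def mat_mul(a, b):
    return [[sum(a[r][k] * b[k][c] for k in range(4)) for c in range(4)]
            for r in range(4)]

def mat_vec(m, v):
    return [sum(m[r][c] * v[c] for c in range(4)) for r in range(4)]

def skin_vertex(vertex, joints, weights, anim_mats, inv_bind_mats):
    blended = [[0.0] * 4 for _ in range(4)]      # transMatrix <- zero
    for j, w in zip(joints, weights):            # up to 4 influences
        m = mat_mul(anim_mats[j], inv_bind_mats[j])
        for r in range(4):
            for c in range(4):
                blended[r][c] += w * m[r][c]     # weighted matrix sum
    return mat_vec(blended, vertex)              # skinned position

I = [[1.0 if r == c else 0.0 for c in range(4)] for r in range(4)]
T = [[1.0, 0.0, 0.0, 2.0],                       # pose: translate x by 2
     [0.0, 1.0, 0.0, 0.0],
     [0.0, 0.0, 1.0, 0.0],
     [0.0, 0.0, 0.0, 1.0]]
# One joint, full weight: the vertex follows the joint's pose.
assert skin_vertex([1.0, 2.0, 3.0, 1.0], [0], [1.0], [T], [I]) \
       == [3.0, 2.0, 3.0, 1.0]
```

The shader's final step, multiplying by World and the view-projection matrix, is the same matrix arithmetic applied once more per vertex.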

Table 1: The data information of the source characters used in our experiment. Note that the data information represents the characters prior to the process of UV-guided mesh rebuilding (see Section 5.2).

Names of Source Characters   # of Vertices   # of Triangles   # of Texture UVs   # of Joints
Alien                        2846            5688             3334               64
Bug                          3047            6090             3746               44
Creepy                       2483            4962             2851               67
Daemon                       3328            6652             4087               55
Devourer                     2975            5946             3494               62
Fat                          2555            5106             2960               50
Mutant                       2265            4526             2649               46
Nasty                        3426            6848             3990               45
Pangolin                     2762            5520             3257               49
Ripper Dog                   2489            4974             2982               48
Rock                         2594            5184             3103               62
Spider                       2936            5868             3374               38
Titan                        3261            6518             3867               45
Troll                        2481            4958             2939               55


Figure 5: An example of the rendering result produced by our system using different N values. (a) shows the captured images with N = 1, 6, and 10 million. The top images are the rendered frames from the main camera. The bottom images are rendered based on the setting of the reference camera, which aims at the instances that are far away from the main camera. (b) shows the entire crowd, including the view frustums of the two cameras. The yellow dots in (b) represent the instances outside the view frustum of the main camera.

The entire crowd contains 30,000 instances spread out in the virtual scene with predefined movements and moving trajectories. Figure 5 shows a rendered frame of the crowd with the value of N set to 1, 6, and 10 million, respectively. The main camera moves on the walkthrough path. The reference camera is aimed at the instances far away from the main camera and shows a close-up view of those far-away instances. Our LOD-based instancing method ensures that the total number of selected vertices and triangles is within the specified memory budget, while preserving the fine details of instances that are closer to the main camera. Although the far-away instances are simplified significantly, because of the long distances to the main camera, their appearance in the main camera does not cause a loss of visual fidelity. Figure 5(a) shows the visual appearance of the crowd rendered from the viewpoint of the main camera (top images), in which far-away instances are rendered using the simplified versions (bottom images).

If all instances were rendered at the level of full detail, the total number of triangles would be 169.07 million. Through the simulation of the walkthrough camera path, we had an average of 15,771 instances inside the view frustum. The maximum and minimum numbers of instances inside the view frustum are 29,967 and 5,038, respectively. We specified different values for N. Table 2 shows the performance breakdowns with regard to the runtime processing components in our system. In the table, the "# of Rendered Triangles" column includes the minimum, maximum, and averaged numbers of triangles selected during the runtime. As we can see, the higher the value of N, the more triangles are selected to generate simplified instances and, subsequently, the better the visual quality obtained for the crowd. Our approach is memory efficient. Even when N is set to a large value, such as 20 million, the crowd needs only 26.23 million triangles on average, which is only 15.51% of the original number of triangles. When the value of N is small, the difference between the averaged and the maximum number of triangles is significant. For example, when N is equal to 5 million, the difference is at a ratio (Avg./Max.) of 73.96%. This indicates that the number of triangles in the crowd varies significantly according to the change of instance-camera relationships (including instances' distance to the camera and their visibility).

Table 2: Performance breakdowns for the system with a precreated camera path and a total of 30,000 instances. The FPS value and the component execution times are averaged over 1,000 frames. The total number of triangles (before the use of LOD) is 169.07 million.

N (million)   FPS     # of Rendered Triangles (million)   Component Execution Times (millisecond)
                      Min.    Max.    Avg.                View-Frustum Culling   LOD Mesh     Animating Meshes
                                                          + LOD Selection        Generation   + Rendering
1             47.91   1.88    9.70    5.20                0.68                   2.87         26.06
5             43.30   6.12    10.14   7.50                0.65                   3.87         26.44
10            32.82   11.64   13.78   13.04               0.65                   6.27         29.40
15            23.00   17.88   20.73   19.54               0.71                   9.45         36.43
20            18.71   23.13   27.73   26.23               0.68                   12.52        42.85


This is because a small value of N limits the level of details that an instance can reach. Even if an instance is close to the camera, it may not obtain a sufficient cut from N to satisfy a desired detail level. As we can see in the table, when the value of N becomes larger than 10 million, the ratio increases to 94%. The View-Frustum Culling and LOD Selection components are implemented together, and both are executed in parallel at an instance level. Thus, the execution time of this component does not change as the value of N increases. The component of LOD Mesh Generation is executed in parallel at a triangle level. Its execution time increases as the value of N increases. The Animating Meshes and Rendering components are executed with the acceleration of OpenGL's buffer objects and shader programs. They are time-consuming, and their execution time increases as more triangles need to be rendered.

Figure 6: The change of FPS over different values of N by using our approach. The FPS is averaged over the total of 1,000 frames.

Figure 6 shows the change of FPS over different values of N. As we can see in the figure, the FPS decreases as the value of N increases. When N is smaller than 4 million, the decreasing slope of FPS is small. This is because the change in the number of triangles over the frames of the camera path is small. When N is small, many close-up instances end up at the lowest level of details due to the insufficient memory budget from N. When N increases from 4 to 17 million, the decreasing slope of FPS becomes larger. This is because the number of triangles over the frames of the camera path varies considerably with different values of N. As N increases beyond 17 million, the decreasing slope becomes smaller again, as many instances, including far-away ones, reach the full level of details.

6.2. Comparison and Discussion. We analyzed two rendering techniques and compared them against our approach in terms of performance and visual quality. The pseudo-instancing technique minimizes the amount of data duplication by sharing vertices and triangles among all instances, but it does not support LOD on a per-instance level [44, 52]. The point-based technique renders a complex geometry by using a dense set of sampled points in order to reduce the computational cost in rendering [53, 54]. The pseudo-instancing technique does not support View-Frustum Culling. For comparison purposes, in our approach, we ensured all instances were inside the view frustum of the camera by setting a fixed position for the camera and fixed positions for all instances, so that all instances are processed and rendered by our approach. The complexity of each instance rendered by the point-based technique is selected based on its distance to the camera, which is similar to our LOD method. When an instance is near the camera, the original mesh is used for rendering; when the instance is located far away from the camera, a set of points is approximated as sample points to represent the instance. In this comparison, the pseudo-instancing technique always renders the original meshes of instances. We chose two different N values (N = 5 million and N = 10 million) for rendering with our approach. As shown in Figure 7, our approach results in better performance than the pseudo-instancing technique. This is because the number of triangles rendered by using the pseudo-instancing technique is much larger than the number of triangles determined by the LOD Selection component of our approach. The performance of our approach becomes better than that of the point-based technique as the value of N increases. Figure 8 shows the comparison of visual quality among our approach, the pseudo-instancing technique, and the point-based technique.


Figure 7: The performance comparison of our approach, the pseudo-instancing technique, and the point-based technique over different numbers of instances. Two values of N are chosen for our approach (N = 5 million and N = 10 million). The FPS is averaged over the total of 1,000 frames.


Figure 8: The visual quality comparison of our approach, the pseudo-instancing technique, and the point-based technique. The number of rendered instances on the screen is 20,000. (a) shows the captured image of our approach's rendering result with N = 5 million. (b) shows the captured image of the pseudo-instancing technique's rendering result. (c) shows the captured image of the point-based technique's rendering result. (d), (e), and (f) are the captured images zoomed in on the area of instances which are far away from the camera.

The image generated from the pseudo-instancing technique represents the original quality. Our approach can achieve better visual quality than the point-based technique. As we can see in the top images of the figure, the instances far away from the camera rendered by the point-based technique appear to have "holes" due to the disconnection between vertices. In addition, the popping artifact appears when using the point-based technique. This is because the technique uses a limited number of detail levels from the technique of discrete LOD. Our approach avoids the popping artifact since continuous LOD representations of the instances are applied during the rendering.

7. Conclusion

In this work, we presented a crowd rendering system that takes advantage of GPU power for both general-purpose and graphics computations. We rebuilt the meshes of source characters based on the flattened pieces of texture UV sets. We organized the source characters and instances into buffer objects and textures on the GPU. Our approach integrates seamlessly with continuous LOD and View-Frustum Culling techniques. Our system maintains the visual appearance by assigning each instance an appropriate level of details. We evaluated our approach with a crowd composed of 30,000 instances and achieved real-time performance. In comparison with existing crowd rendering techniques, our approach better utilizes GPU memory and reaches a higher rendering frame rate.

In the future, we would like to integrate our approach with occlusion culling techniques to further reduce the number of vertices and triangles during the runtime and improve the visual quality. We also plan to integrate our crowd rendering system with a simulation in a real game application. Currently, we only used a single walk animation in the crowd rendering system. In a real game application, more animation types should be added, and a motion graph should be created in order to make animations transition smoothly from one to another. We also would like to explore possibilities to transplant our approach onto a multi-GPU platform, where a more complex crowd could be rendered in real time, with the support of higher memory capability and more computational power provided by multiple GPUs.

Data Availability

The source character assets used in our experiment were purchased from cgtrader.com. The source code, including GPU and shader programs, was developed in our research lab by the authors of this paper. The source code has been archived in our lab and is available for distribution upon request. The supplementary video (available here) submitted together with the manuscript shows our experimental result of real-time crowd rendering.

Conflicts of Interest

The authors declare that they have no conflicts of interest.

Acknowledgments

This work was supported by the National Science Foundation Grant CNS-1464323. We thank Nvidia for donating the GPU device that has been used in this work to run our approach and produce experimental results. The source characters were purchased from cgtrader.com with the buyer's license that allows us to disseminate the rendered and moving images of those characters through the software prototypes developed in this work.

Supplementary Materials

We submitted a demo video of our crowd rendering system as a supplementary material. The video consists of two parts. The first part is a recorded video clip showing the real-time rendering result as the camera moves along the predefined walkthrough path in the virtual scene. The bottom view corresponds to the user's view, and the top view is a reference view showing the changes of the view frustum of the user's view (red lines). In the reference view, the yellow dots represent the character instances outside the view frustum of the user's view. At the top-left corner of the screen, the runtime performance and data information are plotted. The runtime performance is measured by the frames per second (FPS). The N value is set to 10 million, the total number of instances is set to 30,000, and the total number of original triangles of all instances is 169.07 million. The number of rendered triangles changes dynamically according to the user's view changes, since the continuous LOD algorithm is used in our approach. The second part of the video explains the visual effect of the LOD algorithm. It uses a straight-line moving path that allows the main camera to move from one corner of the crowd to another corner at a constant speed. The video is played 4 times faster than the original FPS of the video clip. The top-right view shows the entire scene of the crowd. The red lines represent the view frustum of the main camera, which is moving along the straight-line path. The blue lines represent the view frustum of the reference camera, which is fixed and aimed at the far-away instances from the main camera. As we can see in the view of the reference camera (top-left corner), the simplified far-away instances gain more details as the main camera moves towards the location of the reference camera. At the same time, more instances are outside the view frustum of the main camera, represented as yellow dots. (Supplementary Materials)

References

[1] S. Lemercier, A. Jelic, R. Kulpa et al., "Realistic following behaviors for crowd simulation," Computer Graphics Forum, vol. 31, no. 2, part 2, pp. 489–498, 2012.
[2] A. Golas, R. Narain, S. Curtis, and M. C. Lin, "Hybrid long-range collision avoidance for crowd simulation," IEEE Transactions on Visualization and Computer Graphics, vol. 20, no. 7, pp. 1022–1034, 2014.
[3] G. Payet, "Developing a massive real-time crowd simulation framework on the GPU," 2016.

[4] I. Karamouzas, N. Sohre, R. Narain, and S. J. Guy, "Implicit crowds: optimization integrator for robust crowd simulation," ACM Transactions on Graphics, vol. 36, no. 4, pp. 1–13, 2017.
[5] J. R. Shewchuk, "Delaunay refinement algorithms for triangular mesh generation," Computational Geometry: Theory and Applications, vol. 22, no. 1-3, pp. 21–74, 2002.
[6] J. Pettré, P. D. H. Ciechomski, J. Maïm, B. Yersin, J.-P. Laumond, and D. Thalmann, "Real-time navigating crowds: scalable simulation and rendering," Computer Animation and Virtual Worlds, vol. 17, no. 3-4, pp. 445–455, 2006.
[7] J. Maïm, B. Yersin, J. Pettré, and D. Thalmann, "YaQ: an architecture for real-time navigation and rendering of varied crowds," IEEE Computer Graphics and Applications, vol. 29, no. 4, pp. 44–53, 2009.
[8] C. Peng, S. I. Park, Y. Cao, and J. Tian, "A real-time system for crowd rendering: parallel LOD and texture-preserving approach on GPU," in Proceedings of the International Conference on Motion in Games, vol. 7060 of Lecture Notes in Computer Science, pp. 27–38, Springer, 2011.
[9] A. Beacco, N. Pelechano, and C. Andújar, "A survey of real-time crowd rendering," Computer Graphics Forum, vol. 35, no. 8, pp. 32–50, 2016.
[10] R. McDonnell, M. Larkin, S. Dobbyn, S. Collins, and C. O'Sullivan, "Clone attack! Perception of crowd variety," ACM Transactions on Graphics, vol. 27, no. 3, p. 1, 2008.
[11] Y. P. Du Sel, N. Chaverou, and M. Rouillé, "Motion retargeting for crowd simulation," in Proceedings of the 2015 Symposium on Digital Production, DigiPro '15, pp. 9–14, ACM, New York, NY, USA, August 2015.
[12] F. Multon, L. France, M.-P. Cani-Gascuel, and G. Debunne, "Computer animation of human walking: a survey," Journal of Visualization and Computer Animation, vol. 10, no. 1, pp. 39–54, 1999.
[13] S. Guo, R. Southern, J. Chang, D. Greer, and J. J. Zhang, "Adaptive motion synthesis for virtual characters: a survey," The Visual Computer, vol. 31, no. 5, pp. 497–512, 2015.
[14] E. Millán and I. Rudomín, "Impostors, pseudo-instancing and image maps for GPU crowd rendering," The International Journal of Virtual Reality, vol. 6, no. 1, pp. 35–44, 2007.
[15] H. Nguyen, "Chapter 2: Animated crowd rendering," in GPU Gems 3, Addison-Wesley Professional, 2007.
[16] H. Park and J. Han, "Fast rendering of large crowds using GPU," in Entertainment Computing – ICEC 2008, S. M. Stevens and S. J. Saldamarco, Eds., vol. 5309 of Lecture Notes in Computer Science, pp. 197–202, Springer, Berlin, Germany, 2009.
[17] K.-L. Ma, "In situ visualization at extreme scale: challenges and opportunities," IEEE Computer Graphics and Applications, vol. 29, no. 6, pp. 14–19, 2009.
[18] T. Sasabe, S. Tsushima, and S. Hirai, "In-situ visualization of liquid water in an operating PEMFC by soft X-ray radiography," International Journal of Hydrogen Energy, vol. 35, no. 20, pp. 11119–11128, 2010, Hyceltec 2009 Conference.
[19] H. Yu, C. Wang, R. W. Grout, J. H. Chen, and K.-L. Ma, "In situ visualization for large-scale combustion simulations," IEEE Computer Graphics and Applications, vol. 30, no. 3, pp. 45–57, 2010.
[20] B. Hernandez, H. Perez, I. Rudomin, S. Ruiz, O. De Gyves, and L. Toledo, "Simulating and visualizing real-time crowds on GPU clusters," Computación y Sistemas, vol. 18, no. 4, pp. 651–664, 2015.
[21] H. Perez, I. Rudomin, E. A. Ayguade, B. A. Hernandez, J. A. Espinosa-Oviedo, and G. Vargas-Solar, "Crowd simulation and visualization," in Proceedings of the 4th BSC Severo Ochoa Doctoral Symposium, Poster, May 2017.
[22] A. Treuille, S. Cooper, and Z. Popović, "Continuum crowds," ACM Transactions on Graphics, vol. 25, no. 3, pp. 1160–1168, 2006.
[23] R. Narain, A. Golas, S. Curtis, and M. C. Lin, "Aggregate dynamics for dense crowd simulation," in ACM SIGGRAPH Asia 2009 Papers, SIGGRAPH Asia '09, p. 1, Yokohama, Japan, December 2009.
[24] X. Jin, J. Xu, C. C. L. Wang, S. Huang, and J. Zhang, "Interactive control of large-crowd navigation in virtual environments using vector fields," IEEE Computer Graphics and Applications, vol. 28, no. 6, pp. 37–46, 2008.
[25] S. Patil, J. Van Den Berg, S. Curtis, M. C. Lin, and D. Manocha, "Directing crowd simulations using navigation fields," IEEE Transactions on Visualization and Computer Graphics, vol. 17, no. 2, pp. 244–254, 2011.
[26] E. Ju, M. G. Choi, M. Park, J. Lee, K. H. Lee, and S. Takahashi, "Morphable crowds," ACM Transactions on Graphics, vol. 29, no. 6, pp. 1–140, 2010.
[27] S. I. Park, C. Peng, F. Quek, and Y. Cao, "A crowd modeling framework for socially plausible animation behaviors," in Motion in Games, M. Kallmann and K. Bekris, Eds., vol. 7660 of Lecture Notes in Computer Science, pp. 146–157, Springer, Berlin, Germany, 2012.
[28] F. D. McKenzie, M. D. Petty, P. A. Kruszewski et al., "Integrating crowd-behavior modeling into military simulation using game technology," Simulation & Gaming, vol. 39, no. 1, Article ID 1046878107308092, pp. 10–38, 2008.
[29] S. Zhou, D. Chen, W. Cai et al., "Crowd modeling and simulation technologies," ACM Transactions on Modeling and Computer Simulation (TOMACS), vol. 20, no. 4, pp. 1–35, 2010.
[30] Y. Zhang, B. Yin, D. Kong, and Y. Kuang, "Interactive crowd simulation on GPU," Journal of Information and Computational Science, vol. 5, no. 5, pp. 2341–2348, 2008.
[31] A. Malinowski, P. Czarnul, K. Czuryło, M. Maciejewski, and P. Skowron, "Multi-agent large-scale parallel crowd simulation," Procedia Computer Science, vol. 108, pp. 917–926, 2017, International Conference on Computational Science, ICCS 2017, 12–14 June 2017, Zurich, Switzerland.
[32] I. Cleju and D. Saupe, "Evaluation of supra-threshold perceptual metrics for 3D models," in Proceedings of the 3rd Symposium on Applied Perception in Graphics and Visualization, APGV '06, pp. 41–44, ACM, New York, NY, USA, July 2006.
[33] S. Dobbyn, J. Hamill, K. O'Conor, and C. O'Sullivan, "Geopostors: a real-time geometry/impostor crowd rendering system," in Proceedings of the 2005 Symposium on Interactive 3D Graphics and Games, I3D '05, pp. 95–102, ACM, New York, NY, USA, April 2005.
[34] B. Ulicny, P. d. Ciechomski, and D. Thalmann, "Crowdbrush: interactive authoring of real-time crowd scenes," in Proceedings of the Symposium on Computer Animation, R. Boulic and D. K. Pai, Eds., p. 243, The Eurographics Association, Grenoble, France, August 2004.
[35] H. Hoppe, "Efficient implementation of progressive meshes," Computers & Graphics, vol. 22, no. 1, pp. 27–36, 1998.
[36] M. Garland and P. S. Heckbert, "Surface simplification using quadric error metrics," in Proceedings of the 24th Annual Conference on Computer Graphics and Interactive Techniques, SIGGRAPH '97, pp. 209–216, ACM Press/Addison-Wesley Publishing Co., New York, NY, USA, 1997.
[37] E. Landreneau and S. Schaefer, "Simplification of articulated meshes," Computer Graphics Forum, vol. 28, no. 2, pp. 347–353, 2009.
[38] A. Willmott, "Rapid simplification of multi-attribute meshes," in Proceedings of the ACM SIGGRAPH Symposium on High Performance Graphics, HPG '11, p. 151, ACM, New York, NY, USA, August 2011.
[39] W.-W. Feng, B.-U. Kim, Y. Yu, L. Peng, and J. Hart, "Feature-preserving triangular geometry images for level-of-detail representation of static and skinned meshes," ACM Transactions on Graphics (TOG), vol. 29, no. 2, article 11, 2010.
[40] D. P. Savoy, M. C. Cabral, and M. K. Zuffo, "Crowd simulation rendering for web," in Proceedings of the 20th International Conference on 3D Web Technology, Web3D '15, pp. 159–160, ACM, New York, NY, USA, June 2015.
[41] F. Tecchia, C. Loscos, and Y. Chrysanthou, "Real time rendering of populated urban environments," Tech. Rep., ACM SIGGRAPH Technical Sketch, 2001.
[42] J. Barczak, N. Tatarchuk, and C. Oat, "GPU-based scene management for rendering large crowds," ACM SIGGRAPH Asia Sketches, vol. 2, no. 2, 2008.
[43] B. Hernández and I. Rudomin, "A rendering pipeline for real-time crowds," GPU Pro, vol. 2, pp. 369–383, 2016.
[44] J. Zelsnack, "GLSL pseudo-instancing," Tech. Rep., 2004.
[45] F. Carucci, "Inside geometry instancing," GPU Gems, vol. 2, pp. 47–67, 2005.
[46] G. Ashraf and J. Zhou, "Hardware accelerated skin deformation for animated crowds," in Proceedings of the 13th International Conference on Multimedia Modeling – Volume Part II, MMM '07, pp. 226–237, Springer-Verlag, Berlin, Germany, 2006.
[47] F. Klein, T. Spieldenner, K. Sons, and P. Slusallek, "Configurable instances of 3D models for declarative 3D in the web," in Proceedings of the Nineteenth International ACM Conference, pp. 71–79, ACM, New York, NY, USA, August 2014.
[48] C. Peng and Y. Cao, "Parallel LOD for CAD model rendering with effective GPU memory usage," Computer-Aided Design and Applications, vol. 13, no. 2, pp. 173–183, 2016.
[49] T. A. Funkhouser and C. H. Séquin, "Adaptive display algorithm for interactive frame rates during visualization of complex virtual environments," in Proceedings of the 20th Annual Conference on Computer Graphics and Interactive Techniques, SIGGRAPH '93, pp. 247–254, ACM, New York, NY, USA, 1993.
[50] C. Peng and Y. Cao, "A GPU-based approach for massive model rendering with frame-to-frame coherence," Computer Graphics Forum, vol. 31, no. 2, pp. 393–402, 2012.
[51] N. Bell and J. Hoberock, "Thrust: a productivity-oriented library for CUDA," in GPU Computing Gems Jade Edition, Applications of GPU Computing Series, pp. 359–371, Morgan Kaufmann, Boston, Mass, USA, 2012.
[52] E. Millán and I. Rudomín, "Impostors, pseudo-instancing and image maps for GPU crowd rendering," The International Journal of Virtual Reality, vol. 6, no. 1, pp. 35–44, 2007.
[53] L. Toledo, O. De Gyves, and I. Rudomín, "Hierarchical level of detail for varied animated crowds," The Visual Computer, vol. 30, no. 6-8, pp. 949–961, 2014.
[54] L. Lyu, J. Zhang, and M. Fan, "An efficiency control method based on SFSM for massive crowd rendering," Advances in Multimedia, vol. 2018, pp. 1–10, 2018.