- shader authors can define variations of shaders with a number of systems
- each system can inject code/resources/constants into the shader
- a material allows specifications of which shaders belong together, which systems they use and allows to insert command at the correct time in the Frame
- all of the resulting shaders get a shared resource binder and constant buffer to reduce
data:image/s3,"s3://crabby-images/8f176/8f1768668aee51ac41fe8c17dabfd754a14b612d" alt=""
explanations how ray tracing shaders in OpenGL can access the necessary textures for all objects in the world using GL_ARB_bindless_texture
data:image/s3,"s3://crabby-images/55e75/55e75324c11106e920e8992e24f63a66d885023c" alt=""
- moving the scene information into group shared memory to speed up the ray tracing code a lot
- problems on metal, slower unless passing data by value instead of by const reference
- look at the iOS implementation with an overview of performance tools for CPU and GPU
data:image/s3,"s3://crabby-images/34180/341805975494c6af31dc3ba64e9c23e9bf3ec1e0" alt=""
- now using Logarithmic depth buffer when available
- writes custom depth in a pixel shader, this does disable early depth optimizations but still a performance win for their use-case
data:image/s3,"s3://crabby-images/2c343/2c343c76f75aa305142df8364d7e6e2e9e550009" alt=""
- look at the implementation of Witcher 3 rendering from an outside perspective using RenderDoc
- breakdown of rendering Frame structure
- how normals are stored in the g-buffer
- explanation of a number of techniques from the d3d disassembly
- sun rendering
- blinking stars
- eye adaptation
- tonemapping
- vignette
data:image/s3,"s3://crabby-images/baa44/baa445e4873ce9f7b6c444ed08e0b4d1efd3e5b6" alt=""
- discussion of tradeoffs for the basis file format
- comparison of different images formats transcoded from the same source format
data:image/s3,"s3://crabby-images/81685/81685a995b21f85a8c15341790272e9d36a59714" alt=""
- extending previous work in CNN(Convolutional neural network) based denoising
- using a modular architecture that improves temporal stability and detail preservation
data:image/s3,"s3://crabby-images/68509/68509224ccb7d145757ec4273be42759dbe23c1d" alt=""
- the longer the CPU/GPU can idle, the less power is needed
- optimizing a CPU bound game might cause more frames to be rendered (hitting 60 fps instead of 30). causing less GPU idle time and increasing power consumption
- kernel heuristics might trigger higher frequency mode which increases battery usage significantly
- Tutorial about the implementation of water surface movement of using a flow map in unity
- how to deal with deformation of normals using derivative maps
data:image/s3,"s3://crabby-images/356bc/356bc775a619ee1a327872c867718d5dea2237f1" alt=""
- NVIDIA Nsight Systems visualizes system-wide application interactions across CPU and GPU
- Nsight
- Volta, Vulkan 1.1 and Cuda 9.2, are supported
- user configurable memory view
data:image/s3,"s3://crabby-images/31125/311253b452455e5e8d73243cd81b0c499979cf15" alt=""
-how to investigate GPU starvation and detect CPU/GPU synchronizations and overview of other tools
data:image/s3,"s3://crabby-images/8185e/8185eb99fbf10003385cc30eb4c46963195bd326" alt=""