Can NeRF be Used in Game?

Link Copied!

1. Introduction

In this article, we'll explore some of the limitations of using NeRF (Neural Radiance Fields) in game production and how to work around them.

kor version: link

Can NeRF be Used in Game Production?

As discussed in "Creating Realistic 3D Models with NeRF," NeRF is a technique for reconstructing 3D models from a set of images taken from various angles. With the ability to easily obtain high-quality 3D models, NeRF appears to be a promising technology for game development. But are NeRF models ready to be integrated into game production?

Gaussian RT — **Figure 1.** NeRF Reconstruction of Cookie Box

Can we use the 3D model of a cookie box, as an object in a game? Unfortunately, the answer is still 'No'.

Several limitations need to be addressed before NeRF can be used directly in commercial applications or game production. In this post, I'll outline the primary obstacles preventing NeRF's direct application in game development and the solutions being developed to overcome them.

2. NeRF & Illumination Control

NeRF's Inability to Separate Lighting Effects

The image above shows a set of 3D objects generated with NeRF. While the models are of high quality, there is a significant limitation:
The shadows are fixed.

Why is this problematic?

Imagine rotating an object in your space—the direction of the shadow would change with the object's movement, right? However, in the case of a NeRF model, the light and object remain static, meaning the shadow stays consistent with what an observer would see when moving around the object.

In other words, NeRF models cannot separate the reflections of the captured light from the object itself. This is a critical flaw, especially considering that Physics-Based Rendering (PBR), essential for creating realistic games, relies on realistic lighting effects.

Even in Phong shading, one of the simplest forms of light shading, the reflection effects are handled by three components:

Ambient: Represents the ambient light.
Diffuse: Represents the scattered light.
Specular: Represents the shiny reflections.

A model trained with NeRF cannot separate these lighting effects, resulting in a 3D model where the captured lighting conditions are baked into the object. To apply even basic relighting, we need to separate at least the diffuse color, which closely resembles the intrinsic color of the object.

Separating Lighting Effects from NeRF

So, how can we separate lighting effects in NeRF models? There are several methods, but one simple approach involves modifying the model architecture to output specular and diffuse components separately.

Unlike the standard NeRF network structure, which takes coordinates $x$ (spatial location) and $d$ (direction) as inputs and outputs an RGB color $c$, a modified NeRF model outputs both specular color $c_s$ and diffuse color $c_d$.

In this structure, the Spatial MLP not only outputs the diffuse color $c_d$ but also the normal vector $n$ at that location. This normal vector is used to represent the reflected light more accurately through a technique called 'reflection reparameterization.' Consequently, the reflected light parameter $w_r$ is input into the Directional MLP, which then outputs the specular color $c_s$.

Understanding the reflection reparameterization formula might not be straightforward initially, but consider the following scenario to understand why we can represent reflected light $w_r$ through the normal vector and the observation direction.

As illustrated above, the reflected light $w_r$ at a point on an object has the following relationship with the observer's direction $d$ and normal $n$:

$$ \frac{(w_r + d)}{2} = (d \cdot n) \cdot n $$

By slightly modifying this relationship, we can derive the reflection reparameterization used earlier. Below is an example of how the lighting effects appear using this method.

Full	Diffuse	Specular

Full	Diffuse	Specular

While this does not achieve perfect diffuse-specular separation, it shows that the object's color can be modeled independently of the lighting effects to some extent, compared to the full NeRF model on the left. Using only the diffuse NeRF model in the middle allows for relighting to be applied.

3. Other Challenges and Solutions

Slow Rendering Speed

NeRF stores information in a 3D space through a neural network, meaning that to retrieve scene information, we must pass it through the network. Regardless of how lightweight the network is, the speed difference between directly reading stored information and obtaining it through the network is significant. This results in slow rendering speeds, a major drawback of NeRF.

However, this limitation has been largely mitigated with methods like Instant-NGP, Plenoxels, TriMipRF, and Zip-NeRF, which store features in select 3D locations and use interpolation to reduce computational load.

Game Engine Compatibility

Since Neural Rendering is not compatible with the conventional renderers used by game or graphics engines, a custom renderer must be created to use NeRF. However, the slow rendering speed of NeRF, as mentioned earlier, presents a significant barrier to practical use. Therefore, in most cases, objects created with NeRF should be converted to explicit meshes using methods like Marching Cubes.

Due to the semi-transparent nature of NeRF opacity, creating high-quality meshes is challenging. However, new meshing techniques for neural rendering, such as nerf2mesh and flexicubes, are emerging to address this issue.

Recently, Gaussian Splatting, which uses explicit radiance fields and rasterization, has also been introduced. Gaussian Splatting is naturally compatible with game engines and is rapidly evolving.

4. Neural Rendering in Game Engines

In the early stage of Neural Rendering, neural rendering technologies faced several limitations that hindered their use in-game production and other applications.

However, as these challenges are gradually being resolved, novel methods are emerging that enable neural rendering technologies in commercial game engines. Technology startups focusing on this technology are also appearing.

The video above demonstrates the use of NeRF-restored objects alongside other 3D objects in graphics software. In the video, the table is an object restored through NeRF, while the dinosaur and butterfly are predefined presets.

Additionally, NeRF and neural rendering technologies can be used not only for restoring and utilizing individual objects but also for reconstructing large real-world spaces to serve as maps in games.

The video above shows an example of capturing the NCSOFT backyard park, reconstructing it into a 3D space, and using it in a game.

To efficiently capture the space, a special camera with a wide field of view was used to scan the entire 360-degree area, as shown in the video, and then reconstructed using neural rendering technology.

Not only does it replicate the appearance exactly as photographed, but it can also be used as a game environment with a unique atmosphere, as seen in the latter part of the video. I'm looking forward to seeing many exciting games based on neural rendering technology when combined with innovative ideas.

If you are more interested, please refer to my project: Neural Rendeing in Game Engine