Cache Points for Production-Scale Occlusion-Aware Many-Lights Sampling and Volumetric Scattering


Yining Karl Li, Walt Disney Animation Studios, Burbank, USA, karl.li@disneyanimation.com

Peter Kutz∗, Adobe, San Francisco, USA, peter.kutz@gmail.com

Charlotte Zhu, Walt Disney Animation Studios, Burbank, USA, charlotte.zhu@disneyanimation.com

Wei-Feng Wayne Huang∗, NVIDIA, Los Angeles, USA, wahuang@nvidia.com

Gregory Nichols∗, Latitude AI, Pittsburgh, USA, greg@nichols.pro

David Adler, Walt Disney Animation Studios, Burbank, USA, david.adler@disneyanimation.com

Brent Burley, Walt Disney Animation Studios, Burbank, USA, brent.burley@disneyanimation.com

Daniel Teece, Walt Disney Animation Studios, Burbank, USA, daniel.teece@disneyanimation.com

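(a) Uniform Light: RMSE 0.2280, Time 10:16

(b) Locally Optimal: RMSE 0.1436, Time 2:03:51

(c) Cache Points (Ours): RMSE 0.1443, Time 13:13

Figure 1: A production scene from Us Again containing 4881396 lights (analytical lights, emissive triangles, and emissive volumes), rendered using 32 samples per pixel with uniform light selection (a), locally optimal light selection (b), and our cache points system (c). Uniform light selection produces a faster result but converges poorly, whereas building a locally optimal light distribution at every path vertex produces a more converged result but is much slower. Our cache points system (c) produces a noise level similar to (b) while maintaining performance close to (a). To show noise differences clearly, this figure does not include the post-render compositing present in the final production frame. © 2024 Disney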

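Abstract

A defining capability of a production renderer is its ability to scale to scenes of extreme complexity, including complex illumination cast by a vast number of light sources. In this paper, we present Cache Points, the system used by Disney's Hyperion Renderer to perform efficient unbiased importance sampling of direct illumination in scenes containing up to millions of light sources. Our cache points system includes a number of novel features. We build a spatial data structure over the points that light sampling will occur from, rather than over the lights themselves. We learn occlusion online and incorporate it into our importance sampling distributions. We also accelerate sampling in difficult volume scattering cases.

Over the past decade, our cache points system has seen extensive production usage on every CG feature film and animated short produced by Walt Disney Animation Studios, enabling artists to design lighting environments without concern for complexity. In this paper, we survey how the cache points system is built, how it works, its impact on production lighting and artist workflows, and its role in the future of production rendering at Disney Animation.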

CCS Concepts

• Computing methodologies → Rendering; Ray tracing.

Keywords

path tracing, global illumination, light selection, importance sampling, volume rendering

ACM Reference Format:

Yining Karl Li, Charlotte Zhu, Gregory Nichols, Peter Kutz, Wei-Feng Wayne Huang, David Adler, Brent Burley, and Daniel Teece. 2024. Cache Points for Production-Scale Occlusion-Aware Many-Lights Sampling and Volumetric Scattering. In The Digital Production Symposium (DigiPro '24), July 27, 2024, Denver, CO, USA. ACM, New York, NY, USA, 19 pages. https://doi.org/10.1145/3665320.3670993

1 Introduction

A major challenge in production rendering is light sampling in scenes containing anywhere from a handful of lights to hundreds of thousands or even millions of lights. Moreover, the types of lighting scenarios that arise on any particular production are often unpredictable. A key principle in Hyperion's design is an emphasis on simplicity over flexibility [Burley et al. 2017]: we try not to burden users with non-artistic controls as much as possible. In accordance with this principle, we prefer systems that are sufficiently and automatically robust to as many production scenarios as possible; in this paper, we present an in-depth description of our system for guiding direct light sampling in our production scenes, from the simplest to the most complex lighting scenarios. Our system, cache points, builds locally optimal estimates for light sampling weights and incorporates an online learning metric for local light visibility estimates. Our system is able to (1) combine local estimates for analytical lights, emissive geometry, and emissive volumes into a single combined system, and (2) provide unbiased direct light sampling guiding for both surface points and points inside of participating media. Additionally, we have extended our system for use in importance sampling volumetric in-scattering in participating media. While we have previously alluded to our cache points system for solving the many-lights sampling problem [Burley et al. 2018; Fong et al. 2017; Nichols and Eisenacher 2015] and have described cache points for volumetric scattering [Huang et al. 2021]

$$\text{isNear}(e,p) = \text{distance}(x_{e},x_{p})^{2} \leq (r_{p} \cdot D)^{2}$$

where $x_{e}$ is the centroid of the light, $x_{p}$ is the position of cache point $p$, $r_{p}$ is the radius of cache point $p$, and $D$ is an adjustment term for the cache point separation distance, which we empirically determined should be set to 4.0 for best results.

We separate nearby lights from farther lights because irradiance from nearby lights can vary strongly with position and surface normal orientation. The nearby light distribution places all of its members into a single bin, whereas the far light distribution actually consists of seven bins, corresponding to seven virtual sensors at the cache point location: six oriented planes facing the cardinal directions and one omnidirectional receiver at the point center. We iterate over all lights in the scene and estimate the total contribution that each light makes to each sensor, ignoring occlusion, and in each bin we store a list of between 4 and 256 lights that account for 97% of the energy arriving at the point [Shirley et al. 1996]. We do not precompute anything for the nearby light distribution; instead, at render-time we build a light selection PDF on-the-fly for each path vertex over the irradiance contributions from the nearby lights to the exact path vertex location. We describe this in more detail in Section 3.3.

For each light in the six cardinal bins in the far light distribution, we directly store a precomputed irradiance estimate at the cache point location. For each light in the omnidirectional bin in the far light distribution, we directly store a precomputed direct fluence estimate at the cache point location. The omnidirectional bin serves a special purpose: for surfaces that have a well-defined normal, estimating irradiance makes sense since irradiance is integrated over an oriented 2D surface, but for cases where a well-defined normal is either difficult or impossible to define, we rely on a direct fluence estimate instead, since direct fluence is integrated over a 3D sphere. Since curve-based hair tends to have extremely complex and rapidly changing surface orientations, and volumetric participating media has no defined surface normal, we use direct fluence for driving light sampling for curves and volumes.

At this stage we only estimate these values for each light but defer building CDFs and PDFs until render-time cache point lookup. There are two reasons for this: first, as mentioned earlier, for nearby lights we can build a higher-quality sampling distribution on-the-fly at render-time, and second, Hyperion supports a sophisticated light linking system, which means that we cannot determine which lights to exclude from a light distribution until render-time evaluation of light linking relationships has been carried out.

3.1.4 Merging Neighboring Points with Similar Light Distributions. After building a light distribution at each cache point, we perform a second pruning step that merges nearby cache points that share similar light distributions. As in the first pruning step, for each cache point $p$, we gather the nearest 20 neighbors within a 1e-5 unit radius. We then calculate an average similarity metric $M_{avg}$ between cache point $p$'s light distribution and its nearest neighbors' light distributions. The similarity metric is calculated as follows: for two given sets of lights $A$ and $B$, we find the intersection of $A$ and $B$ (meaning the set of lights that are common to both sets) and then calculate the similarity metric $M$ as:

$$M = \frac{2 \cdot \text{size}_{\text{weighted}}(\text{intersection}(A,B))}{\text{size}(A) + \text{size}(B)}$$
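As a rough illustration, the similarity metric $M$, together with the per-light similarity $S$ that is defined in the text that follows, could be computed as in the Python sketch below. Representing a light distribution as a dictionary from light identifier to probability is an assumption made for this example, as are the function names; none of this is taken from Hyperion's implementation.

```python
def light_similarity(p_a, p_b):
    """Per-light similarity S; assumes p_a + p_b > 0 (non-negative, not both zero)."""
    return 1.0 - abs(p_b - p_a) / (2.0 * ((p_a + p_b) / 2.0))


def distribution_similarity(dist_a, dist_b):
    """Similarity metric M between two light distributions.

    dist_a, dist_b: dict mapping a (hypothetical) light id to its probability.
    Assumes at least one of the two distributions is non-empty.
    """
    shared = dist_a.keys() & dist_b.keys()  # lights present in both sets
    weighted_size = sum(light_similarity(dist_a[k], dist_b[k]) for k in shared)
    return 2.0 * weighted_size / (len(dist_a) + len(dist_b))
```

Under these assumptions, two identical distributions yield $M = 1$ and two distributions with no lights in common yield $M = 0$.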

We denote the size of the intersection of set $A$ and set $B$ as being weighted because we need to take into account the possibility that a given light may exist in both sets $A$ and $B$ but have different assigned probabilities in each set. So, instead of just counting up the number of lights in $\text{intersection}(A,B)$, we instead calculate a similarity percentage $S$ for each light, which we define as:

$$S = 1 - \frac{\left| P_{B} - P_{A} \right|}{2 \cdot \left( \frac{P_{A} + P_{B}}{2} \right)}$$

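where $P_{A}$ and $P_{B}$ denote the probabilities for a given light in sets $A$ and $B$, respectively. This definition for $S$ works well so long as the probability for any given light is non-negative, which we guarantee since Hyperion's lights only permit positive emission values; for systems where lights support negative emission values [Foundation 2024], a modified metric would be required. The weighted size of the intersection of sets $A$ and $B$ is then defined as the sum of $S$ for every light in the intersection. This approach for calculating a similarity metric between light distributions is somewhat ad-hoc, but in practice we have found that it works well. $M_{avg}$ is then defined as simply the sum of $M$ for every nearest neighbor cache point to $p$ divided by the number of nearest neighbors found.

Next, we calculate the average distance between cache point $p$ and its gathered nearest neighbors; we use this average distance as an initial guess for the radius of cache point $p$. We then adjust the cache point's radius to account for how similar the cache point is to its nearest neighbors; this is done by simply multiplying the radius by $M_{avg}$. Since a smaller $M_{avg}$ value indicates that the light distribution at $p$ is relatively dissimilar to those of its nearest neighbors, meaning that the direct illumination radiance field in this region of space varies at a higher frequency, shrinking $p$'s region of influence is reasonable, and vice versa. To prevent the shrinking radius from compromising the cache point's usefulness, we clamp the radius to no less than 25% of its original world-space size and guarantee a minimum projected screen-space size (equivalent to 3 pixels).

![image](https://aduvfx-1252404142.cos.ap-beijing.myqcloud.com/posts/cache-points-for-production-scale-occlusion-aware-many-lights-sampling-and-volum/rId34.jpg)

![image](https://aduvfx-1252404142.cos.ap-beijing.myqcloud.com/posts/cache-points-for-production-scale-occlusion-aware-many-lights-sampling-and-volum/rId37.jpg)

(a) Uniform Light: RMSE 0.0896, Time 12:02

(b) Locally Optimal: RMSE 0.0321, Time 17:48

(c) Cache Points (Ours): RMSE 0.0320, Time 15:01

Figure 4: A production scene from Encanto containing 38720 lights (analytical lights, emissive triangles, and emissive volumes), with complex occluding geometry and opacity masks, rendered using 32 samples per pixel with uniform light selection (a), locally optimal light selection (b), and our cache points system (c). In this case, building locally optimal distributions on-the-fly performs relatively well, while our cache points system performs comparably in sampling quality and requires less render time. © 2024 Disney

Finally, we iterate over the nearest neighbors previously found within $p$'s adjusted radius; for this subset of nearest neighbors, we keep the point with the lowest index and atomically mark all other points in the subset for deletion. Since the radius is adjusted according to the similarity between the cache points' light distributions, we consider points that fall within the adjusted radius to have sufficiently similar light distributions that they can be merged directly.

All of the above operations in the second pruning step are performed in a parallel loop over all cache points; a serial loop is then run to delete all cache points marked for deletion. After we have the final set of cache points, we rebuild the KD-tree one last time for use during rendering.

3.1.5 Blurring Light Distributions Across Cache Points. After building light lists at each cache point and merging points with similar light distributions, the final step in building our cache points data structure is to blur, or aggregate, the far light distributions between cache points. This blurring step makes the far light distribution of each cache point influenced by the far light distributions of neighboring cache points, making the light distributions of all cache points more spatially conservative; this step allows us to safely look up only a single cache point per path vertex during path tracing. We do not blur the nearby light distributions, since nearby light distributions typically vary at a higher frequency between neighboring cache points than far light distributions do.

For a given cache point $p$, the blurring step begins by finding the 16 nearest neighboring cache points to $p$ through a kNN search. Note that in previous steps we used 20 neighbors for kNN search operations, but in this step we choose 16; the rationale for choosing 16 is as follows. In three-dimensional space, the maximum number of equal-sized spheres that can be densely packed around another sphere of the same size is 12 [Dai et al. 2019; Hales et al. 2017], in either a face-centred cubic or hexagonal close packing configuration [Conway and Sloane 1999]. However, since our cache points have variable radii, we add an additional 4 points to the maximum perfect packing number of 12 as an empirically determined adjustment factor.

After gathering the 16 nearest neighbors, we next find the distance $d_{far}$ to the farthest gathered neighbor; we use $d_{far}$ to determine the relative contribution of each gathered cache point to cache point $p$. Cache point $p$ itself is given a relative weight of 1, the nearest gathered neighbor is also given a relative weight of 1, and the farthest gathered neighbor is given a relative weight of 1/16. For a point $p_{n}$ between the nearest and farthest points, the relative weight is assigned as follows: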

$$\text{weight}\left( p_{n} \right) = \text{mix}\left( 1,\frac{1}{16},\frac{\text{distance}\left( x_{p},x_{n} \right)^{2}}{\left( d_{far} \right)^{2}} \right)$$
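The weighting above, together with the normalization described next, can be sketched in Python as follows. This is only an illustrative sketch: it assumes `mix(a, b, t)` is the usual linear interpolation, follows the formula literally for every gathered neighbor, and uses hypothetical function names and plain coordinate tuples in place of Hyperion's data structures.

```python
def mix(a, b, t):
    """Linear interpolation (assumed meaning of mix): a at t = 0, b at t = 1."""
    return a * (1.0 - t) + b * t


def blur_weights(center, neighbors):
    """Normalized blur weights [w_center, w_neighbor_0, w_neighbor_1, ...].

    center: position of cache point p as a tuple of floats.
    neighbors: positions of the gathered nearest neighbors (non-empty list).
    """
    sq_dists = [sum((c - n) ** 2 for c, n in zip(center, nb)) for nb in neighbors]
    d_far_sq = max(sq_dists)  # squared distance to the farthest neighbor
    raw = [1.0]  # the cache point itself gets relative weight 1
    for d2 in sq_dists:
        # Guard against all neighbors coinciding with the center.
        t = d2 / d_far_sq if d_far_sq > 0.0 else 0.0
        raw.append(mix(1.0, 1.0 / 16.0, t))
    total = sum(raw)
    return [w / total for w in raw]
```

Because every neighbor receives a relative weight of at least 1/16, sixteen neighbors together carry a raw weight of at least 1, which matches the weight of the cache point itself; after normalization the neighbors therefore account for at least half of the total.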

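We then normalize the relative weights so that they sum to 1; the reason the farthest point is given a relative weight of 1/16 is to ensure that after normalization, the 16 gathered neighbors of cache point $p$ account for at least half of $p$'s final blurred light distribution. For each of the 16 gathered neighbors, we then take all of the lights from that neighbor's light distribution, multiply them by the neighbor's normalized relative weight, and add those lights to $p$'s light distribution. After blurring light distributions between neighboring points, we leave the light distributions unnormalized and do not yet compute CDFs; we defer this step until light sampling actually takes place, since we interpolate light distributions between the cardinal direction bins.

![image](https://aduvfx-1252404142.cos.ap-beijing.myqcloud.com/posts/cache-points-for-production-scale-occlusion-aware-many-lights-sampling-and-volum/rId40.jpg)

![image](https://aduvfx-1252404142.cos.ap-beijing.myqcloud.com/posts/cache-points-for-production-scale-occlusion-aware-many-lights-sampling-and-volum/rId43.jpg)

Figure 5: A production scene from Encanto containing 4406 lights (analytical lights, emissive triangles, and emissive volumes), with complex occlusion, rendered using 32 samples per pixel with uniform light selection (a), cache points without learned visibility estimates (b), and cache points with learned visibility estimates (c). In this scene, using learned visibility estimates improves RMSE by 9.3% while adding almost no additional render time. © 2024 Disney

Note that in this step, in order to allow parallel processing and to avoid repeatedly blurring the same points, we cannot perform the blur in place in memory; instead, we need to write the output blurred cache points to a new set of cache points and then swap memory.

## 3.2 Online Learning of Visibility Estimates

3.2.1 Visibility Estimation by Tracking Sample Ratios. Cache points are initially built without occlusion information; during rendering, we improve sampling by learning a visibility estimate for each light at each cache point. Our approach is conceptually somewhat similar to importance caching [Georgiev et al. 2012]. Each time we select a light from a cache point, we increment an internal sample attempt counter for that light within that cache point. We then perform direct lighting, and in the event that the sample successfully reaches the light and receives a useful light contribution, we atomically increment an internal successful sample counter for that same light within that same cache point. Hyperion uses a batched wavefront path tracing architecture where the rendering process is divided up into a number of discrete iterations [Eisenacher et al. 2013]; between iterations, we use the ratio $R$ between successful samples $H_{success}$ and total sample attempts $H_{total}$ towards a given light to adjust the sampling weight of that light; lights with a lower ratio of successful sampling attempts are weighted down, while lights with a higher ratio are weighted up. This process effectively corrects for cases where a bright light is initially identified as being important to a particular region of space but ends up not being important due to shadowing. The specific mechanism we use to assign every light $e$ in a cache point $p$ a visibility weight $W(p,e)$ based on the successful sampling attempt ratio $R$ is as follows: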

$$R(p,e) = \frac{H(p,e)_{\text{success}} + 1}{H(p,e)_{\text{total}} + 1}$$

$$W(p,e) = \begin{cases} \left( \frac{R(p,e)}{R_{\text{min}}} \right)^{2} & R(p,e) \leq R_{\text{min}} \\ 1.0 & \text{otherwise} \end{cases}$$
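A minimal Python sketch of these two formulas is shown below. The function names are hypothetical, and the threshold value `r_min = 0.1` used as a default is purely an assumption for illustration, since the value of $R_{\text{min}}$ is not given in this section.

```python
def visibility_ratio(successes, attempts):
    """Ratio R: the +1 terms make a never-sampled light start at R = 1."""
    return (successes + 1) / (attempts + 1)


def visibility_weight(successes, attempts, r_min=0.1):
    """Visibility weight W for one light at one cache point.

    r_min is an assumed placeholder threshold, not a value from the paper.
    """
    r = visibility_ratio(successes, attempts)
    if r <= r_min:
        # Mostly occluded light: weight falls off quadratically below r_min.
        return (r / r_min) ** 2
    return 1.0
```

Note that the weight never reaches zero for a finite number of attempts, so a light that has appeared fully occluded so far can still be selected occasionally.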

$$L(\mathbf{x},\omega) = \int_{0}^{d}\bar{T}(\mathbf{x},\mathbf{y})\left( \mu_{a}(\mathbf{y})L_{e}(\mathbf{y},\omega) + \mu_{s}(\mathbf{y})L_{s}(\mathbf{y},\omega) + \mu_{n}(\mathbf{y})L(\mathbf{y},\omega) \right)dt, \qquad (7)$$

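where $\mathbf{y} = \mathbf{x} - t \times \omega$. Null-collision techniques add fictitious null particles to a heterogeneous volume, producing a fictitious homogeneous volume through which free-flight distances along a ray can be sampled analytically using a PDF proportional to the combined transmittance $\bar{T}$, which is formed from the constant combined extinction coefficient $\bar{\mu}$. $\bar{\mu}$ is in turn the sum of the volume coefficients for the three possible event types: absorption $\mu_{a}$, scattering $\mu_{s}$, and null-collision $\mu_{n}$. A Monte Carlo estimator can then selectively evaluate these events with probabilities $P_{s}$ (for scattering events), $P_{a}$ (for absorption events), and $P_{n}$ (for null-collision events).

## 4.2 Volumetric In-scattering Sampling

We use the cache points system to learn an important term in Equation 7: the product of scattering $\mu_{s}$ and in-scattered radiance $L_{s}$, which together give the result of volumetric in-scattering. During cache point initialization, when we iterate over every light for each cache point, in addition to estimating each light's total contribution at each cache point, we also compute a scattering sample weight $s$ that approximates the integral over solid angle of the product of $\mu_{s}(\mathbf{y})$, the incident radiance $L(\mathbf{y},\omega^{\prime})$, and the phase function $\rho(\mathbf{y},\omega,\omega^{\prime})$, where $\mathbf{y}$ is a randomly sampled position within the cache point's radius of influence, and $\omega^{\prime}$ and $\omega$ are the directions from $\mathbf{y}$ to a randomly sampled position on the light and to the camera origin, respectively (Figure 8a). This weight $s$ is stored per light per cache point (Figure 8b).

During volumetric path tracing, we query the nearest cache points along the ray and use the weight $s$ stored in each cache point to build a piecewise-linear 1D CDF from which to draw scattering samples (Figure 8b). Since building this 1D CDF can represent a large per-ray overhead in some cases, we currently only use this sampling strategy for direct lighting samples along camera rays. Restricting the use of this technique to camera rays still allows us to significantly improve visually prominent single-scattering effects while keeping the overall performance overhead low.

To efficiently render cases such as high-order scattering in optically thick volumes, we combine our technique with conventional transmittance-based sampling using multiple importance sampling (MIS) [Miller et al. 2019]. We start by using the 1D CDF to pick a scattering point $x_{k}$, and then we use ratio tracking moving towards the scattering point to update the path's PDF. This process is repeated until distance sampling steps the ray through the selected scattering point; we can represent this process as: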

$$p_{cachepoint}(\bar{x}) = p_{select}(x_{k})\,\bar{T}(x_{0},x_{1})\,\bar{\mu}(x_{1})\bar{T}(x_{1},x_{2})\,\bar{\mu}(x_{2})\bar{T}(x_{2},x_{3})\ldots\bar{\mu}(x_{k - 1})\bar{T}(x_{k - 1},x_{k}) \qquad (8)$$

where each $\bar{T}(x_{n},x_{n + 1})\bar{\mu}(x_{n})$ is the result of step $n$. To formulate the same path using null-collision tracking to get the PDF, we use the sampled distance and $\bar{\mu}$ to compute $\bar{T}$, and we already know $P_{n}$ and $P_{s}$ based on our choice of tracking algorithm. For all of the path vertices found before our selected scattering point, we apply the PDF $P_{n}$ and repeat the distance sampling process and update the corresponding PDFs until we reach the selected scattering point and apply PDF $P_{s}$; this process gives us:

$$p_{null}(\bar{x}) = \bar{T}(x_{0},x_{1})\bar{\mu}(x_{1})P_{n}(x_{1})\,\bar{T}(x_{1},x_{2})\bar{\mu}(x_{2})P_{n}(x_{2})\ldots\bar{T}(x_{k - 1},x_{k})\bar{\mu}(x_{k})P_{s}(x_{k}) \qquad (9)$$

![image](https://aduvfx-1252404142.cos.ap-beijing.myqcloud.com/posts/cache-points-for-production-scale-occlusion-aware-many-lights-sampling-and-volum/rId63.jpg)

![image](https://aduvfx-1252404142.cos.ap-beijing.myqcloud.com/posts/cache-points-for-production-scale-occlusion-aware-many-lights-sampling-and-volum/rId66.jpg)

![image](https://aduvfx-1252404142.cos.ap-beijing.myqcloud.com/posts/cache-points-for-production-scale-occlusion-aware-many-lights-sampling-and-volum/rId69.jpg)

Figure 9: A scene consisting of bright lights embedded in a heterogeneous volume with low (top) and high (bottom) extinction coefficients. Null-collision tracking alone (left) does not work well with thinner volumes; our cache points based technique (middle) performs well with thin volumes but has trouble with thicker volumes. Combining both techniques through multiple importance sampling (right) efficiently samples both the thin and thick volume cases. Results shown are at equal sample counts. © 2024 Disney

While the path PDFs represented by Equations 8 and 9 look very long, most of the terms cancel out to form a much simpler final expression for the MIS weight:

$$p_{null}(\bar{x}) : p_{cachepoint}(\bar{x}) = P_{n}(x_{1})\ldots P_{n}(x_{k - 1})\,\bar{\mu}(x_{k})P_{s}(x_{k}) : p_{select}(x_{k})$$
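To make the use of this ratio concrete, the Python sketch below turns it into a pair of MIS weights. It assumes the balance heuristic, which is a common choice but is not stated in the text above; the function and parameter names are likewise hypothetical.

```python
def pdf_ratio_null_to_cachepoint(p_n_steps, mu_bar_k, p_s_k, p_select_k):
    """Ratio p_null / p_cachepoint after the shared terms cancel.

    p_n_steps: null-collision probabilities P_n(x_1) ... P_n(x_{k-1}).
    mu_bar_k, p_s_k: combined extinction and scattering probability at x_k.
    p_select_k: PDF with which the 1D CDF selected the scattering point x_k.
    """
    ratio = mu_bar_k * p_s_k / p_select_k
    for p_n in p_n_steps:
        ratio *= p_n
    return ratio


def balance_heuristic_weights(ratio):
    """Balance-heuristic weights (w_cachepoint, w_null) from ratio = p_null / p_cachepoint."""
    w_cachepoint = 1.0 / (1.0 + ratio)
    return w_cachepoint, 1.0 - w_cachepoint
```

Working with the ratio rather than the two full path PDFs avoids evaluating the long products of transmittance and extinction terms, which are identical in both PDFs.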

We demonstrate our technique working in conjunction with conventional null-scattering through MIS in an equal-sample comparison in Figure 9 and in an equal-time production comparison in Figure 10. Compared to equi-angular sampling [Kulla and Fajardo 2012], our cache points based method performs better when rendering highly anisotropic volumes since our approach effectively factors in the phase function term. Additionally, our approach bypasses the need to sample a light vertex before performing distance sampling; instead, we rely on the cache points system's scattering sample weight $s$ as a global estimate of direct illumination.

![image](https://aduvfx-1252404142.cos.ap-beijing.myqcloud.com/posts/cache-points-for-production-scale-occlusion-aware-many-lights-sampling-and-volum/rId73.jpg)

Figure 10: A production scene from Raya and the Last Dragon containing a small bright light source embedded in a thin heterogeneous volume. Equal-time comparison of a conventional null-collision approach utilizing spectral-decomposition tracking (left) and incorporating our cache point based in-scattering sampling via MIS (right). Combining our cache points based in-scattering sampling with null-collision tracking produces a robust technique that works well both further from (top two rows) and closer to (bottom row) small bright light sources. © 2024 Disney

## 4.3 Volumetric Emission Sampling

In cases where highly emissive heterogeneous volumes are embedded in thin anisotropic media, our cache points approach for sampling volumetric in-scattering needs to be combined with a method for efficiently gathering volumetric emission. The need for an additional emission gathering method can be seen from how $\bar{\mu}$ is usually chosen: as a majorant of $\mu_{t} = \mu_{a} + \mu_{s}$, with medium events selected with probabilities $P_{n} = \frac{\mu_{n}}{\bar{\mu}}$, $P_{a} = \frac{\mu_{a}}{\bar{\mu}}$, and $P_{s} = \frac{\mu_{s}}{\bar{\mu}}$. In cases where the volume emission function is strongly uncorrelated with the extinction function, such as in highly emissive volumes with low extinction, null-tracking is likely to step over potentially highly emissive regions entirely.

We make use of the observation from [Kutz et al. 2017] that $\bar{\mu}$, $P_{a}$, $P_{s}$, and $P_{n}$ can be treated as arbitrary uncorrelated parameters as long as their contributions are counter-balanced by appropriate sample weights. To force ratio tracking [Novák et al. 2014] to take more steps in highly emissive regions, instead of setting $\bar{\mu}$ to the local maximum of $\mu_{t}$, we choose $\bar{\mu} = \max(\mu_{t}, \mu_{a}L_{e})$. We always set $\bar{\mu}$ to be less than the average voxel size to avoid performing too many lookups within a single voxel. Since absorption and null-collision events do not require tracing a new ray, we set $P_{a} = P_{n} = 1$, which makes the tracker gather emission at every free-path sample, yielding a higher quality emission estimate per ray (Algorithm 1).

| Algorithm 1 |
| --- |
| function EVALUATEEMISSION(x, ω, d) |
| &nbsp;&nbsp;w ← 1, Le ← 0, t ← 0 |
| &nbsp;&nbsp;repeat |
| &nbsp;&nbsp;&nbsp;&nbsp;Δt ← −ln(1 − ζ)/μ̄ |
| &nbsp;&nbsp;&nbsp;&nbsp;x ← x − Δt × ω |
| &nbsp;&nbsp;&nbsp;&nbsp;Le ← Le + w × μa(x) × Le(x)/μ̄ |
| &nbsp;&nbsp;&nbsp;&nbsp;w ← w × μn(x)/μ̄ |
| &nbsp;&nbsp;until (t ← t + Δt) > d |
| &nbsp;&nbsp;return Le |
| end function |

Next, in order to make our technique usable for next event estimation, we need to not only better evaluate emission from heterogeneous volumes but also sample and evaluate the PDF of the sampled directions. To do so, we first extend Villemin and Hery [2013]: we use an emission-energy-distribution grid, which is just a coarser version of the volume, in order to make sure that more emissive regions of the volume have a higher chance of receiving light samples. In Villemin and Hery [2013], point sampling is used, but point sampling can be sub-optimal when emission is occluded by heavy smoke or when the emissive region is large; in these cases, high sample counts are required to capture emission details in glossy reflections. Instead, we use our emission-optimized tracker to evaluate every tracking point along the ray, gathering more information in each light sample and effectively performing line integration. Finally, in order to use MIS to combine BSDF samples with our emissive volume light samples in the solid angle domain [Simon et al. 2017], we track which cells in the emission-energy-distribution grid the light sample ray has passed through and integrate the PDFs stored in each of these cells using a Jacobian transform:

$$\begin{aligned} p_{\sigma}(\omega) &= \int_{0}^{\infty}p_{x}(t)\,t^{2}\,dt \\ &= P_{0}(t_{1}^{3} - t_{0}^{3})/3 + P_{1}(t_{2}^{3} - t_{1}^{3})/3 + P_{2}(t_{3}^{3} - t_{2}^{3})/3 \end{aligned}$$
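The closed form above can be evaluated for any number of crossed cells, as in the Python sketch below. It reads $P_{i}$ as the constant volume-domain PDF stored in the $i$-th grid cell that the ray crosses and $t_{i}$, $t_{i+1}$ as the distances at which the ray enters and leaves that cell; this reading, and the function name, are our interpretation for illustration rather than details confirmed by the text.

```python
import math


def solid_angle_pdf(cell_pdfs, boundaries):
    """Integrate a piecewise-constant volume PDF along a ray into a solid-angle PDF.

    cell_pdfs[i] is the constant PDF on [boundaries[i], boundaries[i + 1]].
    Each segment contributes P_i * (t_{i+1}^3 - t_i^3) / 3, the integral of P_i * t^2.
    """
    if len(boundaries) != len(cell_pdfs) + 1:
        raise ValueError("need exactly one more boundary than cells")
    total = 0.0
    for p, t0, t1 in zip(cell_pdfs, boundaries[:-1], boundaries[1:]):
        total += p * (t1 ** 3 - t0 ** 3) / 3.0
    return total


# Sanity check: a uniform density inside a ball of radius r centered at the
# ray origin should integrate to the uniform solid-angle PDF 1 / (4 * pi).
r = 2.0
uniform_ball_pdf = 3.0 / (4.0 * math.pi * r ** 3)
assert math.isclose(solid_angle_pdf([uniform_ball_pdf], [0.0, r]), 1.0 / (4.0 * math.pi))
```

The $t^{2}$ factor is the Jacobian between volume measure and solid-angle measure along the ray, which is why cells farther from the ray origin contribute more for the same stored PDF value.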