40 files changed, 561 insertions, 167 deletions
diff --git a/cnn_v3/docs/HOWTO.md b/cnn_v3/docs/HOWTO.md index 08979e7..5c5cc2a 100644 --- a/cnn_v3/docs/HOWTO.md +++ b/cnn_v3/docs/HOWTO.md @@ -22,57 +22,141 @@ It rasterizes proxy geometry to MRT G-buffer textures and packs them into two ### Adding to a Sequence -`GBufferEffect` does not exist in `seq_compiler.py` as a named effect yet -(no `.seq` syntax integration for Phase 1). Wire it directly in C++ alongside -your scene code, or add it to the timeline when the full CNNv3Effect is ready. +Both `GBufferEffect` and `GBufViewEffect` are registered in `seq_compiler.py` +(`CLASS_TO_HEADER`) and can be wired directly in `timeline.seq`. -**C++ wiring example** (e.g. inside a Sequence or main.cc): +**Debug view (G-buffer → sink)**: +```seq +SEQUENCE 12.00 0 "cnn_v3_test" + NODE gbuf_feat0 gbuf_rgba32uint + NODE gbuf_feat1 gbuf_rgba32uint + EFFECT + GBufferEffect source -> gbuf_feat0 gbuf_feat1 0.00 8.00 + EFFECT + GBufViewEffect gbuf_feat0 gbuf_feat1 -> sink 0.00 8.00 +``` -```cpp -#include "../../cnn_v3/src/gbuffer_effect.h" +**Full CNN pipeline**: +```seq +SEQUENCE 12.00 0 "cnn_v3_test" + NODE gbuf_feat0 gbuf_rgba32uint + NODE gbuf_feat1 gbuf_rgba32uint + NODE cnn_v3_out gbuf_albedo + EFFECT + GBufferEffect source -> gbuf_feat0 gbuf_feat1 0.00 8.00 + EFFECT + CNNv3Effect gbuf_feat0 gbuf_feat1 -> cnn_v3_out 0.00 8.00 + EFFECT + Passthrough cnn_v3_out -> sink 0.00 8.00 +``` -// Allocate once alongside your scene -auto gbuf = std::make_shared<GBufferEffect>( - ctx, /*inputs=*/{"prev_cnn"}, // or any dummy node - /*outputs=*/{"gbuf_feat0", "gbuf_feat1"}, - /*start=*/0.0f, /*end=*/60.0f); +### Internal scene -gbuf->set_scene(&my_scene, &my_camera); +Call `set_scene()` once before the first render to populate the built-in demo +scene. No external `Scene` or `Camera` pointer is required — the effect owns +them. 
-// In render loop, call before CNN pass: -gbuf->render(encoder, params, nodes); -``` +**What `set_scene()` creates:** +- **20 small cubes** — random positions in [-2,2]×[-1.5,1.5]×[-1.5,1.5], scale 0.1–0.25, + random colors. Each has a random rotation axis and speed; animated each frame + via `quat::from_axis(axis, time * speed)`. +- **4 pumping spheres** — at fixed world positions, base radii 0.25–0.35. + Scale driven by `audio_intensity`: `r = base_r * (1 + audio_intensity * 0.8)`. +- **Camera** — position (0, 2.5, 6), target (0, 0, 0), 45° FOV. + Aspect ratio updated each frame from `params.aspect_ratio`. +- **Two directional lights** (uploaded to `lights_uniform_`, ready for shadow pass): + - Key: warm white, color (1.0, 0.92, 0.78), direction `normalize(1, 2, 1)` (upper-right-front) + - Fill: cool blue, color (0.4, 0.45, 0.8) at intensity 0.4, direction `normalize(-1, 1, -1)` (upper-left-back) ### Internal passes Each frame, `GBufferEffect::render()` executes: -1. **Pass 1 — MRT rasterization** (`gbuf_raster.wgsl`) +1. **Pass 1 — MRT rasterization** (`gbuf_raster.wgsl`) ✅ - Proxy box (36 verts) × N objects, instanced - MRT outputs: `gbuf_albedo` (rgba16float), `gbuf_normal_mat` (rgba16float) - Depth test + write into `gbuf_depth` (depth32float) + - `obj.type` written to `ObjectData.params.x`, consumed by the type-aware SDF in Pass 2 + +2. **Pass 2 — SDF shadow raymarching** (`gbuf_shadow.wgsl`) ✅ + - See implementation plan below. -2. **Pass 2/3 — SDF + Lighting** — TODO (placeholder: shadow=1, transp=0) +3. **Pass 3 — Transparency** — TODO (deferred; transp=0 for opaque scenes) -3. **Pass 4 — Pack compute** (`gbuf_pack.wgsl`) +4. **Pass 4 — Pack compute** (`gbuf_pack.wgsl`) ✅ - Reads all G-buffer textures + `prev_cnn` input - Writes `feat_tex0` + `feat_tex1` (rgba32uint, 20 channels, 32 bytes/pixel) + - The transp node is cleared to 0.0 via a zero-draw render pass until Pass 3 is implemented; the shadow node is cleared to 1.0 only as a fallback when the shadow pipeline is unavailable.
### Output node names -By default the outputs are named from the `outputs` vector passed to the -constructor. Use these names when binding the CNN effect input: +Outputs are named from the `outputs` vector passed to the constructor: ``` outputs[0] → feat_tex0 (rgba32uint: albedo.rgb, normal.xy, depth, depth_grad.xy) outputs[1] → feat_tex1 (rgba32uint: mat_id, prev.rgb, mip1.rgb, mip2.rgb, shadow, transp) ``` -### Scene data +--- + +## 1b. GBufferEffect — Implementation Plan (Pass 2: SDF Shadow) + +### What remains + +| Item | Status | Notes | +|------|--------|-------| +| Pass 1: MRT raster | ✅ Done | proxy box, all object types | +| Pass 4: Pack compute | ✅ Done | 20 channels packed | +| Internal scene + animation | ✅ Done | cubes + spheres + 2 lights | +| Pass 2: SDF shadow | ✅ Done | `gbuf_shadow.wgsl`, proxy-box SDF per object | +| Pass 3: Transparency | ❌ TODO | low priority, opaque scenes only | +| Phase 4: type-aware SDF | ✅ Done | switch on `obj.params.x` in `dfWithID` | + +### Pass 2: SDF shadow raymarching + +**New file: `cnn_v3/shaders/gbuf_shadow.wgsl`** — fullscreen render pass. + +Bind layout (depth is read with `textureLoad`, so no sampler binding is needed): + +| Binding | Type | Content | +|---------|------|---------| +| 0 | `uniform` | `GlobalUniforms` (`#include "common_uniforms"`) | +| 1 | `storage read` | `ObjectsBuffer` | +| 2 | `texture_depth_2d` | depth from Pass 1 | +| 3 | `uniform` | `GBufLightsUniforms` (2 lights) | + +Algorithm per fragment: +1. Reconstruct world position from NDC depth + `globals.inv_view_proj` +2. For each object: transform to local space via `inv_model` and evaluate the type-aware SDF (proxy `sdBox(local_p, vec3(1.0))` for cubes; see Phase 4 below) +3. For each light: offset ray origin by `0.02 * surface_normal`; march shadow ray toward `light.direction` +4. Soft shadow via `shadowWithStoredDistance()` from `render/raymarching_id` +5. Combine lights: `shadow = min(shadow_light0, shadow_light1)` +6. Return fully lit (shadow = 1.0) for fragments where depth == 1.0 (sky/background) +7. Output shadow factor to RGBA8Unorm render target (`.r` = shadow) + +**C++ additions (`gbuffer_effect.h/.cc`):** +```cpp +RenderPipeline shadow_pipeline_; +void create_shadow_pipeline(); +``` +In `render()` between Pass 1 and the shadow/transp node clears: +- Build bind group (global_uniforms_buf_, objects_buf_, depth_view, lights_uniform_) +- Run fullscreen triangle → `node_shadow_` color attachment +- Keep `clear_r8_node(node_shadow_, 1.0f)` only as a fallback for when the shadow pipeline is unavailable + +**Register:** +- `cnn_v3/shaders/gbuf_shadow.wgsl` → `SHADER_GBUF_SHADOW` in `assets.txt` +- `extern const char* gbuf_shadow_wgsl;` in `gbuffer_effect.cc` + +### Phase 4: Object-type-aware SDF + +Branch on `obj.params.x` (populated since this commit) using `math/sdf_shapes`: + +| Type value | ObjectType | SDF | +|------------|-----------|-----| +| 0 | CUBE | `sdBox(local_p, vec3(1))` | +| 1 | SPHERE | `sdSphere(local_p, 1.0)` | +| 2 | PLANE | `sdPlane(local_p, vec3(0,1,0), obj.params.y)` | +| 3 | TORUS | `sdTorus(local_p, vec2(0.8, 0.2))` | -Call `set_scene(scene, camera)` before the first render. The effect uploads -`GlobalUniforms` (view-proj, camera pos, resolution) and `ObjectData` (model -matrix, color) to GPU storage buffers each frame. +Implemented in `dfWithID` (`gbuf_shadow.wgsl`) alongside Pass 2.
--- @@ -253,7 +337,7 @@ Test vectors generated by `cnn_v3/training/gen_test_vectors.py` (PyTorch referen | Phase | Status | Notes | |-------|--------|-------| | 1 — G-buffer (raster + pack) | ✅ Done | Integrated, 36/36 tests pass | -| 1 — G-buffer (SDF + shadow passes) | TODO | Placeholder: shadow=1, transp=0 | +| 1 — G-buffer (SDF shadow pass) | ✅ Done | `gbuf_shadow.wgsl`, proxy-box SDF | | 2 — Training infrastructure | ✅ Done | blender_export.py, pack_*_sample.py | | 3 — WGSL U-Net shaders | ✅ Done | 5 compute shaders + cnn_v3/common snippet | | 4 — C++ CNNv3Effect | ✅ Done | FiLM uniform upload, 36/36 tests pass | @@ -386,13 +470,13 @@ auto gview = std::make_shared<GBufViewEffect>(ctx, | Row | Col 0 | Col 1 | Col 2 | Col 3 | |-----|-------|-------|-------|-------| -| 0 | `alb.r` (red tint) | `alb.g` (green tint) | `alb.b` (blue tint) | `nrm.x` remap→[0,1] | +| 0 | `alb.r` | `alb.g` | `alb.b` | `nrm.x` remap→[0,1] | | 1 | `nrm.y` remap→[0,1] | `depth` (inverted) | `dzdx` ×20+0.5 | `dzdy` ×20+0.5 | | 2 | `mat_id` | `prev.r` | `prev.g` | `prev.b` | | 3 | `mip1.r` | `mip1.g` | `mip1.b` | `mip2.r` | | 4 | `mip2.g` | `mip2.b` | `shadow` | `transp` | -1-pixel gray grid lines separate cells. Dark background for out-of-range cells. +All channels displayed as grayscale. 1-pixel gray grid lines separate cells. Dark background for out-of-range cells. **Shader binding layout** (no sampler needed — integer texture): diff --git a/cnn_v3/shaders/gbuf_shadow.wgsl b/cnn_v3/shaders/gbuf_shadow.wgsl new file mode 100644 index 0000000..0f5f8b4 --- /dev/null +++ b/cnn_v3/shaders/gbuf_shadow.wgsl @@ -0,0 +1,99 @@ +// G-buffer shadow raymarching shader for CNN v3 +// Pass 2: Reads depth from Pass 1, marches shadow rays toward lights, +// outputs shadow factor (1.0=lit, 0.0=shadow) to RGBA8Unorm render target (.r). 
+ +#include "common_uniforms" +#include "camera_common" +#include "math/sdf_shapes" +#include "render/raymarching_id" + +@group(0) @binding(0) var<uniform> globals: GlobalUniforms; +@group(0) @binding(1) var<storage, read> object_data: ObjectsBuffer; +@group(0) @binding(2) var depth_tex: texture_depth_2d; + +struct GBufLight { + direction: vec4f, // xyz = toward light (world space, normalized) + color: vec4f, // rgb = color, a = intensity +} +struct GBufLightsUniforms { + lights: array<GBufLight, 2>, + params: vec4f, // x = num_lights +} +@group(0) @binding(3) var<uniform> lights: GBufLightsUniforms; + +// ---- SDF scene (proxy box per object in local space) ---- + +// Stub required by render/raymarching (shadow() / rayMarch() call df()). +fn df(p: vec3f) -> f32 { return MAX_RAY_LENGTH; } + +// SDF of the full scene: proxy box for each object transformed to local space. +fn dfWithID(p: vec3f) -> RayMarchResult { + var res: RayMarchResult; + res.distance = MAX_RAY_LENGTH; + res.distance_max = MAX_RAY_LENGTH; + res.object_id = 0.0; + + let n = u32(globals.params.x); + for (var i = 0u; i < n; i++) { + let obj = object_data.objects[i]; + let lp = (obj.inv_model * vec4f(p, 1.0)).xyz; + let obj_type = u32(obj.params.x); + var d: f32; + switch obj_type { + case 1u: { d = sdSphere(lp, 1.0); } // SPHERE + case 2u: { d = sdPlane(lp, vec3f(0.0, 1.0, 0.0), obj.params.y); } // PLANE + case 3u: { d = sdTorus(lp, vec2f(0.8, 0.2)); } // TORUS + default: { d = sdBox(lp, vec3f(1.0)); } // CUBE (0) + fallback + } + if (d < res.distance) { + res.distance = d; + res.object_id = f32(i + 1u); + } + } + return res; +} + +// ---- Vertex: fullscreen triangle ---- + +@vertex +fn vs_main(@builtin(vertex_index) vid: u32) -> @builtin(position) vec4f { + let x = f32((vid & 1u) << 2u) - 1.0; + let y = f32((vid & 2u) << 1u) - 1.0; + return vec4f(x, y, 0.0, 1.0); +} + +// ---- Fragment: shadow factor per pixel ---- + +@fragment +fn fs_main(@builtin(position) pos: vec4f) -> @location(0) vec4f { + 
let depth = textureLoad(depth_tex, vec2i(pos.xy), 0); + + // Sky / background: fully lit. + if (depth >= 1.0) { + return vec4f(1.0); + } + + // Reconstruct world-space position from NDC + depth. + let res = globals.resolution; + let ndc = vec2f( + (pos.x / res.x) * 2.0 - 1.0, + 1.0 - (pos.y / res.y) * 2.0 + ); + let clip = globals.inv_view_proj * vec4f(ndc, depth, 1.0); + let world = clip.xyz / clip.w; + + // Surface normal estimated from SDF gradient. + let nor = normalWithID(world); + let bias_pos = world + nor * 0.02; + + // March shadow rays toward each light; take the darkest value. + var shadow_val = 1.0; + let num_lights = u32(lights.params.x); + for (var i = 0u; i < num_lights; i++) { + let ld = lights.lights[i].direction.xyz; + let s = shadowWithStoredDistance(bias_pos, ld, MAX_RAY_LENGTH); + shadow_val = min(shadow_val, s); + } + + return vec4f(shadow_val, shadow_val, shadow_val, 1.0); +} diff --git a/cnn_v3/shaders/gbuf_view.wgsl b/cnn_v3/shaders/gbuf_view.wgsl index f2ae085..a5e6c91 100644 --- a/cnn_v3/shaders/gbuf_view.wgsl +++ b/cnn_v3/shaders/gbuf_view.wgsl @@ -93,10 +93,5 @@ fn fs_main(@builtin(position) pos: vec4f) -> @location(0) vec4f { disp = clamp(v, 0.0, 1.0); } - // Albedo channels: tint for identification (ch0=red, ch1=green, ch2=blue) - if (ch == 0u) { return vec4f(disp, 0.0, 0.0, 1.0); } - else if (ch == 1u) { return vec4f(0.0, disp, 0.0, 1.0); } - else if (ch == 2u) { return vec4f(0.0, 0.0, disp, 1.0); } - return vec4f(disp, disp, disp, 1.0); } diff --git a/cnn_v3/src/gbuffer_effect.cc b/cnn_v3/src/gbuffer_effect.cc index 750188f..06e5e66 100644 --- a/cnn_v3/src/gbuffer_effect.cc +++ b/cnn_v3/src/gbuffer_effect.cc @@ -14,6 +14,7 @@ // For standalone use outside the asset system, the caller must ensure the WGSL // source strings are available. They are declared here as weak-linkable externs. 
extern const char* gbuf_raster_wgsl; +extern const char* gbuf_shadow_wgsl; extern const char* gbuf_pack_wgsl; // Maximum number of objects the G-buffer supports per frame. @@ -41,53 +42,6 @@ struct GBufGlobalUniforms { static_assert(sizeof(GBufGlobalUniforms) == sizeof(float) * 44, "GBufGlobalUniforms must be 176 bytes"); -// Helper: create a 1×1 placeholder texture of a given format cleared to `value`. -static WGPUTexture create_placeholder_tex(WGPUDevice device, - WGPUTextureFormat format, - float value) { - WGPUTextureDescriptor desc = {}; - desc.usage = (WGPUTextureUsage)(WGPUTextureUsage_TextureBinding | - WGPUTextureUsage_CopyDst); - desc.dimension = WGPUTextureDimension_2D; - desc.size = {1, 1, 1}; - desc.format = format; - desc.mipLevelCount = 1; - desc.sampleCount = 1; - WGPUTexture tex = wgpuDeviceCreateTexture(device, &desc); - return tex; -} - -// Helper: write a single RGBA float pixel to a texture via queue. -static void write_placeholder_pixel(WGPUQueue queue, WGPUTexture tex, - float r, float g, float b, float a) { - const float data[4] = {r, g, b, a}; - WGPUTexelCopyTextureInfo dst = {}; - dst.texture = tex; - dst.mipLevel = 0; - dst.origin = {0, 0, 0}; - dst.aspect = WGPUTextureAspect_All; - - WGPUTexelCopyBufferLayout layout = {}; - layout.offset = 0; - layout.bytesPerRow = 16; // 4 × sizeof(float) - layout.rowsPerImage = 1; - - const WGPUExtent3D extent = {1, 1, 1}; - wgpuQueueWriteTexture(queue, &dst, data, sizeof(data), &layout, &extent); -} - -// Create bilinear sampler. 
-static WGPUSampler create_bilinear_sampler(WGPUDevice device) { - WGPUSamplerDescriptor desc = {}; - desc.addressModeU = WGPUAddressMode_ClampToEdge; - desc.addressModeV = WGPUAddressMode_ClampToEdge; - desc.magFilter = WGPUFilterMode_Linear; - desc.minFilter = WGPUFilterMode_Linear; - desc.mipmapFilter = WGPUMipmapFilterMode_Linear; - desc.maxAnisotropy = 1; - return wgpuDeviceCreateSampler(device, &desc); -} - // ---- GBufferEffect ---- GBufferEffect::GBufferEffect(const GpuContext& ctx, @@ -104,9 +58,6 @@ GBufferEffect::GBufferEffect(const GpuContext& ctx, node_depth_ = prefix + "_depth"; node_shadow_ = prefix + "_shadow"; node_transp_ = prefix + "_transp"; - node_feat0_ = outputs.size() > 0 ? outputs[0] : prefix + "_feat0"; - node_feat1_ = outputs.size() > 1 ? outputs[1] : prefix + "_feat1"; - // Allocate GPU buffers for scene data. global_uniforms_buf_ = gpu_create_buffer(ctx_.device, sizeof(GBufGlobalUniforms), @@ -116,33 +67,12 @@ GBufferEffect::GBufferEffect(const GpuContext& ctx, // Resolution uniform for pack shader. pack_res_uniform_.init(ctx_.device); + lights_uniform_.init(ctx_.device); - // Placeholder shadow (1.0 = fully lit) and transp (0.0 = opaque) textures. 
- shadow_placeholder_tex_.set( - create_placeholder_tex(ctx_.device, WGPUTextureFormat_RGBA32Float, 1.0f)); - write_placeholder_pixel(ctx_.queue, - shadow_placeholder_tex_.get(), 1.0f, 0.0f, 0.0f, 1.0f); - - transp_placeholder_tex_.set( - create_placeholder_tex(ctx_.device, WGPUTextureFormat_RGBA32Float, 0.0f)); - write_placeholder_pixel(ctx_.queue, - transp_placeholder_tex_.get(), 0.0f, 0.0f, 0.0f, 1.0f); - - WGPUTextureViewDescriptor vd = {}; - vd.format = WGPUTextureFormat_RGBA32Float; - vd.dimension = WGPUTextureViewDimension_2D; - vd.baseMipLevel = 0; - vd.mipLevelCount = 1; - vd.baseArrayLayer = 0; - vd.arrayLayerCount = 1; - vd.aspect = WGPUTextureAspect_All; - - shadow_placeholder_view_.set( - wgpuTextureCreateView(shadow_placeholder_tex_.get(), &vd)); - transp_placeholder_view_.set( - wgpuTextureCreateView(transp_placeholder_tex_.get(), &vd)); + create_linear_sampler(); create_raster_pipeline(); + create_shadow_pipeline(); create_pack_pipeline(); } @@ -154,27 +84,137 @@ void GBufferEffect::declare_nodes(NodeRegistry& registry) { registry.declare_node(node_transp_, NodeType::GBUF_R8, -1, -1); // feat_tex0 / feat_tex1 are the declared output_nodes_ — they get registered // by the sequence infrastructure; declare them here as well if not already. - if (!registry.has_node(node_feat0_)) { - registry.declare_node(node_feat0_, NodeType::GBUF_RGBA32UINT, -1, -1); + if (!registry.has_node(output_nodes_[0])) { + registry.declare_node(output_nodes_[0], NodeType::GBUF_RGBA32UINT, -1, -1); + } + if (!registry.has_node(output_nodes_[1])) { + registry.declare_node(output_nodes_[1], NodeType::GBUF_RGBA32UINT, -1, -1); } - if (!registry.has_node(node_feat1_)) { - registry.declare_node(node_feat1_, NodeType::GBUF_RGBA32UINT, -1, -1); +} + +void GBufferEffect::set_scene() { + scene_.clear(); + cube_anims_.clear(); + sphere_anims_.clear(); + + // Deterministic pseudo-random (xorshift32). 
+ uint32_t seed = 0xBEEF1234u; + auto rnd = [&]() -> float { + seed ^= seed << 13; + seed ^= seed >> 17; + seed ^= seed << 5; + return (float)(seed >> 8) / 16777216.0f; // [0, 1) + }; + auto rrange = [&](float lo, float hi) { return lo + rnd() * (hi - lo); }; + + // 20 small cubes scattered in a [-2,2]×[-1.5,1.5]×[-1.5,1.5] volume. + static const int kNumCubes = 20; + for (int i = 0; i < kNumCubes; ++i) { + Object3D obj(ObjectType::CUBE); + obj.position = vec3(rrange(-2.0f, 2.0f), + rrange(-1.5f, 1.5f), + rrange(-1.5f, 1.5f)); + const float s = rrange(0.10f, 0.25f); + obj.scale = vec3(s, s, s); + obj.color = vec4(rrange(0.4f, 1.0f), + rrange(0.4f, 1.0f), + rrange(0.4f, 1.0f), 1.0f); + + // Random rotation axis (avoid degenerate zero-length axis). + vec3 axis = vec3(rrange(-1.0f, 1.0f), + rrange(-1.0f, 1.0f), + rrange(-1.0f, 1.0f)); + if (axis.len() < 0.01f) axis = vec3(0.0f, 1.0f, 0.0f); + axis = axis.normalize(); + const float speed = rrange(0.3f, 1.5f) * (rnd() > 0.5f ? 1.0f : -1.0f); + + scene_.add_object(obj); + cube_anims_.push_back({axis, speed}); } + + // 4 pumping spheres at fixed positions; radius modulated by audio_intensity. + static const vec3 kSpherePos[4] = { + { 0.0f, 0.0f, 0.0f}, + { 1.5f, 0.5f, -0.5f}, + {-1.5f, -0.5f, 0.5f}, + { 0.0f, 1.0f, 1.0f}, + }; + static const float kBaseSphereRadius[4] = {0.35f, 0.28f, 0.30f, 0.25f}; + for (int i = 0; i < 4; ++i) { + Object3D obj(ObjectType::SPHERE); + obj.position = kSpherePos[i]; + const float r = kBaseSphereRadius[i]; + obj.scale = vec3(r, r, r); + obj.color = vec4(0.85f, 0.60f, 0.95f, 1.0f); + const int idx = (int)scene_.objects.size(); + scene_.add_object(obj); + sphere_anims_.push_back({idx, r}); + } + + // Camera: above and in front of the scene, looking at origin. 
+ camera_.set_look_at(vec3(0.0f, 2.5f, 6.0f), + vec3(0.0f, 0.0f, 0.0f), + vec3(0.0f, 1.0f, 0.0f)); + camera_.fov_y_rad = 0.7854f; // 45° + camera_.near_plane = 0.1f; + camera_.far_plane = 20.0f; + // aspect_ratio is updated each frame from params.resolution. + + scene_ready_ = true; } -void GBufferEffect::set_scene(const Scene* scene, const Camera* camera) { - scene_ = scene; - camera_ = camera; +static void clear_r8_node(WGPUCommandEncoder encoder, WGPUTextureView view, + float value) { + WGPURenderPassColorAttachment att = {}; + att.view = view; + att.loadOp = WGPULoadOp_Clear; + att.storeOp = WGPUStoreOp_Store; + att.clearValue = {value, value, value, value}; + att.depthSlice = WGPU_DEPTH_SLICE_UNDEFINED; + WGPURenderPassDescriptor pd = {}; + pd.colorAttachmentCount = 1; + pd.colorAttachments = &att; + WGPURenderPassEncoder p = wgpuCommandEncoderBeginRenderPass(encoder, &pd); + wgpuRenderPassEncoderEnd(p); + wgpuRenderPassEncoderRelease(p); } void GBufferEffect::render(WGPUCommandEncoder encoder, const UniformsSequenceParams& params, NodeRegistry& nodes) { - if (!scene_ || !camera_) { + if (!scene_ready_) { return; } - upload_scene_data(*scene_, *camera_, params.time); + // Update camera aspect ratio from current resolution. + camera_.aspect_ratio = params.aspect_ratio; + + // Animate cubes: axis-angle rotation driven by physical time. + for (int i = 0; i < (int)cube_anims_.size(); ++i) { + const CubeAnim& a = cube_anims_[(size_t)i]; + scene_.objects[(size_t)i].rotation = + quat::from_axis(a.axis, params.time * a.speed); + } + // Pump spheres: scale with audio_intensity. + for (const SphereAnim& a : sphere_anims_) { + const float r = a.base_radius * (1.0f + params.audio_intensity * 0.8f); + scene_.objects[(size_t)a.obj_idx].scale = vec3(r, r, r); + } + + // Upload two directional lights. + { + GBufLightsUniforms lu = {}; + lu.params = vec4(2.0f, 0.0f, 0.0f, 0.0f); + // Key: warm sun, upper-right-front. 
+ lu.lights[0].direction = vec4(0.408f, 0.816f, 0.408f, 0.0f); // norm(1,2,1) + lu.lights[0].color = vec4(1.00f, 0.92f, 0.78f, 1.0f); + // Fill: cool sky, upper-left-back. + lu.lights[1].direction = vec4(-0.577f, 0.577f, -0.577f, 0.0f); // norm(-1,1,-1) + lu.lights[1].color = vec4(0.40f, 0.45f, 0.80f, 0.4f); + lights_uniform_.update(ctx_.queue, lu); + } + + upload_scene_data(scene_, camera_, params.time); // Update resolution uniform for pack shader. GBufResUniforms res_uni; @@ -186,8 +226,8 @@ void GBufferEffect::render(WGPUCommandEncoder encoder, WGPUTextureView albedo_view = nodes.get_view(node_albedo_); WGPUTextureView normal_mat_view = nodes.get_view(node_normal_mat_); WGPUTextureView depth_view = nodes.get_view(node_depth_); - WGPUTextureView feat0_view = nodes.get_view(node_feat0_); - WGPUTextureView feat1_view = nodes.get_view(node_feat1_); + WGPUTextureView feat0_view = nodes.get_view(output_nodes_[0]); + WGPUTextureView feat1_view = nodes.get_view(output_nodes_[1]); // prev_cnn: first input node if available, else dummy. WGPUTextureView prev_view = nullptr; @@ -228,8 +268,8 @@ void GBufferEffect::render(WGPUCommandEncoder encoder, raster_pass_desc.depthStencilAttachment = &depth_attachment; const int num_objects = - (int)(scene_->objects.size() < (size_t)kGBufMaxObjects - ? scene_->objects.size() + (int)(scene_.objects.size() < (size_t)kGBufMaxObjects + ? 
scene_.objects.size() : (size_t)kGBufMaxObjects); if (num_objects > 0 && raster_pipeline_.get() != nullptr) { @@ -250,13 +290,66 @@ void GBufferEffect::render(WGPUCommandEncoder encoder, wgpuRenderPassEncoderRelease(raster_pass); } - // Pass 2: SDF raymarching — TODO (placeholder: shadow=1, transp=0 already set) - // Pass 3: Lighting/shadow — TODO + // --- Pass 2: SDF shadow raymarching --- + if (shadow_pipeline_.get() != nullptr) { + WGPUBindGroupEntry shadow_entries[4] = {}; + shadow_entries[0].binding = 0; + shadow_entries[0].buffer = global_uniforms_buf_.buffer; + shadow_entries[0].size = sizeof(GBufGlobalUniforms); + + shadow_entries[1].binding = 1; + shadow_entries[1].buffer = objects_buf_.buffer; + shadow_entries[1].size = (size_t)objects_buf_capacity_ * sizeof(GBufObjectData); + + shadow_entries[2].binding = 2; + shadow_entries[2].textureView = depth_view; + + shadow_entries[3].binding = 3; + shadow_entries[3].buffer = lights_uniform_.get().buffer; + shadow_entries[3].size = sizeof(GBufLightsUniforms); + + WGPUBindGroupLayout shadow_bgl = + wgpuRenderPipelineGetBindGroupLayout(shadow_pipeline_.get(), 0); + + WGPUBindGroupDescriptor shadow_bg_desc = {}; + shadow_bg_desc.layout = shadow_bgl; + shadow_bg_desc.entryCount = 4; + shadow_bg_desc.entries = shadow_entries; + + WGPUBindGroup shadow_bg = + wgpuDeviceCreateBindGroup(ctx_.device, &shadow_bg_desc); + wgpuBindGroupLayoutRelease(shadow_bgl); + + WGPURenderPassColorAttachment shadow_att = {}; + shadow_att.view = nodes.get_view(node_shadow_); + shadow_att.loadOp = WGPULoadOp_Clear; + shadow_att.storeOp = WGPUStoreOp_Store; + shadow_att.clearValue = {1.0f, 1.0f, 1.0f, 1.0f}; + shadow_att.depthSlice = WGPU_DEPTH_SLICE_UNDEFINED; + + WGPURenderPassDescriptor shadow_pass_desc = {}; + shadow_pass_desc.colorAttachmentCount = 1; + shadow_pass_desc.colorAttachments = &shadow_att; + + WGPURenderPassEncoder shadow_pass = + wgpuCommandEncoderBeginRenderPass(encoder, &shadow_pass_desc); + 
wgpuRenderPassEncoderSetPipeline(shadow_pass, shadow_pipeline_.get()); + wgpuRenderPassEncoderSetBindGroup(shadow_pass, 0, shadow_bg, 0, nullptr); + wgpuRenderPassEncoderDraw(shadow_pass, 3, 1, 0, 0); + wgpuRenderPassEncoderEnd(shadow_pass); + wgpuRenderPassEncoderRelease(shadow_pass); + wgpuBindGroupRelease(shadow_bg); + } else { + // Fallback: clear to 1.0 (fully lit) if pipeline not ready. + clear_r8_node(encoder, nodes.get_view(node_shadow_), 1.0f); + } + + // Pass 3: Transparency — TODO (deferred; opaque scenes only) + clear_r8_node(encoder, nodes.get_view(node_transp_), 0.0f); // --- Pass 4: Pack compute --- // Rebuild pack bind group with current node views. - // Construct a temporary bilinear sampler for this pass. - WGPUSampler bilinear = create_bilinear_sampler(ctx_.device); + WGPUSampler bilinear = sampler_.get(); // Get texture views from nodes. // shadow / transp are GBUF_R8 nodes; use their views. @@ -320,7 +413,7 @@ void GBufferEffect::render(WGPUCommandEncoder encoder, wgpuComputePassEncoderRelease(compute_pass); wgpuBindGroupRelease(pack_bg); - wgpuSamplerRelease(bilinear); + // bilinear is owned by sampler_ — no release here. 
} // ---- private helpers ---- @@ -373,7 +466,7 @@ void GBufferEffect::upload_scene_data(const Scene& scene, d.model = m; d.inv_model = m.inverse(); d.color = obj.color; - d.params = vec4(0.0f, 0.0f, 0.0f, 0.0f); + d.params = vec4((float)(int)obj.type, 0.0f, 0.0f, 0.0f); obj_data.push_back(d); } wgpuQueueWriteBuffer(ctx_.queue, objects_buf_.buffer, 0, @@ -392,7 +485,7 @@ void GBufferEffect::create_raster_pipeline() { } const std::string composed = - ShaderComposer::Get().Compose({"common_uniforms"}, src); + ShaderComposer::Get().Compose({}, src); WGPUShaderSourceWGSL wgsl_src = {}; wgsl_src.chain.sType = WGPUSType_ShaderSourceWGSL; @@ -462,6 +555,86 @@ void GBufferEffect::create_raster_pipeline() { wgpuShaderModuleRelease(shader); } +void GBufferEffect::create_shadow_pipeline() { + HEADLESS_RETURN_IF_NULL(ctx_.device); + + const char* src = gbuf_shadow_wgsl; + if (!src) { + return; + } + + const std::string composed = ShaderComposer::Get().Compose({}, src); + + WGPUShaderSourceWGSL wgsl_src = {}; + wgsl_src.chain.sType = WGPUSType_ShaderSourceWGSL; + wgsl_src.code = str_view(composed.c_str()); + + WGPUShaderModuleDescriptor shader_desc = {}; + shader_desc.nextInChain = &wgsl_src.chain; + WGPUShaderModule shader = wgpuDeviceCreateShaderModule(ctx_.device, &shader_desc); + + // BGL: B0=GlobalUniforms, B1=ObjectsBuffer, B2=texture_depth_2d, B3=GBufLightsUniforms + WGPUBindGroupLayoutEntry bgl_entries[4] = {}; + + bgl_entries[0].binding = 0; + bgl_entries[0].visibility = + (WGPUShaderStage)(WGPUShaderStage_Vertex | WGPUShaderStage_Fragment); + bgl_entries[0].buffer.type = WGPUBufferBindingType_Uniform; + bgl_entries[0].buffer.minBindingSize = sizeof(GBufGlobalUniforms); + + bgl_entries[1].binding = 1; + bgl_entries[1].visibility = WGPUShaderStage_Fragment; + bgl_entries[1].buffer.type = WGPUBufferBindingType_ReadOnlyStorage; + bgl_entries[1].buffer.minBindingSize = sizeof(GBufObjectData); + + bgl_entries[2].binding = 2; + bgl_entries[2].visibility = 
WGPUShaderStage_Fragment; + bgl_entries[2].texture.sampleType = WGPUTextureSampleType_Depth; + bgl_entries[2].texture.viewDimension = WGPUTextureViewDimension_2D; + + bgl_entries[3].binding = 3; + bgl_entries[3].visibility = WGPUShaderStage_Fragment; + bgl_entries[3].buffer.type = WGPUBufferBindingType_Uniform; + bgl_entries[3].buffer.minBindingSize = sizeof(GBufLightsUniforms); + + WGPUBindGroupLayoutDescriptor bgl_desc = {}; + bgl_desc.entryCount = 4; + bgl_desc.entries = bgl_entries; + WGPUBindGroupLayout bgl = wgpuDeviceCreateBindGroupLayout(ctx_.device, &bgl_desc); + + WGPUPipelineLayoutDescriptor pl_desc = {}; + pl_desc.bindGroupLayoutCount = 1; + pl_desc.bindGroupLayouts = &bgl; + WGPUPipelineLayout pl = wgpuDeviceCreatePipelineLayout(ctx_.device, &pl_desc); + + // Color target: RGBA8Unorm (NodeType::GBUF_R8) + WGPUColorTargetState color_target = {}; + color_target.format = WGPUTextureFormat_RGBA8Unorm; + color_target.writeMask = WGPUColorWriteMask_All; + + WGPUFragmentState frag = {}; + frag.module = shader; + frag.entryPoint = str_view("fs_main"); + frag.targetCount = 1; + frag.targets = &color_target; + + WGPURenderPipelineDescriptor pipe_desc = {}; + pipe_desc.layout = pl; + pipe_desc.vertex.module = shader; + pipe_desc.vertex.entryPoint = str_view("vs_main"); + pipe_desc.fragment = &frag; + pipe_desc.primitive.topology = WGPUPrimitiveTopology_TriangleList; + pipe_desc.primitive.cullMode = WGPUCullMode_None; + pipe_desc.multisample.count = 1; + pipe_desc.multisample.mask = 0xFFFFFFFF; + + shadow_pipeline_.set(wgpuDeviceCreateRenderPipeline(ctx_.device, &pipe_desc)); + + wgpuPipelineLayoutRelease(pl); + wgpuBindGroupLayoutRelease(bgl); + wgpuShaderModuleRelease(shader); +} + void GBufferEffect::create_pack_pipeline() { HEADLESS_RETURN_IF_NULL(ctx_.device); @@ -596,7 +769,3 @@ void GBufferEffect::update_raster_bind_group(NodeRegistry& nodes) { wgpuBindGroupLayoutRelease(bgl); } -void GBufferEffect::update_pack_bind_group(NodeRegistry& nodes) { - 
   (void)nodes;
-  // Pack bind group is rebuilt inline in render() to use current node views.
-}
diff --git a/cnn_v3/src/gbuffer_effect.h b/cnn_v3/src/gbuffer_effect.h
index 42fb0ec..13d394d 100644
--- a/cnn_v3/src/gbuffer_effect.h
+++ b/cnn_v3/src/gbuffer_effect.h
@@ -10,6 +10,7 @@
 #include "gpu/uniform_helper.h"
 #include "gpu/wgpu_resource.h"
 #include "util/mini_math.h"
+#include <vector>
 
 // Uniform for the pack compute shader
 struct GBufResUniforms {
@@ -20,6 +21,20 @@ struct GBufResUniforms {
 static_assert(sizeof(GBufResUniforms) == 16,
               "GBufResUniforms must be 16 bytes");
 
+// Single directional light: direction points *toward* the light source (world space).
+struct GBufLight {
+  vec4 direction;  // xyz = normalized direction toward light, w = unused
+  vec4 color;      // rgb = color, a = intensity
+};
+static_assert(sizeof(GBufLight) == 32, "GBufLight must be 32 bytes");
+
+struct GBufLightsUniforms {
+  GBufLight lights[2];
+  vec4 params;  // x = num_lights
+};
+static_assert(sizeof(GBufLightsUniforms) == 80,
+              "GBufLightsUniforms must be 80 bytes");
+
 class GBufferEffect : public Effect {
  public:
   GBufferEffect(const GpuContext& ctx, const std::vector<std::string>& inputs,
@@ -31,47 +46,59 @@ class GBufferEffect : public Effect {
   void render(WGPUCommandEncoder encoder, const UniformsSequenceParams& params,
               NodeRegistry& nodes) override;
 
-  void set_scene(const Scene* scene, const Camera* camera);
+  // Populate the internal scene with ~20 rotating cubes and a few pumping
+  // spheres. Must be called once before the first render().
+  void set_scene();
 
  private:
+  // Per-cube animation state (axis-angle rotation)
+  struct CubeAnim {
+    vec3 axis;
+    float speed;  // radians/second, may be negative
+  };
+  // Per-sphere animation state (radius driven by audio_intensity)
+  struct SphereAnim {
+    int obj_idx;  // index into scene_.objects
+    float base_radius;
+  };
+
   // Internal G-buffer node names
   std::string node_albedo_;
   std::string node_normal_mat_;
   std::string node_depth_;
   std::string node_shadow_;
   std::string node_transp_;
-  std::string node_feat0_;
-  std::string node_feat1_;
 
-  const Scene* scene_ = nullptr;
-  const Camera* camera_ = nullptr;
+  // Owned scene and camera — populated by set_scene()
+  Scene scene_;
+  Camera camera_;
+  bool scene_ready_ = false;
+
+  std::vector<CubeAnim> cube_anims_;
+  std::vector<SphereAnim> sphere_anims_;
 
   // Pass 1: MRT rasterization pipeline
   RenderPipeline raster_pipeline_;
   BindGroup raster_bind_group_;
 
+  // Pass 2: SDF shadow pipeline
+  RenderPipeline shadow_pipeline_;
+
   // Pass 4: Pack compute pipeline
   ComputePipeline pack_pipeline_;
-  BindGroup pack_bind_group_;
-  UniformBuffer<GBufResUniforms> pack_res_uniform_;
-
-  // Placeholder textures for shadow/transp (white/black cleared once)
-  Texture shadow_placeholder_tex_;
-  TextureView shadow_placeholder_view_;
-  Texture transp_placeholder_tex_;
-  TextureView transp_placeholder_view_;
+  UniformBuffer<GBufResUniforms> pack_res_uniform_;
+  UniformBuffer<GBufLightsUniforms> lights_uniform_;
 
   // GPU-side object data buffers (global uniforms + objects storage)
-  // These mirror the layout expected by gbuf_raster.wgsl
   GpuBuffer global_uniforms_buf_;
   GpuBuffer objects_buf_;
-  int objects_buf_capacity_ = 0;  // number of ObjectData slots allocated
+  int objects_buf_capacity_ = 0;
 
   void create_raster_pipeline();
+  void create_shadow_pipeline();
   void create_pack_pipeline();
   void update_raster_bind_group(NodeRegistry& nodes);
-  void update_pack_bind_group(NodeRegistry& nodes);
   void upload_scene_data(const Scene& scene, const Camera& camera, float time);
diff --git a/cnn_v3/tools/shaders.js b/cnn_v3/tools/shaders.js
index a1b2929..f178637 100644
--- a/cnn_v3/tools/shaders.js
+++ b/cnn_v3/tools/shaders.js
@@ -223,32 +223,40 @@ const DISP_SHADER=`
 }`;
 
 // Viz f32: show one channel of rgba16float layer
+// Uniform layout: ch(u32) _p(u32) ox(i32) oy(i32) — 16 bytes
+// ox/oy = texel offset (top-left of view); 0,0 for full-texture vignettes.
 const VIZ_F32=`
+struct Vu{ch:u32,_p:u32,ox:i32,oy:i32}
 @group(0) @binding(0) var t:texture_2d<f32>;
-@group(0) @binding(1) var<uniform> ch:u32;
+@group(0) @binding(1) var<uniform> u:Vu;
 @vertex fn vs(@builtin(vertex_index) i:u32)->@builtin(position) vec4f{
   var p=array<vec2f,6>(vec2f(-1.,-1.),vec2f(1.,-1.),vec2f(-1.,1.),vec2f(-1.,1.),vec2f(1.,-1.),vec2f(1.,1.));
   return vec4f(p[i],0.,1.);
 }
 @fragment fn fs(@builtin(position) pos:vec4f)->@location(0) vec4f{
-  let v=textureLoad(t,vec2i(pos.xy),0); var a=array<f32,4>(v.x,v.y,v.z,v.w);
-  let x=clamp(a[min(ch,3u)],0.,1.); return vec4f(x,x,x,1.);
+  let dim=vec2i(textureDimensions(t));
+  let tc=clamp(vec2i(i32(pos.x)+u.ox,i32(pos.y)+u.oy),vec2i(0),dim-vec2i(1));
+  let v=textureLoad(t,tc,0); var a=array<f32,4>(v.x,v.y,v.z,v.w);
+  let x=clamp(a[min(u.ch,3u)],0.,1.); return vec4f(x,x,x,1.);
 }`;
 
 // Viz u32: show one f16 channel of rgba32uint layer (8 channels packed)
 const VIZ_U32=`
+struct Vu{ch:u32,_p:u32,ox:i32,oy:i32}
 @group(0) @binding(0) var t:texture_2d<u32>;
-@group(0) @binding(1) var<uniform> ch:u32;
+@group(0) @binding(1) var<uniform> u:Vu;
 @vertex fn vs(@builtin(vertex_index) i:u32)->@builtin(position) vec4f{
   var p=array<vec2f,6>(vec2f(-1.,-1.),vec2f(1.,-1.),vec2f(-1.,1.),vec2f(-1.,1.),vec2f(1.,-1.),vec2f(1.,1.));
   return vec4f(p[i],0.,1.);
 }
 @fragment fn fs(@builtin(position) pos:vec4f)->@location(0) vec4f{
-  let t2=textureLoad(t,vec2i(pos.xy),0);
+  let dim=vec2i(textureDimensions(t));
+  let tc=clamp(vec2i(i32(pos.x)+u.ox,i32(pos.y)+u.oy),vec2i(0),dim-vec2i(1));
+  let t2=textureLoad(t,tc,0);
   let a=unpack2x16float(t2.x);let b=unpack2x16float(t2.y);
   let c=unpack2x16float(t2.z);let d=unpack2x16float(t2.w);
   var v=array<f32,8>(a.x,a.y,b.x,b.y,c.x,c.y,d.x,d.y);
-  let x=clamp(v[min(ch,7u)],0.,1.); return vec4f(x,x,x,1.);
+  let x=clamp(v[min(u.ch,7u)],0.,1.); return vec4f(x,x,x,1.);
 }`;
 
 // Full G-buffer pack: assembles feat_tex0/feat_tex1 from individual G-buffer images.
diff --git a/cnn_v3/tools/tester.js b/cnn_v3/tools/tester.js
index dbc7414..0412cae 100644
--- a/cnn_v3/tools/tester.js
+++ b/cnn_v3/tools/tester.js
@@ -465,7 +465,12 @@ class CNNv3Tester {
       const cvs=document.createElement('canvas');
       const name=chName(c); cvs.title=name;
-      cvs.onclick=()=>tester.zoomChannel(id,c,name);
+      cvs.onclick=(e)=>{
+        const r=cvs.getBoundingClientRect();
+        const tx=Math.round(e.offsetX/r.width*tex.width);
+        const ty=Math.round(e.offsetY/r.height*tex.height);
+        tester.zoomChannel(id,c,name,tx,ty);
+      };
       cell.appendChild(lbl); cell.appendChild(cvs); grid.appendChild(cell);
     }
     const pl=def.t==='f32'?this.getVizF32():this.getVizU32();
@@ -474,8 +479,8 @@ class CNNv3Tester {
      cvs.width=tex.width; cvs.height=tex.height;
      const ctx=cvs.getContext('webgpu'); if(!ctx)continue;
      try{ctx.configure({device:this.device,format:this.format});}catch(_){continue;}
-     const chBuf=this.device.createBuffer({size:4,usage:GPUBufferUsage.UNIFORM|GPUBufferUsage.COPY_DST});
-     this.device.queue.writeBuffer(chBuf,0,new Uint32Array([c]));
+     const chBuf=this.device.createBuffer({size:16,usage:GPUBufferUsage.UNIFORM|GPUBufferUsage.COPY_DST});
+     this.device.queue.writeBuffer(chBuf,0,new Int32Array([c,0,0,0]));
      const bg=this.device.createBindGroup({layout:pl.getBindGroupLayout(0),
        entries:[{binding:0,resource:tex.createView()},{binding:1,resource:{buffer:chBuf}}]});
      const enc=this.device.createCommandEncoder();
@@ -487,13 +492,13 @@ class CNNv3Tester {
     await this.device.queue.onSubmittedWorkDone();
   }
 
-  zoomChannel(layerId, ch, label) {
+  zoomChannel(layerId, ch, label, clickTx=0, clickTy=0) {
     const def = this.vizDefs?.find(d => d.id === layerId);
     const tex = this.layerTextures[layerId];
     if (!def || !tex || !this.device) return;
     const wrap = document.getElementById('chzoomWrap');
     const lbl = document.getElementById('chzoomLbl');
-    this.activeZoom = {layerId, ch, label};
+    this.activeZoom = {layerId, ch, label, clickTx, clickTy};
     lbl.textContent = label;
     wrap.style.display = 'flex';
     // Wait for layout so clientWidth/clientHeight reflect the flex-distributed size
@@ -506,12 +511,17 @@ class CNNv3Tester {
     const scale = Math.min(1, availW / tex.width, availH / tex.height);
     dst.width = Math.round(tex.width * scale);
     dst.height = Math.round(tex.height * scale);
-    // Re-render via WebGPU (WebGPU canvas pixels are not readable by drawImage)
+    // Re-render via WebGPU centered on the clicked texel
+    const ox = clickTx - Math.floor(dst.width / 2);
+    const oy = clickTy - Math.floor(dst.height / 2);
     const pl = def.t === 'f32' ? this.getVizF32() : this.getVizU32();
     const ctx = dst.getContext('webgpu');
     try { ctx.configure({device: this.device, format: this.format}); } catch(_) { return; }
-    const chBuf = this.device.createBuffer({size:4, usage:GPUBufferUsage.UNIFORM|GPUBufferUsage.COPY_DST});
-    this.device.queue.writeBuffer(chBuf, 0, new Uint32Array([ch]));
+    const uData = new ArrayBuffer(16);
+    const dv = new DataView(uData);
+    dv.setUint32(0, ch, true); dv.setInt32(8, ox, true); dv.setInt32(12, oy, true);
+    const chBuf = this.device.createBuffer({size:16, usage:GPUBufferUsage.UNIFORM|GPUBufferUsage.COPY_DST});
+    this.device.queue.writeBuffer(chBuf, 0, uData);
     const bg = this.device.createBindGroup({layout: pl.getBindGroupLayout(0),
       entries:[{binding:0, resource:tex.createView()}, {binding:1, resource:{buffer:chBuf}}]});
     const enc = this.device.createCommandEncoder();
@@ -525,8 +535,8 @@ class CNNv3Tester {
 
   refreshZoom() {
     if (this.activeZoom) {
-      const {layerId, ch, label} = this.activeZoom;
-      this.zoomChannel(layerId, ch, label);
+      const {layerId, ch, label, clickTx, clickTy} = this.activeZoom;
+      this.zoomChannel(layerId, ch, label, clickTx, clickTy);
     }
   }
diff --git a/cnn_v3/training/dataset/full/0001/albedo.png b/cnn_v3/training/dataset/full/0001/albedo.png
Binary files differ
new file mode 100644
index 0000000..8f64b38
--- /dev/null
+++ b/cnn_v3/training/dataset/full/0001/albedo.png
diff --git a/cnn_v3/training/dataset/full/0001/depth.png b/cnn_v3/training/dataset/full/0001/depth.png
Binary files differ
new file mode 100644
index 0000000..c58fcd9
--- /dev/null
+++ b/cnn_v3/training/dataset/full/0001/depth.png
diff --git a/cnn_v3/training/dataset/full/0001/matid.png b/cnn_v3/training/dataset/full/0001/matid.png
Binary files differ
new file mode 100644
index 0000000..b4fa98f
--- /dev/null
+++ b/cnn_v3/training/dataset/full/0001/matid.png
diff --git a/cnn_v3/training/dataset/full/0001/normal.png b/cnn_v3/training/dataset/full/0001/normal.png
Binary files differ
new file mode 100644
index 0000000..62f26e3
--- /dev/null
+++ b/cnn_v3/training/dataset/full/0001/normal.png
diff --git a/cnn_v3/training/dataset/full/0001/shadow.png b/cnn_v3/training/dataset/full/0001/shadow.png
Binary files differ
new file mode 100644
index 0000000..0471e7f
--- /dev/null
+++ b/cnn_v3/training/dataset/full/0001/shadow.png
diff --git a/cnn_v3/training/dataset/full/0001/target.png b/cnn_v3/training/dataset/full/0001/target.png
Binary files differ
new file mode 100644
index 0000000..587d54a
--- /dev/null
+++ b/cnn_v3/training/dataset/full/0001/target.png
diff --git a/cnn_v3/training/dataset/full/0001/transp.png b/cnn_v3/training/dataset/full/0001/transp.png
Binary files differ
new file mode 100644
index 0000000..b4fa98f
--- /dev/null
+++ b/cnn_v3/training/dataset/full/0001/transp.png
diff --git a/cnn_v3/training/dataset/full/0002/albedo.png b/cnn_v3/training/dataset/full/0002/albedo.png
Binary files differ
new file mode 100644
index 0000000..8f64b38
--- /dev/null
+++ b/cnn_v3/training/dataset/full/0002/albedo.png
diff --git a/cnn_v3/training/dataset/full/0002/depth.png b/cnn_v3/training/dataset/full/0002/depth.png
Binary files differ
new file mode 100644
index 0000000..c58fcd9
--- /dev/null
+++ b/cnn_v3/training/dataset/full/0002/depth.png
diff --git a/cnn_v3/training/dataset/full/0002/matid.png b/cnn_v3/training/dataset/full/0002/matid.png
Binary files differ
new file mode 100644
index 0000000..b4fa98f
--- /dev/null
+++ b/cnn_v3/training/dataset/full/0002/matid.png
diff --git a/cnn_v3/training/dataset/full/0002/normal.png b/cnn_v3/training/dataset/full/0002/normal.png
Binary files differ
new file mode 100644
index 0000000..62f26e3
--- /dev/null
+++ b/cnn_v3/training/dataset/full/0002/normal.png
diff --git a/cnn_v3/training/dataset/full/0002/shadow.png b/cnn_v3/training/dataset/full/0002/shadow.png
Binary files differ
new file mode 100644
index 0000000..0471e7f
--- /dev/null
+++ b/cnn_v3/training/dataset/full/0002/shadow.png
diff --git a/cnn_v3/training/dataset/full/0002/target.png b/cnn_v3/training/dataset/full/0002/target.png
Binary files differ
new file mode 100644
index 0000000..587d54a
--- /dev/null
+++ b/cnn_v3/training/dataset/full/0002/target.png
diff --git a/cnn_v3/training/dataset/full/0002/transp.png b/cnn_v3/training/dataset/full/0002/transp.png
Binary files differ
new file mode 100644
index 0000000..b4fa98f
--- /dev/null
+++ b/cnn_v3/training/dataset/full/0002/transp.png
diff --git a/cnn_v3/training/dataset/full/0003/albedo.png b/cnn_v3/training/dataset/full/0003/albedo.png
Binary files differ
new file mode 100644
index 0000000..8f64b38
--- /dev/null
+++ b/cnn_v3/training/dataset/full/0003/albedo.png
diff --git a/cnn_v3/training/dataset/full/0003/depth.png b/cnn_v3/training/dataset/full/0003/depth.png
Binary files differ
new file mode 100644
index 0000000..c58fcd9
--- /dev/null
+++ b/cnn_v3/training/dataset/full/0003/depth.png
diff --git a/cnn_v3/training/dataset/full/0003/matid.png b/cnn_v3/training/dataset/full/0003/matid.png
Binary files differ
new file mode 100644
index 0000000..b4fa98f
--- /dev/null
+++ b/cnn_v3/training/dataset/full/0003/matid.png
diff --git a/cnn_v3/training/dataset/full/0003/normal.png b/cnn_v3/training/dataset/full/0003/normal.png
Binary files differ
new file mode 100644
index 0000000..62f26e3
--- /dev/null
+++ b/cnn_v3/training/dataset/full/0003/normal.png
diff --git a/cnn_v3/training/dataset/full/0003/shadow.png b/cnn_v3/training/dataset/full/0003/shadow.png
Binary files differ
new file mode 100644
index 0000000..0471e7f
--- /dev/null
+++ b/cnn_v3/training/dataset/full/0003/shadow.png
diff --git a/cnn_v3/training/dataset/full/0003/target.png b/cnn_v3/training/dataset/full/0003/target.png
Binary files differ
new file mode 100644
index 0000000..587d54a
--- /dev/null
+++ b/cnn_v3/training/dataset/full/0003/target.png
diff --git a/cnn_v3/training/dataset/full/0003/transp.png b/cnn_v3/training/dataset/full/0003/transp.png
Binary files differ
new file mode 100644
index 0000000..b4fa98f
--- /dev/null
+++ b/cnn_v3/training/dataset/full/0003/transp.png
diff --git a/cnn_v3/training/dataset/full/0004/albedo.png b/cnn_v3/training/dataset/full/0004/albedo.png
Binary files differ
new file mode 100644
index 0000000..8f64b38
--- /dev/null
+++ b/cnn_v3/training/dataset/full/0004/albedo.png
diff --git a/cnn_v3/training/dataset/full/0004/depth.png b/cnn_v3/training/dataset/full/0004/depth.png
Binary files differ
new file mode 100644
index 0000000..c58fcd9
--- /dev/null
+++ b/cnn_v3/training/dataset/full/0004/depth.png
diff --git a/cnn_v3/training/dataset/full/0004/matid.png b/cnn_v3/training/dataset/full/0004/matid.png
Binary files differ
new file mode 100644
index 0000000..b4fa98f
--- /dev/null
+++ b/cnn_v3/training/dataset/full/0004/matid.png
diff --git a/cnn_v3/training/dataset/full/0004/normal.png b/cnn_v3/training/dataset/full/0004/normal.png
Binary files differ
new file mode 100644
index 0000000..62f26e3
--- /dev/null
+++ b/cnn_v3/training/dataset/full/0004/normal.png
diff --git a/cnn_v3/training/dataset/full/0004/shadow.png b/cnn_v3/training/dataset/full/0004/shadow.png
Binary files differ
new file mode 100644
index 0000000..0471e7f
--- /dev/null
+++ b/cnn_v3/training/dataset/full/0004/shadow.png
diff --git a/cnn_v3/training/dataset/full/0004/target.png b/cnn_v3/training/dataset/full/0004/target.png
Binary files differ
new file mode 100644
index 0000000..587d54a
--- /dev/null
+++ b/cnn_v3/training/dataset/full/0004/target.png
diff --git a/cnn_v3/training/dataset/full/0004/transp.png b/cnn_v3/training/dataset/full/0004/transp.png
Binary files differ
new file mode 100644
index 0000000..b4fa98f
--- /dev/null
+++ b/cnn_v3/training/dataset/full/0004/transp.png
diff --git a/src/effects/shaders.cc b/src/effects/shaders.cc
index 37b7d09..a74d920 100644
--- a/src/effects/shaders.cc
+++ b/src/effects/shaders.cc
@@ -117,6 +117,7 @@ const char* ntsc_rgb_shader_wgsl = SafeGetAsset(AssetId::ASSET_SHADER_NTSC_RGB);
 const char* ntsc_yiq_shader_wgsl = SafeGetAsset(AssetId::ASSET_SHADER_NTSC_YIQ);
 const char* gbuf_raster_wgsl = SafeGetAsset(AssetId::ASSET_SHADER_GBUF_RASTER);
 const char* gbuf_pack_wgsl = SafeGetAsset(AssetId::ASSET_SHADER_GBUF_PACK);
+const char* gbuf_shadow_wgsl = SafeGetAsset(AssetId::ASSET_SHADER_GBUF_SHADOW);
 const char* gbuf_view_wgsl = SafeGetAsset(AssetId::ASSET_SHADER_GBUF_VIEW);
 const char* cnn_v3_enc0_wgsl = SafeGetAsset(AssetId::ASSET_SHADER_CNN_V3_ENC0);
 const char* cnn_v3_enc1_wgsl = SafeGetAsset(AssetId::ASSET_SHADER_CNN_V3_ENC1);
diff --git a/src/effects/shaders.h b/src/effects/shaders.h
index de5dda4..1664706 100644
--- a/src/effects/shaders.h
+++ b/src/effects/shaders.h
@@ -23,6 +23,7 @@ extern const char* ntsc_yiq_shader_wgsl;
 // CNN v3 G-buffer shaders
 extern const char* gbuf_raster_wgsl;
 extern const char* gbuf_pack_wgsl;
+extern const char* gbuf_shadow_wgsl;
 extern const char* gbuf_view_wgsl;
 
 // CNN v3 inference shaders
diff --git a/tools/seq_compiler.py b/tools/seq_compiler.py
index 09188a5..2d802b2 100755
--- a/tools/seq_compiler.py
+++ b/tools/seq_compiler.py
@@ -397,8 +397,9 @@ def generate_cpp(seq: SequenceDecl, sorted_effects: List[EffectDecl],
 # Use a full #include string to override the path entirely.
 CLASS_TO_HEADER = {
     'NtscYiq': 'ntsc',
-    'GBufferEffect': '#include "../../cnn_v3/src/gbuffer_effect.h"',
-    'CNNv3Effect': '#include "../../cnn_v3/src/cnn_v3_effect.h"',
+    'GBufferEffect': '#include "../../cnn_v3/src/gbuffer_effect.h"',
+    'CNNv3Effect': '#include "../../cnn_v3/src/cnn_v3_effect.h"',
+    'GBufViewEffect': '#include "../../cnn_v3/src/gbuf_view_effect.h"',
 }
 includes = set()
 for effect in seq.effects:
diff --git a/workspaces/main/assets.txt b/workspaces/main/assets.txt
index f22552b..3045ab6 100644
--- a/workspaces/main/assets.txt
+++ b/workspaces/main/assets.txt
@@ -102,6 +102,7 @@ SHADER_RENDER_NTSC_COMMON, WGSL, ../../src/shaders/render/ntsc_common.wgsl, "NTS
 SHADER_GBUF_RASTER, WGSL, ../../cnn_v3/shaders/gbuf_raster.wgsl, "CNN v3 G-buffer MRT rasterization shader"
 SHADER_GBUF_PACK, WGSL, ../../cnn_v3/shaders/gbuf_pack.wgsl, "CNN v3 G-buffer feature pack compute shader"
 SHADER_GBUF_VIEW, WGSL, ../../cnn_v3/shaders/gbuf_view.wgsl, "CNN v3 G-buffer channel visualization (4x5 grid)"
+SHADER_GBUF_SHADOW, WGSL, ../../cnn_v3/shaders/gbuf_shadow.wgsl, "CNN v3 G-buffer SDF shadow raymarching pass"
 
 # --- CNN v3 Inference ---
 SHADER_CNN_V3_COMMON, WGSL, ../../cnn_v3/shaders/cnn_v3_common.wgsl, "CNN v3 shared helpers snippet (get_w, unpack_8ch)"
diff --git a/workspaces/main/timeline.seq b/workspaces/main/timeline.seq
index 557e73d..0e1ca74 100644
--- a/workspaces/main/timeline.seq
+++ b/workspaces/main/timeline.seq
@@ -13,10 +13,8 @@ SEQUENCE 8.00 0 "scene 2"
 SEQUENCE 12.00 0 "cnn_v3_test"
   NODE gbuf_feat0 gbuf_rgba32uint
   NODE gbuf_feat1 gbuf_rgba32uint
-  NODE cnn_v3_out gbuf_albedo
   EFFECT + GBufferEffect source -> gbuf_feat0 gbuf_feat1 0.00 8.00
-  EFFECT + CNNv3Effect gbuf_feat0 gbuf_feat1 -> cnn_v3_out 0.00 8.00
-  EFFECT + Passthrough cnn_v3_out -> sink 0.00 8.00
+  EFFECT + GBufViewEffect gbuf_feat0 gbuf_feat1 -> sink 0.00 8.00
 
 SEQUENCE 20.00 2 "hybrid_heptagon"
   # Heptagon -> Hybrid3D -> sink
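Editor's note: the VIZ_U32 shader in the diff above relies on WGSL's `unpack2x16float` builtin to split each u32 component of an rgba32uint texel into two f16 values, giving 8 channels per texel. As a sketch of what that decode does, here is a plain-JavaScript reimplementation for illustration only (`halfToFloat` and `unpack8ch` are hypothetical helper names, not part of the codebase):

```javascript
// Decode one IEEE-754 half (binary16) bit pattern to a JS number.
// Mirrors what WGSL's unpack2x16float does per 16-bit lane.
function halfToFloat(h) {
  const sign = (h & 0x8000) ? -1 : 1;
  const exp = (h >> 10) & 0x1f;
  const frac = h & 0x3ff;
  if (exp === 0) return sign * frac * Math.pow(2, -24);    // zero / subnormal
  if (exp === 0x1f) return frac ? NaN : sign * Infinity;   // inf / NaN
  return sign * (1 + frac / 1024) * Math.pow(2, exp - 15); // normal
}

// Split one u32 into its low/high f16 lanes, like WGSL unpack2x16float.
function unpack2x16float(u) {
  return [halfToFloat(u & 0xffff), halfToFloat((u >>> 16) & 0xffff)];
}

// An rgba32uint texel carries 8 f16 channels: [r.lo, r.hi, g.lo, g.hi, ...],
// matching the array the fragment shader indexes with min(u.ch, 7u).
function unpack8ch(r, g, b, a) {
  return [...unpack2x16float(r), ...unpack2x16float(g),
          ...unpack2x16float(b), ...unpack2x16float(a)];
}

// 0x3c00 is 1.0 in f16; 0x3800 is 0.5.
console.log(unpack2x16float(0x38003c00)); // [1, 0.5]
```

This is also why the pack side needs exactly two rgba32uint feature textures for 20 channels: each texture holds 8 half-float channels in 16 bytes per pixel.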

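Editor's note: the `zoomChannel` change in tester.js writes a 16-byte uniform (`ch` at byte offset 0, 4 bytes of padding, `ox`/`oy` at 8/12, little-endian) and derives the view offset so the clicked texel lands at the center of the destination canvas. A minimal standalone sketch of that packing — `makeVizUniform` is a hypothetical helper invented here, not a function in tester.js:

```javascript
// Build the 16-byte uniform matching the viz shaders'
// `struct Vu{ch:u32,_p:u32,ox:i32,oy:i32}` — same offsets and endianness
// as the DataView writes in zoomChannel().
function makeVizUniform(ch, clickTx, clickTy, dstW, dstH) {
  // Center the zoom on the clicked texel: the view's top-left texel is the
  // click position minus half the destination canvas size. ox/oy may go
  // negative; the shader clamps sample coordinates to the texture bounds.
  const ox = clickTx - Math.floor(dstW / 2);
  const oy = clickTy - Math.floor(dstH / 2);
  const buf = new ArrayBuffer(16);
  const view = new DataView(buf);
  view.setUint32(0, ch, true);  // channel index (offset 0)
  view.setInt32(8, ox, true);   // texel offset x (offset 8, after padding)
  view.setInt32(12, oy, true);  // texel offset y (offset 12)
  return buf;
}

// A 512x512 zoom canvas centered on texel (100, 40) of channel 3:
const u = new DataView(makeVizUniform(3, 100, 40, 512, 512));
console.log(u.getUint32(0, true), u.getInt32(8, true), u.getInt32(12, true));
// → 3 -156 -216
```

The explicit 4-byte hole at offset 4 keeps `ox`/`oy` 8-byte-aligned as a pair and pads the struct to the 16-byte uniform size the pipeline allocates (`size:16` in `createBuffer`).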