From a160cc797afb4291d356bdc0cbcf0f110e3ef8a9 Mon Sep 17 00:00:00 2001 From: skal Date: Thu, 19 Mar 2026 23:11:33 +0100 Subject: docs(cnn_v3): full design doc — U-Net + FiLM architecture plan MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit - CNN_V3.md: complete design document - U-Net enc_channels=[4,8], ~5 KB f16 weights - FiLM conditioning (5D → γ/β per level, CPU-side MLP) - 20-channel feature buffer, 32 bytes/pixel: two rgba32uint textures - feat_tex0: albedo.rgb, normal.xy, depth, depth_grad.xy (f16) - feat_tex1: mat_id, prev.rgb, mip1.rgb, mip2.rgb, shadow, transp (u8) - 4-pass G-buffer: raster MRT + SDF compute + lighting + pack - Per-pixel parity framework: PyTorch / HTML WebGPU / C++ WebGPU (≤1/255) - Training pipelines: Blender full G-buffer + photo-only (channel dropout) - train_cnn_v3_full.sh spec (modelled on v2 script) - HTML tool adaptation plan from cnn_v2/tools/cnn_v2_test/index.html - Binary format v3 header spec - 8-phase ordered implementation checklist - TODO.md: add CNN v3 U-Net+FiLM future task with phases - cnn_v3/README.md: update status to design phase handoff(Gemini): CNN v3 design complete. Phase 0 (stub G-buffer) unblocks all other phases — one compute shader writing feat_tex0+feat_tex1 with synthetic values from the current framebuffer. See cnn_v3/docs/CNN_V3.md Implementation Checklist. --- cnn_v3/README.md | 3 ++- 1 file changed, 2 insertions(+), 1 deletion(-) (limited to 'cnn_v3/README.md') diff --git a/cnn_v3/README.md b/cnn_v3/README.md index fdbf648..a22d823 100644 --- a/cnn_v3/README.md +++ b/cnn_v3/README.md @@ -31,6 +31,7 @@ Add images directly to these directories and commit them. ## Status -**TODO:** Define CNN v3 architecture and feature set. +**Design phase.** Architecture defined, G-buffer prerequisite pending. +See `cnn_v3/docs/CNN_V3.md` for full design. See `cnn_v2/` for reference implementation. -- cgit v1.2.3