summaryrefslogtreecommitdiff
diff options
context:
space:
mode:
-rw-r--r--doc/CNN_V2.md7
1 files changed, 6 insertions, 1 deletions
diff --git a/doc/CNN_V2.md b/doc/CNN_V2.md
index 6242747..b0aa24c 100644
--- a/doc/CNN_V2.md
+++ b/doc/CNN_V2.md
@@ -119,9 +119,11 @@ Requires quantization-aware training.
```
Layer 0: input RGBD (4D) + static (8D) = 12D → 4 channels (3×3 kernel)
Layer 1: previous (4D) + static (8D) = 12D → 4 channels (3×3 kernel)
-Layer 2: previous (4D) + static (8D) = 12D → 4 channels (3×3 kernel, output)
+Layer 2: previous (4D) + static (8D) = 12D → 4 channels (3×3 kernel, output RGBA)
```
+**Output:** 4 channels (RGBA). Training targets preserve alpha from target images.
+
### Weight Calculations
**Per-layer weights (uniform 12D→4D, 3×3 kernels):**
@@ -256,6 +258,9 @@ learning_rate = 1e-3
batch_size = 16
epochs = 5000
+# Dataset: Input RGB, Target RGBA (preserves alpha channel from image)
+# Model outputs RGBA, loss compares all 4 channels
+
# Training loop (standard PyTorch f32)
for epoch in range(epochs):
for rgb_batch, depth_batch, target_batch in dataloader: