summaryrefslogtreecommitdiff
path: root/doc
diff options
context:
space:
mode:
Diffstat (limited to 'doc')
-rw-r--r--doc/CNN_V2.md10
-rw-r--r--doc/CNN_V2_BINARY_FORMAT.md2
2 files changed, 6 insertions, 6 deletions
diff --git a/doc/CNN_V2.md b/doc/CNN_V2.md
index 2913498..78854ce 100644
--- a/doc/CNN_V2.md
+++ b/doc/CNN_V2.md
@@ -122,12 +122,12 @@ let uv_x = coord.x / resolution.x; // Horizontal position [0,1]
let uv_y = coord.y / resolution.y; // Vertical position [0,1]
// Slot 6: Multi-frequency position encoding
-let sin10_x = sin(10.0 * uv_x); // Periodic feature (frequency=10)
+let sin20_y = sin(20.0 * uv_y); // Periodic feature (frequency=20, vertical)
// Slot 7: Bias dimension (always 1.0)
let bias = 1.0; // Learned bias per output channel
-// Packed storage: [p0, p1, p2, p3, uv.x, uv.y, sin(10*uv.x), 1.0]
+// Packed storage: [p0, p1, p2, p3, uv.x, uv.y, sin(20*uv.y), 1.0]
```
### Feature Rationale
@@ -136,7 +136,7 @@ let bias = 1.0; // Learned bias per output channel
|---------|-----------|---------|----------|
| p0-p3 | 4D | Parametric auxiliary features (mips, gradients, etc.) | Essential |
| UV coords | 2D | Spatial position awareness | Essential |
-| sin(10\*uv.x) | 1D | Periodic position encoding | Medium |
+| sin(20\*uv.y) | 1D | Periodic position encoding (vertical) | Medium |
| Bias | 1D | Learned bias (standard NN) | Essential |
**Note:** Input image RGBD (mip 0) fed only to Layer 0. Subsequent layers see static features + previous layer output.
@@ -149,8 +149,8 @@ let bias = 1.0; // Learned bias per output channel
### Future Feature Extensions
-**Option: Replace sin(10\*uv.x) with:**
-- `sin(20*uv.x)` - Higher frequency encoding
+**Option: Additional encodings:**
+- `sin(40*uv.y)` - Higher frequency encoding
- `gray_mip1` - Multi-scale luminance
- `dx`, `dy` - Sobel gradients
- `variance` - Local texture measure
diff --git a/doc/CNN_V2_BINARY_FORMAT.md b/doc/CNN_V2_BINARY_FORMAT.md
index 5239e4b..f718960 100644
--- a/doc/CNN_V2_BINARY_FORMAT.md
+++ b/doc/CNN_V2_BINARY_FORMAT.md
@@ -146,7 +146,7 @@ Not stored in .bin file (computed at runtime):
4. **p3** - Parametric feature 3 (depth or from mip level)
5. **UV_X** - Normalized x coordinate [0,1]
6. **UV_Y** - Normalized y coordinate [0,1]
-7. **sin(10 × UV_X)** - Spatial frequency encoding
+7. **sin(20 × UV_Y)** - Spatial frequency encoding (vertical, frequency=20)
8. **1.0** - Bias term
**Mip Level Usage (p0-p3):**