Diffstat (limited to 'doc')
| -rw-r--r-- | doc/CNN_V2.md | 10 |
| -rw-r--r-- | doc/CNN_V2_BINARY_FORMAT.md | 2 |
2 files changed, 6 insertions, 6 deletions
diff --git a/doc/CNN_V2.md b/doc/CNN_V2.md
index 2913498..78854ce 100644
--- a/doc/CNN_V2.md
+++ b/doc/CNN_V2.md
@@ -122,12 +122,12 @@ let uv_x = coord.x / resolution.x; // Horizontal position [0,1]
 let uv_y = coord.y / resolution.y; // Vertical position [0,1]
 
 // Slot 6: Multi-frequency position encoding
-let sin10_x = sin(10.0 * uv_x); // Periodic feature (frequency=10)
+let sin20_y = sin(20.0 * uv_y); // Periodic feature (frequency=20, vertical)
 
 // Slot 7: Bias dimension (always 1.0)
 let bias = 1.0; // Learned bias per output channel
 
-// Packed storage: [p0, p1, p2, p3, uv.x, uv.y, sin(10*uv.x), 1.0]
+// Packed storage: [p0, p1, p2, p3, uv.x, uv.y, sin(20*uv.y), 1.0]
 ```
 
 ### Feature Rationale
@@ -136,7 +136,7 @@ let bias = 1.0; // Learned bias per output channel
 |---------|-----------|---------|----------|
 | p0-p3 | 4D | Parametric auxiliary features (mips, gradients, etc.) | Essential |
 | UV coords | 2D | Spatial position awareness | Essential |
-| sin(10\*uv.x) | 1D | Periodic position encoding | Medium |
+| sin(20\*uv.y) | 1D | Periodic position encoding (vertical) | Medium |
 | Bias | 1D | Learned bias (standard NN) | Essential |
 
 **Note:** Input image RGBD (mip 0) fed only to Layer 0. Subsequent layers see static features + previous layer output.
@@ -149,8 +149,8 @@ let bias = 1.0; // Learned bias per output channel
 
 ### Future Feature Extensions
 
-**Option: Replace sin(10\*uv.x) with:**
-- `sin(20*uv.x)` - Higher frequency encoding
+**Option: Additional encodings:**
+- `sin(40*uv.y)` - Higher frequency encoding
 - `gray_mip1` - Multi-scale luminance
 - `dx`, `dy` - Sobel gradients
 - `variance` - Local texture measure
diff --git a/doc/CNN_V2_BINARY_FORMAT.md b/doc/CNN_V2_BINARY_FORMAT.md
index 5239e4b..f718960 100644
--- a/doc/CNN_V2_BINARY_FORMAT.md
+++ b/doc/CNN_V2_BINARY_FORMAT.md
@@ -146,7 +146,7 @@ Not stored in .bin file (computed at runtime):
 4. **p3** - Parametric feature 3 (depth or from mip level)
 5. **UV_X** - Normalized x coordinate [0,1]
 6. **UV_Y** - Normalized y coordinate [0,1]
-7. **sin(10 × UV_X)** - Spatial frequency encoding
+7. **sin(20 × UV_Y)** - Spatial frequency encoding (vertical, frequency=20)
 8. **1.0** - Bias term
 
 **Mip Level Usage (p0-p3):**
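For readers checking the new slot layout, here is a minimal CPU-side sketch in Python of the packed feature vector after this change. It is not code from the repository. The function name `static_features`, the placeholder `p` values, and the choice to divide the raw pixel coordinate by the resolution (with no half-pixel offset) are assumptions made for illustration; only the slot order and the `sin(20 * uv_y)` term come from the diff.

```python
import math

def static_features(x, y, width, height, p=(0.0, 0.0, 0.0, 0.0)):
    """Return the 8 static features for one pixel, in slot order 0..7.

    Hypothetical helper: p holds placeholder values for p0..p3, which the
    real pipeline derives from mips, gradients, depth, and so on.
    """
    uv_x = x / width                 # Slot 4: horizontal position
    uv_y = y / height                # Slot 5: vertical position
    sin20_y = math.sin(20.0 * uv_y)  # Slot 6: vertical encoding, frequency=20
    bias = 1.0                       # Slot 7: constant bias input
    return [p[0], p[1], p[2], p[3], uv_x, uv_y, sin20_y, bias]

# Example pixel in a 256x256 image (values chosen arbitrarily).
features = static_features(x=64, y=128, width=256, height=256)
print(features)
```

Slot 6 now varies along the vertical axis only, so two pixels in the same row get the same encoding value regardless of their x position.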
