doc/AUDIO_TIMING_ARCHITECTURE.md


1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452

# Audio Timing Architecture - Proper Solution (February 7, 2026)

## Problem Statement

**Original Issue:** "demo is still flashing a lot" due to audio-visual timing mismatch.

**Root Causes:**
1. Multiple time sources with no clear hierarchy
2. Hardcoded latency constants (400ms) in solution proposals
3. Beat calculation using wrong time source
4. Peak decay rate not matched to music tempo

---

## Correct Architecture ✅

### Single Source of Truth: Physical Clock

```cpp
platform_get_time() → ONE authoritative wall clock from OS
```

**Everything else derives from this:**

```
Physical Time (platform_get_time())
    ↓
┌────────────────────────────────────────────────┐
│ Audio System tracks its own state:            │
│  • audio_get_playback_time()                  │
│    → Based on ring buffer samples consumed    │
│    → Automatically accounts for buffering     │
│    → NO hardcoded constants!                  │
└────────────────────────────────────────────────┘
    ↓
┌────────────────────────────────────────────────┐
│ Music Time (tracker time)                     │
│  • Derived from audio playback time           │
│  • Scaled by tempo_scale                      │
│  • Used by tracker for event triggering       │
└────────────────────────────────────────────────┘
```

### Time Sources Summary

| Time Source | Purpose | How to Get | Use For |
|-------------|---------|------------|---------|
| **Physical Time** | Wall clock, frame deltas | `platform_get_time()` | dt calculations, physics |
| **Audio Playback Time** | What's being HEARD | `audio_get_playback_time()` | Audio-visual sync, beat display |
| **Music Time** | Tracker time (tempo-scaled) | `g_music_time` | Tracker event triggering |

---

## Implementation: test_demo.cc

### Before (Wrong ❌)

```cpp
const double current_time = platform_state.time;  // Physical time

// Beat calculation based on physical time
const float beat_time = (float)current_time * 120.0f / 60.0f;

// But peak is measured at audio playback time (400ms behind!)
const float raw_peak = audio_get_realtime_peak();

// MISMATCH: beat and peak are from different time sources!
```

**Problem:** Visual beat shows beat 2 (physical time), but peak shows beat 1 (audio playback time).

### After (Correct ✅)

```cpp
const double physical_time = platform_state.time;  // For dt calculations

// Audio playback time: what's being HEARD right now
const float audio_time = audio_get_playback_time();

// Beat calculation uses AUDIO TIME (matches peak measurement)
const float beat_time = audio_time * 120.0f / 60.0f;

// Peak is measured at audio playback time
const float raw_peak = audio_get_realtime_peak();

// SYNCHRONIZED: beat and peak are from same time source!
```

**Result:** Visual beat shows beat 1, peak shows beat 1 → synchronized! ✅

---

## How audio_get_playback_time() Works

**Implementation** (audio.cc:169-173):
```cpp
float audio_get_playback_time() {
  const int64_t total_samples = g_ring_buffer.get_total_read();
  return (float)total_samples / (RING_BUFFER_SAMPLE_RATE * RING_BUFFER_CHANNELS);
}
```

**Key Points:**
1. **Tracks samples consumed** by audio callback (not samples rendered)
2. **Automatically accounts for ring buffer latency** (no hardcoded constants!)
3. **Self-consistent** with `audio_get_realtime_peak()` (measured at same moment)

**Example Timeline:**
```
T(physical) = 1.00s:
  → Ring buffer has 400ms of lookahead
  → Audio callback is playing samples rendered at T=0.60s (music time)
  → total_read counter reflects 0.60s worth of samples
  → audio_get_playback_time() returns 0.60s
  → audio_get_realtime_peak() measured from samples at 0.60s
  → Beat calculation: 0.60s * 2 = 1.2 → beat 1
  → SYNCHRONIZED! ✅
```

---

## Remaining Issues: Data-Driven Configuration

### Issue #1: Hardcoded Decay Rate

**Current** (miniaudio_backend.cc:166):
```cpp
realtime_peak_ *= 0.5f;  // Hardcoded: 50% per callback
```

**Problem:** Decay rate should match music tempo, not be hardcoded!

**Proposed Solution:**
```cpp
// AudioBackend should query decay rate from audio system:
float decay_rate = audio_get_peak_decay_rate();  // Returns BPM-adjusted rate
realtime_peak_ *= decay_rate;
```

**How to calculate:**
```cpp
// In audio system (based on current BPM):
float audio_get_peak_decay_rate() {
  const float bpm = tracker_get_bpm();  // e.g., 120
  const float beat_interval = 60.0f / bpm;  // e.g., 0.5s
  const float callback_interval = 0.128f;  // Measured from device

  // Decay to 10% within one beat:
  // decay_rate^(beat_interval / callback_interval) = 0.1
  // decay_rate = 0.1^(callback_interval / beat_interval)

  const float num_callbacks_per_beat = beat_interval / callback_interval;
  return powf(0.1f, 1.0f / num_callbacks_per_beat);
}
```

**Result:** At 120 BPM, decay to 10% in 0.5s (1 beat). At 60 BPM, decay to 10% in 1.0s (1 beat). Adapts automatically!

---

### Issue #2: Hardcoded BPM

**Current** (test_demo.cc:305):
```cpp
const float beat_time = audio_time * 120.0f / 60.0f;  // Hardcoded BPM
```

**Problem:** BPM should come from tracker/music data!

**Proposed Solution:**
```cpp
// Tracker should expose BPM:
const float bpm = tracker_get_bpm();  // From TrackerScore
const float beat_time = audio_time * bpm / 60.0f;

// Or even better, tracker calculates beat directly:
const float beat = tracker_get_current_beat(audio_time);
```

**Implementation:**
```cpp
// In tracker.h/cc:
float tracker_get_bpm() {
  return g_tracker_score.bpm;  // From parsed .track file
}

float tracker_get_current_beat(float audio_time) {
  return audio_time * (g_tracker_score.bpm / 60.0f);
}
```

**Result:** Change BPM in `.track` file → everything updates automatically!

---

### Issue #3: No API for "What time is it in sequence world?"

**User's Suggestion:**
> "ask the AudioSystem or demo system (MainSequence?) what 'time' it is in the sequence world"

**Current Approach:** Each system tracks its own time independently
- test_demo.cc: Uses `audio_get_playback_time()` directly
- main.cc: Uses `platform_state.time + seek_time`
- MainSequence: Uses `global_time` parameter passed to `render_frame()`

**Problem:** No central "what time should I use?" API

**Proposed API:**
```cpp
// In MainSequence or AudioEngine:
class TimeProvider {
 public:
  // Returns: Current time in "sequence world" (accounting for all latencies)
  float get_current_time() const {
    return audio_get_playback_time();  // Use audio playback time
  }

  // Returns: Current beat (fractional)
  float get_current_beat() const {
    return get_current_time() * (bpm_ / 60.0f);
  }

  // Returns: Current peak (synchronized with current time)
  float get_current_peak() const {
    return audio_get_realtime_peak();  // Already synchronized
  }
};

// Usage in test_demo.cc or main.cc:
const float time = g_time_provider.get_current_time();
const float beat = g_time_provider.get_current_beat();
const float peak = g_time_provider.get_current_peak();

// All guaranteed to be synchronized!
```

**Benefits:**
- Single point of query for all timing
- Hides implementation details (ring buffer, latency, etc.)
- Easy to change timing strategy globally
- Clear contract: "This is the time to use for audio-visual sync"

---

## Next Steps

### Completed ✅
1. ✅ Use `audio_get_playback_time()` instead of physical time for beat calculation
2. ✅ Faster decay rate (0.5 instead of 0.7) to prevent constant flashing
3. ✅ Peak meter visualization to verify timing visually
4. ✅ No hardcoded latency constants (system queries its own state)

### Future Work (Deferred)

#### Task: Add tracker_get_bpm() API
**Purpose:** Read BPM from `.track` file instead of hardcoding in test_demo.cc/main.cc

**Implementation:**
```cpp
// In tracker.h:
float tracker_get_bpm();  // Returns g_tracker_score.bpm

// In tracker.cc:
float tracker_get_bpm() {
  return g_tracker_score.bpm;
}

// Usage in test_demo.cc/main.cc:
const float bpm = tracker_get_bpm();  // Instead of hardcoded 120.0f
const float beat_time = audio_time * (bpm / 60.0f);
```

**Benefits:**
- Change BPM in `.track` file → everything updates automatically
- No hardcoded BPM values in demo code
- Supports variable BPM (future enhancement)

---

#### Task: BPM-Aware Peak Decay Rate
**Purpose:** Calculate decay rate based on current BPM to match beat interval

**Implementation:**
```cpp
// In audio.h:
float audio_get_peak_decay_rate();  // BPM-adjusted decay

// In audio.cc:
float audio_get_peak_decay_rate() {
  const float bpm = tracker_get_bpm();
  const float beat_interval = 60.0f / bpm;  // e.g., 0.5s at 120 BPM
  const float callback_interval = 0.128f;   // Measured from device

  // Decay to 10% within one beat:
  const float n = beat_interval / callback_interval;
  return powf(0.1f, 1.0f / n);
}

// In miniaudio_backend.cc:
realtime_peak_ *= audio_get_peak_decay_rate();  // Instead of hardcoded 0.5f
```

**Benefits:**
- Peak decays in exactly 1 beat (regardless of BPM)
- At 120 BPM: decay = 0.5 (500ms fade)
- At 60 BPM: decay = 0.7 (1000ms fade)
- Adapts automatically to tempo changes

---

#### Task: TimeProvider Class (Architectural)
**Purpose:** Centralize all timing queries with single source of truth

**Design:**
```cpp
// In audio/time_provider.h:
class TimeProvider {
 public:
  TimeProvider();

  // Returns: Current time in "sequence world" (what's being heard)
  float get_current_time() const {
    return audio_get_playback_time();
  }

  // Returns: Current beat (fractional, BPM-aware)
  float get_current_beat() const {
    const float bpm = tracker_get_bpm();
    return get_current_time() * (bpm / 60.0f);
  }

  // Returns: Current peak (synchronized with current time)
  float get_current_peak() const {
    return audio_get_realtime_peak();
  }

  // Returns: Current BPM
  float get_bpm() const {
    return tracker_get_bpm();
  }
};

// Usage in test_demo.cc, main.cc, effects:
extern TimeProvider g_time_provider;  // Global or MainSequence member

const float time = g_time_provider.get_current_time();
const float beat = g_time_provider.get_current_beat();
const float peak = g_time_provider.get_current_peak();

// All guaranteed to be synchronized!
```

**Integration with MainSequence:**
```cpp
class MainSequence {
 public:
  TimeProvider time_provider;

  void render_frame(float global_time, float beat, float peak,
                    float aspect_ratio, WGPUSurface surface) {
    // Effects can query: time_provider.get_current_time() etc.
  }
};
```

**Benefits:**
- Single point of query for all timing
- Hides implementation details (ring buffer, latency)
- Easy to change timing strategy globally
- Clear contract: "This is the time for audio-visual sync"
- No more passing time parameters everywhere

**Migration Path:**
1. Create TimeProvider class
2. Expose as global or MainSequence member
3. Gradually migrate test_demo.cc, main.cc, effects to use it
4. Remove time/beat/peak parameters from render functions
5. Everything queries TimeProvider directly

---

### Design Principles Established

1. ✅ **Single physical clock:** `platform_get_time()` is the only wall clock
2. ✅ **Systems expose their state:** `audio_get_playback_time()` knows its latency
3. ✅ **No hardcoded constants:** System queries its own state dynamically
4. ✅ **Data-driven configuration:** BPM from tracker, decay from BPM (future)
5. ✅ **Synchronized time sources:** Beat and peak from same moment

---

## Testing Verification

### With Peak Meter Visualization

Run `./build/test_demo` and observe:
- ✅ Red bar extends when kicks hit (beats 0, 2, 4, ...)
- ✅ Bar width matches FlashEffect intensity
- ✅ Bar decays before next beat (no constant red bar)
- ✅ Snares show narrower bar width (~50-70%)

### With Peak Logging

Run `./build/test_demo --log-peaks peaks.txt` and verify:
```bash
# Expected pattern (120 BPM, kicks every 1s):
Beat 0 (T=0.0s): High peak (kick)
Beat 1 (T=0.5s): Medium peak (snare)
Beat 2 (T=1.0s): High peak (kick)
Beat 3 (T=1.5s): Medium peak (snare)
...
```

### Console Output

Should show:
```
[AudioT=0.06, Beat=0, Frac=0.13, Peak=1.00]  ← Kick
[AudioT=0.58, Beat=1, Frac=0.15, Peak=0.62]  ← Snare (quieter)
[AudioT=1.09, Beat=2, Frac=0.18, Peak=0.16]  ← Decayed (between beats)
[AudioT=2.62, Beat=5, Frac=0.25, Peak=1.00]  ← Kick
```

**No more constant Peak=1.00 from beat 15 onward!**

---

## Summary

### What We Fixed
1. ✅ **Use audio playback time** instead of physical time for beat calculation
2. ✅ **Faster decay** (0.5 instead of 0.7) to match beat interval
3. ✅ **No hardcoded latency** - system queries its own state

### What Still Needs Improvement
1. ⚠️ **BPM should come from tracker** (not hardcoded 120)
2. ⚠️ **Decay rate should be calculated from BPM** (not hardcoded 0.5)
3. ⚠️ **Centralized TimeProvider** for all timing queries

### Key Insight (User's Contribution)
> "There should be a unique tick-source somewhere, that is the real physical_time. Then, we shouldn't hardcode the constants like 400ms, but really ask the AudioSystem or demo system (MainSequence?) what 'time' it is in the sequence world."

**This is the correct architectural principle!** ✅
- ONE physical clock (platform_get_time)
- Systems expose their own state (audio_get_playback_time)
- No hardcoded constants - query the system
- Data-driven configuration (BPM from tracker)

---

*Created: February 7, 2026*
*Architectural discussion and implementation complete*