# Analysis: Variable Tempo / Unified Music Time

## Current Architecture Analysis

### 1. BPM Dependencies (Problems Found)

#### Location 1: `TrackerScore` (tracker.h:32)
```cpp
struct TrackerScore {
  const TrackerPatternTrigger* triggers;
  uint32_t num_triggers;
  float bpm;  // ❌ FIXED BPM - Cannot change dynamically
};
```

#### Location 2: `tracker_update()` (tracker.cc:57)
```cpp
float beat_to_sec = 60.0f / g_tracker_score.bpm;  // ❌ Used to convert beats→seconds
```

**Critical Issue (tracker.cc:84):**
```cpp
int frame_offset = (int)(event.beat * beat_to_sec * 32000.0f / DCT_SIZE);
```

**Problem:** The BPM is **baked into the spectrogram** at pattern trigger time!
- Events within a pattern are composited once with fixed spacing
- The resulting spectrogram frames encode the tempo
- Once triggered, the pattern plays at the encoded tempo
- **Cannot speed up/slow down dynamically**

#### Location 3: `main.cc` (lines 99, 166)
```cpp
// Bass pattern timing
if (t - last_beat_time > (60.0f / g_tracker_score.bpm) / 2.0) {  // ❌ BPM dependency

// Visual beat calculation
float beat = fmodf((float)current_time * g_tracker_score.bpm / 60.0f, 1.0f);  // ❌ BPM dependency
```

---

## Current Time Flow

```
Physical Time (hardware clock)
    ↓
platform_state.time (real seconds elapsed)
    ↓
current_time = platform_state.time + seek_offset
    ↓
tracker_update(current_time)  ← checks if patterns should trigger
    ↓
Pattern compositing: event.beat * (60.0 / BPM) * 32kHz
    ↓
Spectrogram generated with FIXED frame spacing
    ↓
synth_trigger_voice() → registers spectrogram
    ↓
Audio callback @ 32kHz pulls samples via synth_render()
    ↓
Voice advances through spectral frames at FIXED rate (one frame per DCT_SIZE samples)
```

**The Fundamental Problem:**
- Music time = Physical time (1:1 mapping)
- Patterns are pre-rendered at fixed tempo
- Synth playback rate is locked to 32kHz sample clock

---

## What Needs to Change

### Architecture Requirements

1. **Separate Music Time from Physical Time**
   - Music time advances independently: `music_time += dt * tempo_scale`
   - `tempo_scale = 1.0` → normal speed
   - `tempo_scale = 2.0` → double speed
   - `tempo_scale = 0.5` → half speed

2. **Remove BPM from Pattern Compositing**
   - Events should use abstract "beat" units, not seconds
   - Pattern generation should NOT convert beats→frames using BPM
   - Keep events in "music time" space

3. **Variable Playback Rate in Synth**
   - Synth must support dynamic playback speed
   - As tempo changes, synth advances through spectral frames faster/slower
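
The first requirement reduces to one tiny piece of state. A minimal sketch (the helper name `advance_music_time` is an assumption for illustration, not an existing function):

```cpp
// Music time advances at a rate decoupled from physical time.
// tempo_scale = 1.0 -> normal speed, 2.0 -> double, 0.5 -> half.
float advance_music_time(float music_time, float dt, float tempo_scale) {
  return music_time + dt * tempo_scale;
}
```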

---

## Proposed Solution

### Phase 1: Introduce Unified Music Time

#### Change 1: Update `TrackerScore`
```cpp
struct TrackerScore {
  const TrackerPatternTrigger* triggers;
  uint32_t num_triggers;
  // REMOVE: float bpm;
};
```

#### Change 2: Update `TrackerPatternTrigger`
```cpp
struct TrackerPatternTrigger {
  float music_time;  // Renamed from time_sec, now abstract units
  uint16_t pattern_id;
};
```

#### Change 3: Update `TrackerEvent`
```cpp
struct TrackerEvent {
  float beat;  // Keep as-is, already abstract
  uint16_t sample_id;
  float volume;
  float pan;
};
```

#### Change 4: Update `tracker_update()` signature
```cpp
// OLD:
void tracker_update(float time_sec);

// NEW:
void tracker_update(float music_time);
```

**Note:** Pattern compositing still needs tempo for now (see Phase 2).

---

### Phase 2: Variable Tempo Synth Playback

The hard part: **Dynamic playback speed without pitch shifting.**

#### Option A: Naive Approach (Pitch Shifts - NOT GOOD)
```cpp
// Simply advance through frames faster/slower
v.current_spectral_frame += playback_speed;  // ❌ Changes pitch!
```

**Problem:** Advancing through spectral frames faster/slower changes pitch.
- 2x speed → 2x frequency (octave up)
- 0.5x speed → 0.5x frequency (octave down)
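
The effect can be checked numerically: reading a sine "sample" back at a fractional step of `speed` packs roughly `speed` times as many cycles into the same output window, i.e. raises the pitch proportionally. A standalone sketch, not project code:

```cpp
#include <cmath>

// Count positive-going zero crossings of a sine table read back at a
// given playback speed. Faster read-out yields proportionally more
// cycles in the same number of output samples -> higher pitch.
int count_cycles(float cycles, int table_size, float speed, int out_samples) {
  int crossings = 0;
  float prev = 0.0f;
  for (int i = 1; i <= out_samples; ++i) {
    float pos = i * speed;  // fractional read position into the table
    float s = sinf(6.2831853f * cycles * pos / (float)table_size);
    if (prev < 0.0f && s >= 0.0f) ++crossings;
    prev = s;
  }
  return crossings;
}
```

With a 10-cycle table, doubling the read speed roughly doubles the cycle count heard in the same window.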

#### Option B: Resample Spectrograms (Complex)
- Generate spectrograms at a fixed "reference tempo" (e.g., 120 BPM)
- At playback time, resample to match current tempo
- **Pros:** Preserves pitch
- **Cons:** CPU-intensive, requires interpolation

#### Option C: Time-Stretching (Best Quality)
- Use phase vocoder or similar algorithm
- Stretches/compresses time without changing pitch
- **Pros:** High quality
- **Cons:** Very complex, may exceed 64k budget

#### Option D: Pre-render at Reference Tempo (Recommended for 64k)
**Key Insight:** For a 64k demo, pre-render everything at a fixed internal tempo.

```cpp
// At pattern trigger time:
// 1. Composite pattern at "reference tempo" (e.g., 120 BPM baseline)
int frame_offset = (int)(event.beat * REFERENCE_BEAT_DURATION * 32000.0f / DCT_SIZE);

// 2. Store tempo multiplier with voice
struct Voice {
  float playback_speed;  // NEW: 1.0 = normal, 2.0 = double speed, etc.
  // ... existing fields
};

// 3. In synth_render(), advance at a variable rate:
void synth_render(float* output_buffer, int num_frames) {
  for (Voice& v : voices) {
    // v.spectral_frame_pos is now a float (fractional frame index).
    // Advance at the tempo-adjusted rate: playback_speed frames
    // per DCT_SIZE output samples.
    v.spectral_frame_pos += v.playback_speed * num_frames / (float)DCT_SIZE;

    // Interpolate between the two neighboring frames
    int frame0 = (int)v.spectral_frame_pos;
    int frame1 = frame0 + 1;
    float frac = v.spectral_frame_pos - (float)frame0;

    // Blend spectral data from frame0 and frame1
    for (int j = 0; j < DCT_SIZE; ++j) {
      windowed_frame[j] = lerp(spectral[frame0][j], spectral[frame1][j], frac);
    }
  }
}
```

**Trade-off:** This changes pitch! But for a demo, this might be acceptable.

---

## Sync with Physical Audio Device

### Current System
```
Audio Callback (32kHz fixed)
    ↓
synth_render(buffer, num_frames) ← pulls samples
    ↓
Voices advance: ++buffer_pos, ++current_spectral_frame
    ↓
Fixed playback rate (one spectral frame = DCT_SIZE samples)
```

### With Variable Tempo
```
Audio Callback (32kHz fixed - CANNOT CHANGE)
    ↓
synth_render(buffer, num_frames, current_playback_speed)
    ↓
Voices advance: buffer_pos += playback_speed
    ↓
Variable playback rate (playback_speed can be > 1.0 or < 1.0)
```

**Key Point:** Hardware audio rate (32kHz) is FIXED. We cannot change it.

**Solution:** Adjust how fast we consume spectrogram data:
- `playback_speed = 2.0` → consume frames twice as fast → music plays at double speed
- `playback_speed = 0.5` → consume frames at half rate → music plays at half speed
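
The arithmetic is worth pinning down: with the output rate fixed and one spectral frame normally covering `DCT_SIZE` samples, the wall-clock duration of a pattern scales inversely with `playback_speed`. A hedged sketch (standalone helper, not project code; the 512 used in the example below is an assumed `DCT_SIZE`, not the project's actual value):

```cpp
// Wall-clock seconds needed to play num_spectral_frames frames when
// each frame normally covers dct_size samples at sample_rate Hz and
// the voice consumes frames playback_speed times faster than normal.
float pattern_duration_sec(int num_spectral_frames, int dct_size,
                           float sample_rate, float playback_speed) {
  float normal_sec = num_spectral_frames * dct_size / sample_rate;
  return normal_sec / playback_speed;
}
```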

---

## Implementation Roadmap

### Step 1: Remove BPM from TrackerScore ✅ Ready
- Remove `float bpm` field
- Update tracker_compiler to not generate BPM
- Update tests

### Step 2: Add Global Tempo State
```cpp
// In audio/tracker.h or main.cc
float g_current_tempo = 1.0f;  // Multiplier: 1.0 = normal, 2.0 = double speed

void tracker_set_tempo(float tempo_multiplier);
float tracker_get_tempo();
```

### Step 3: Update tracker_update()
```cpp
void tracker_update(float music_time) {
  // music_time is now abstract, not physical seconds
  // Patterns trigger when music_time >= trigger.music_time
}
```
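
One possible shape for that trigger scan, once triggers are keyed on abstract music time. The names (`Trigger`, `fire_due_triggers`, the cursor argument) are assumptions for illustration; the real function would composite and trigger the pattern where the comment sits:

```cpp
struct Trigger {
  float music_time;   // abstract units, not physical seconds
  unsigned pattern_id;
};

// Fire every trigger whose music_time has been reached; *cursor
// persists between calls so each trigger fires exactly once.
unsigned fire_due_triggers(const Trigger* triggers, unsigned count,
                           unsigned* cursor, float music_time) {
  unsigned fired = 0;
  while (*cursor < count && triggers[*cursor].music_time <= music_time) {
    // real code: composite pattern triggers[*cursor].pattern_id here
    ++*cursor;
    ++fired;
  }
  return fired;
}
```

Because music time only ever moves forward (even at variable rate), a monotonically advancing cursor suffices; no trigger is scanned twice.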

### Step 4: Update Pattern Compositing (Keep Reference Tempo)
```cpp
// Use fixed reference tempo for compositing (e.g., 120 BPM)
const float REFERENCE_BPM = 120.0f;
const float reference_beat_to_sec = 60.0f / REFERENCE_BPM;

int frame_offset = (int)(event.beat * reference_beat_to_sec * 32000.0f / DCT_SIZE);
```
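
As a sanity check on the formula, factoring it into a helper makes the numbers easy to verify. `dct_size` is passed in because its actual value lives in the synth; the 512 below is only an assumed example:

```cpp
// Spectrogram frame offset of an event `beat` beats into a pattern,
// composited at a fixed reference tempo.
int frame_offset_for_beat(float beat, float reference_bpm,
                          float sample_rate, int dct_size) {
  float beat_to_sec = 60.0f / reference_bpm;  // seconds per beat
  return (int)(beat * beat_to_sec * sample_rate / dct_size);
}
```

At 120 BPM reference, one beat is 0.5 s = 16000 samples, so with a 512-sample frame the first beat lands 31 frames in.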

### Step 5: Variable Playback in Synth (Advanced)
```cpp
// Store playback speed with each voice
struct Voice {
  float playback_speed;  // Set when voice is triggered
  float spectral_frame_pos;  // Change to float for interpolation
  // ...
};

// In synth_trigger_voice():
v.playback_speed = g_current_tempo;

// In synth_render():
// Advance with fractional frame increment
v.spectral_frame_pos += v.playback_speed / DCT_SIZE;

// Interpolate spectral data (required for smooth tempo changes)
```
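
The fractional-position blend in step 5 can be isolated and tested on its own. A sketch using `std::vector` as a stand-in for the spectral frame storage (the real synth presumably uses flat arrays):

```cpp
#include <vector>

// Linearly interpolate one spectral bin at a fractional frame position,
// clamping at the last frame so reads past the end stay in bounds.
float sample_spectral_bin(const std::vector<std::vector<float>>& frames,
                          float frame_pos, int bin) {
  int f0 = (int)frame_pos;
  int last = (int)frames.size() - 1;
  if (f0 > last) f0 = last;
  int f1 = f0 < last ? f0 + 1 : last;   // clamp instead of reading past end
  float frac = frame_pos - (float)f0;
  return frames[f0][bin] + frac * (frames[f1][bin] - frames[f0][bin]);
}
```

Clamping at the boundary answers part of open question 3 below: linear interpolation is cheap and continuous, and the clamp keeps the last frame stable while the voice fades out.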

### Step 6: Update Main Loop
```cpp
// In main.cc
static float g_music_time = 0.0f;
static float g_tempo_scale = 1.0f;  // Can be animated!

void update_game_logic(double physical_time) {
  float dt = get_delta_time();

  // Music time advances at variable rate
  g_music_time += dt * g_tempo_scale;

  // Animate tempo (example)
  g_tempo_scale = 1.0f + 0.5f * sinf(physical_time * 0.1f);  // Oscillate 0.5x - 1.5x

  tracker_set_tempo(g_tempo_scale);
  tracker_update(g_music_time);
}
```

---

## Open Questions

1. **Pitch Shifting Acceptable?**
   - If tempo changes, should pitch change too?
   - For demoscene effect: probably YES (classic effect)
   - For "realistic" music: NO (need time-stretching)

2. **Tempo Curve Continuity**
   - Should tempo changes be smoothed (acceleration curves)?
   - Or instant jumps (may cause audio glitches)?

3. **Spectral Frame Interpolation**
   - Linear interpolation sufficient?
   - Or need cubic/sinc for quality?

4. **Tracker Compilation**
   - Should tracker_compiler still know about tempo?
   - Or output pure "beat" units?

---

## Recommendation

**For a 64k demo, I recommend:**

### Minimal Change Approach (Easiest)
1. Keep BPM for compositing (as "reference tempo")
2. Add `g_current_tempo` multiplier
3. Make synth advance through frames at `tempo * playback_speed`
4. Accept pitch shifting as intentional effect

**Changes needed:**
- Add `float g_current_tempo` global
- Update `synth_render()` to use `playback_speed`
- Main loop: `music_time += dt * g_current_tempo`
- Tracker: rename `time_sec` → `music_time` (conceptual only)

**Size impact:** ~100 bytes
**Complexity:** Low
**Enables:** Full variable tempo with pitch shift effect

### Advanced Approach (Better Quality)
- Implement spectral frame interpolation
- Add time-stretching (phase vocoder lite)
- Size impact: +2-3 KB

---

## Conclusion

**Is the code ready?** ❌ NO

**What needs to change?**
1. BPM must become a "reference tempo" for compositing only
2. Add global `tempo_scale` variable
3. Synth must support variable playback speed
4. Main loop must track `music_time` separately from physical time

**Sync with audio device:**
- Hardware rate (32kHz) is FIXED
- We control playback speed by advancing through spectral frames faster/slower
- This inherently changes pitch (unless we add time-stretching)

**Recommended next step:** Implement minimal change approach first, then iterate.