
Feature/gemma 4 inference ahan moment #11

Merged
solderzzc merged 15 commits into main from feature/gemma-4-inference-ahan-moment
Apr 4, 2026

@solderzzc
Member

No description provided.

solderzzc added 12 commits April 3, 2026 22:29
- Added Aha moment documentation to README
- Stored resolved journal.md discussing SSD temporal block streaming
- Synced mlx-swift-lm submodule containing the Gemma 4 pipeline fixes

When a repeated prompt matched 100% of the cached tokens, the remaining
token slice was empty (0 tokens). Passing this empty slice to the model
caused the fatal error '[reshape] Cannot infer the shape of an empty array'.

Fix: replay the last cached token (trimming the KV cache back by 1) so the
model always receives at least 1 token from which to produce next-token logits.
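
The fix described above can be illustrated with a minimal, language-neutral sketch in Python. This is not the code from this PR or from mlx-swift-lm: `ToyKVCache`, `common_prefix_len`, and `plan_prefill` are hypothetical names invented for illustration, and the cache is modeled as a plain list with one entry per processed token. It only shows the idea of trimming the cache back by one entry on a full cache hit, so that the model is never handed an empty token slice.

```python
from dataclasses import dataclass, field


@dataclass
class ToyKVCache:
    """Hypothetical stand-in for a KV cache: one entry per processed token."""
    tokens: list[int] = field(default_factory=list)

    def trim(self, n: int) -> None:
        """Drop the last n cached entries (no-op when n <= 0)."""
        if n > 0:
            del self.tokens[-n:]


def common_prefix_len(a: list[int], b: list[int]) -> int:
    """Number of leading tokens shared by a and b."""
    n = 0
    for x, y in zip(a, b):
        if x != y:
            break
        n += 1
    return n


def plan_prefill(prompt: list[int], cache: ToyKVCache) -> list[int]:
    """Return the tokens the model still has to process.

    Assumption: the model needs at least one input token to produce
    next-token logits, so the result is non-empty for any non-empty prompt.
    """
    matched = common_prefix_len(prompt, cache.tokens)
    # Discard cached entries that do not belong to the new prompt's prefix.
    cache.trim(len(cache.tokens) - matched)
    remaining = prompt[matched:]
    if not remaining and matched > 0:
        # Full cache hit: trim the cache back by one entry and replay the
        # last prompt token instead of passing an empty slice to the model.
        cache.trim(1)
        remaining = prompt[-1:]
    return remaining


if __name__ == "__main__":
    cache = ToyKVCache([1, 2, 3])
    print(plan_prefill([1, 2, 3], cache), cache.tokens)
```

In this sketch a partial match behaves as before (only the unmatched suffix is returned), while a 100% match returns the final prompt token and leaves the cache one entry shorter, so replaying that token restores the cache to its full length.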
@solderzzc
Member Author

This PR should address #10

@solderzzc merged commit c59f6a1 into main on Apr 4, 2026
4 checks passed
@solderzzc deleted the feature/gemma-4-inference-ahan-moment branch on April 4, 2026 23:26