Xpod
Apr 7, 2026 · 0:20
Three days ago I left autoresearch tuning nanochat for ~2 days on a depth=12 model. It found ~20 changes that improved the validation loss. I tested these changes yesterday and all of them were additive and transferred to larger (depth=24) models. Stacking up all of these changes, https://t.co/j34dSt4oht