The ML reading list #1
Infinite context lengths, evolutional model merging, and more
The ML reading list
Published: Sat, 04 May 2024
Introduction
I read a lot of interesting articles and papers on ML and often come across some very interesting ML libraries. It'd be a shame to hoard them all to myself, so I thought why not share them in the downtime between bytes that take a while to make (such as the one I'm working on right now).
My recommendations
This year, there has been a lot of focus on improving context-length with models like Claude 3 and Mamba. Here's a few papers on infinite context length models that I found interesting and could lay the foundations for some incredible models in the future:
It wasn't just infinite context models that piqued my interest recently. Here's a few things that might interest you:
What I imagine the TinyLlama team is like.