Posts
Weight sharing is the inverse of MoE
june 22 , 2024
Building a deep learning rig | part-2
february 22 , 2024
Building a deep learning rig | part-1
february 03 , 2024
What I don’t like about chains of thoughts and why language is a bottleneck to efficient reasoning
May 20, 2023