When we place everything with each other, we get both equally rapidly inference and instruction as well as unbounded context!
Mamba, like Flash focus, tries to limit the volume of times we need to go from DRAM to SRAM https://k2spiceshop.com/product/liquid-k2-on-paper-online/