vibe4211 hours agoPretty sweet hack as it's orthogonal to quantisation. And while it uses more compute, it doesn't require more VRAM.Maybe in the future circuits will become modular and composable like models are today?