Middle East war live: Donald Trump says Iran war will be over ‘very soon’
Logging the memory, it seems like it starts the forward pass, memory starts increasing on GPU 0, then OOMs. I wonder if it’s trying to be smart and planning ahead and dequantizing multiple layers at a time. Dequantizing each layer uses ~36 GB of memory so if it was doing this that could cause it to use too much memory. Maybe if we put each layer on alternating GPU’s it could help.,更多细节参见snipaste
April 3, 2026 · 9392 words,更多细节参见https://telegram官网
2026年4月7日 17:37科技