New paper on speculative decoding cuts inference latency by 40 percent · r/founders · BusellAI