So, does this snapshotting optimization support arbitrary containers?<p>I&#x27;m currently planning to deploy using Amazon SageMaker, but a cold start takes a whopping ~9 minutes: 6 minutes for instance provisioning + 3 minutes for PyTorch initialization. My Docker image is ~14 GB, and the weights are a few GB.
How long would it take to cold start this configuration on 
Modal?<p>SageMaker&#x27;s performance makes it pretty much useless without many warm instances around (= tens of thousands of dollars per month), because users won&#x27;t be happy if they have to randomly wait 9 minutes

Put differently, 1&#x2F;40 is not the same as 1x - 40x.  I’d phrase as Reduced by 97.5% or 0.975x

probably just AI slop and using wrong semantics, they mean speedup ratio.

How can you cut latency by more than 1x? I am no intending to be snarky, it just doesn’t fit my brain how you can reduce a measure time by more than the original starting time.

Cutting latencies by 40x! Unfortunately couldn&#x27;t fit the whole title in the character limit :&lt;

What is &quot;cutting by 40x&quot; supposed to mean?