Cloud
6.0
Amazon SageMaker AI Async Inference now supports inline request payloads
SageMaker Async Inference now accepts payloads up to 128 KB directly in the InvokeEndpointAsync request body, eliminating the S3 upload step. This removes a network round-trip, simplifies client code, and reduces operational overhead for asynchronous inference workloads.
Read article →