Exposes DDP training capabilities using the Sagemaker Distributed Data Parallel Library.

Data Ingestion

Three ways to get files into Sagemaker:

ApproachesDescription
File ModeSimplest option; read file from S3. Downloads dataset into instance memory
Fast File ModeStreamed into instance in real-time
EFS
FsX Lustre

When using Fast File Mode, specifying a distribution allows Sagemaker to know which partition of the training file to use for each node.