Deep Learning Studio 11.1 (Linux)

487
8
Jump to solution
10-11-2023 02:51 AM
S6
by Esri Contributor
Esri Contributor

Hi,

We have installed  Image Server 11.1 & Raster Analytics in a NVIDIA DGX machine with 4 GPU cards.

How can we leverage all 4 GPUs while training the model in Deep Learning Studio?

Thanks & regards,

Saranya

0 Kudos
1 Solution

Accepted Solutions
JayChen
Esri Contributor

Hi Saranya,

At 11.1, when it gets to train model using a RA site in an enterprise environment, each image server node within the Raster Analytics site can still use one and only one GPU. Deep Learning Studio will also carry this limitation since it's leveraging the RA site for processing. We are looking to remove this limitation in the future release.

-Jay 

View solution in original post

8 Replies
JayChen
Esri Contributor

Hi Saranya,

At 11.1, when it gets to train model using a RA site in an enterprise environment, each image server node within the Raster Analytics site can still use one and only one GPU. Deep Learning Studio will also carry this limitation since it's leveraging the RA site for processing. We are looking to remove this limitation in the future release.

-Jay 

S6
by Esri Contributor
Esri Contributor
Hi Jay,
Thank you for your prompt response.
Really appreciate it. Do you have a timeline on when this capability will be incorporated?
0 Kudos
JayChen
Esri Contributor

Hi Saranya,

We are working with Python API team on this, starting with supporting multiple GPUs on a single image server node for some of the model types. Unfortunately we don't yet have a clear timeline when this will be available for most of the model types. We do undertsdand that this is essential for some of the customers, especially when it goes to the model training. Once it becomes available, the Deep Learning Studio will natually support it too. Thank you for your patience.

-Jay

0 Kudos
S6
by Esri Contributor
Esri Contributor

Thanks for the update Jay!

0 Kudos
S6
by Esri Contributor
Esri Contributor

Hi,

I have a follow up question.

https://enterprise.arcgis.com/en/image/latest/raster-analytics/configure-and-deploy-arcgis-enterpris...

In this page, it says if I have multiple GPUs, I can run multiple instances. Does this mean if I have 16GPUs, I can run inferences on 16 images in parallel? One image per GPU?

0 Kudos
JayChen
Esri Contributor

Supporting multiple GPUs on your raster analytics node within your raster analytics site is not fully supported for all models yet. `Utilization of multiple GPUs per server node is applicable to some deep learning model configurations predefined in ArcGIS. They include Tensorflow (ObjectDetectionAPI and DeepLab), Keras (MaskRCNN), and PyTorch.`

For supported models, yes the job can be distributed to multiple GPUs on your RA node if there are multiple GPUs available and the Max instances per machine for RA services are set to be more than 1.

0 Kudos
S6
by Esri Contributor
Esri Contributor

Hi JayChen,

Thanks for your reply. Does the same apply for inferencing as well?

0 Kudos
JayChen
Esri Contributor

Yes. This should be on inferencing side.

0 Kudos