Brett Jurman
07/22/2021, 10:47 AMBrett Jurman
07/22/2021, 10:48 AMcoiled.create_software_environment(
name="gpu-env4",
container="gpuci/miniconda-cuda:10.2-runtime-ubuntu18.04",
conda={
"channels": ["conda-forge", "defaults", "fastchan"],
"dependencies": [
"python==3.8",
"pytorch",
"torchvision",
"cudatoolkit=10.2",
"prefect",
"fastai",
"scikit-image",
"numpy",
"dask",
"bokeh>=0.13.0",
]
})
Brett Jurman
07/22/2021, 10:49 AMexecutor = DaskExecutor(
cluster_class=coiled.Cluster,
cluster_kwargs={
"software": "gpu-env4",
"shutdown_on_close": False,
"name": "prefect-executor",
"worker_memory": "15 GiB",
"worker_gpu": 1,
"account": "(my account id)"
},
)
Brett Jurman
07/22/2021, 10:50 AMTask 'run_model': Exception encountered during task execution!
Traceback (most recent call last):
File "/opt/conda/envs/coiled/lib/python3.8/site-packages/prefect/engine/task_runner.py", line 861, in get_task_run_state
value = prefect.utilities.executors.run_task_with_timeout(
File "/opt/conda/envs/coiled/lib/python3.8/site-packages/prefect/utilities/executors.py", line 328, in run_task_with_timeout
return task.run(*args, **kwargs) # type: ignore
File "/tmp/ipykernel_29387/1929835160.py", line 61, in run_model
File "/opt/conda/envs/coiled/lib/python3.8/site-packages/torch/nn/modules/module.py", line 673, in to
return self._apply(convert)
File "/opt/conda/envs/coiled/lib/python3.8/site-packages/torch/nn/modules/module.py", line 387, in _apply
module._apply(fn)
File "/opt/conda/envs/coiled/lib/python3.8/site-packages/torch/nn/modules/module.py", line 387, in _apply
module._apply(fn)
File "/opt/conda/envs/coiled/lib/python3.8/site-packages/torch/nn/modules/module.py", line 387, in _apply
module._apply(fn)
File "/opt/conda/envs/coiled/lib/python3.8/site-packages/torch/nn/modules/module.py", line 409, in _apply
param_applied = fn(param)
File "/opt/conda/envs/coiled/lib/python3.8/site-packages/torch/nn/modules/module.py", line 671, in convert
return <http://t.to|t.to>(device, dtype if t.is_floating_point() or t.is_complex() else None, non_blocking)
File "/opt/conda/envs/coiled/lib/python3.8/site-packages/torch/cuda/__init__.py", line 164, in _lazy_init
raise AssertionError("Torch not compiled with CUDA enabled")
AssertionError: Torch not compiled with CUDA enabled
Kevin Kho
Brett Jurman
07/22/2021, 1:43 PMBrett Jurman
07/22/2021, 1:43 PMKevin Kho
dask-cuda
in the software environment also?Brett Jurman
07/22/2021, 2:05 PMKevin Kho
Brett Jurman
07/22/2021, 2:06 PMBrett Jurman
07/22/2021, 2:06 PMBrett Jurman
07/22/2021, 2:07 PMBrett Jurman
07/22/2021, 2:07 PMcoiled.create_software_environment(
name="gpu-env-cuda-dask",
container="gpuci/miniconda-cuda:10.2-runtime-ubuntu18.04",
conda={
"channels": ["conda-forge", "defaults", "fastchan"],
"dependencies": [
"python==3.8",
"pytorch",
"torchvision",
"cudatoolkit=10.2",
"prefect",
"fastai",
"scikit-image",
"numpy",
"dask",
"bokeh>=0.13.0",
"dask-cuda"
]
})
Kevin Kho
Brett Jurman
07/22/2021, 2:12 PMKevin Kho
Brett Jurman
07/22/2021, 3:58 PMBrett Jurman
07/22/2021, 3:58 PMKevin Kho
Kevin Kho
flow.run()
? Is the agent running on the same machine that you develop on?Brett Jurman
07/23/2021, 2:50 PMKevin Kho