ML Training

Join the conversation on this Module

I am having the following error and need a fix:

2021-04-27 10:08:13 [ADK:INFO] Initializing
2021-04-27 10:08:13 [ADK:INFO] Found module’s inputs to be {‘WFE_output_params_file’: ‘wfe_module_params_2_2.json’, ‘label_zip’: ‘/input/module_1_1/StackLabel.tif’, ‘n_estimators’: 50, ‘train_zip’: ‘/input/module_0_0/train_stack.tiff’}
2021-04-27 10:08:14.969379: W tensorflow/stream_executor/platform/default/dso_loader.cc:59] Could not load dynamic library ‘libcudart.so.10.1’; dlerror: libcudart.so.10.1: cannot open shared object file: No such file or directory
2021-04-27 10:08:14.969430: I tensorflow/stream_executor/cuda/cudart_stub.cc:29] Ignore above cudart dlerror if you do not have a GPU set up on your machine.
2021-04-27 10:08:16 [ADK:INFO] Outputs will be written to /output/wfe_module_params_2_2.json
Traceback (most recent call last):
File “/usr/src/app/trainer.py”, line 114, in main
assert train_stack.shape == segment_stack.shape, “Dimensions of Train Stack and Label Stack should be same”
AssertionError: Dimensions of Train Stack and Label Stack should be same

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
File “./apeer_main.py”, line 10, in
model_output = trainer.main(inputs[“train_zip”], inputs[“label_zip”] ,n_estimators=inputs[“n_estimators”])
File “/usr/src/app/trainer.py”, line 118, in main
assert train_stack.shape == segment_stack.shape, “Dimensions of Train Stack and Label Stack should be same”
AssertionError: Dimensions of Train Stack and Label Stack should be same